:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Gu, Jiawei, Liang, Shangsong
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2506.00396
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments
by: Huang, Jen-tse, et al.
Published: (2024)

D-SCoRE: Document-Centric Segmentation and CoT Reasoning with Structured Export for QA-CoT Data Generation
by: Zhou, Weibo, et al.
Published: (2025)

On the Decision-Making Abilities in Role-Playing using Large Language Models
by: Shen, Chenglei, et al.
Published: (2024)

CLEX: Continuous Length Extrapolation for Large Language Models
by: Chen, Guanzheng, et al.
Published: (2023)

S2J: Bridging the Gap Between Solving and Judging Ability in Generative Reward Models
by: Sun, Shaoning, et al.
Published: (2025)

Effective Distillation of Table-based Reasoning Ability from LLMs
by: Yang, Bohao, et al.
Published: (2023)

Towards Cost-Effective Reward Guided Text Generation
by: Rashid, Ahmad, et al.
Published: (2025)

Cascaded Language Models for Cost-effective Human-AI Decision-Making
by: Fanconi, Claudio, et al.
Published: (2025)

Cognitive Bias in Decision-Making with LLMs
by: Echterhoff, Jessica, et al.
Published: (2024)

Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs
by: Liu, Chris Yuhao, et al.
Published: (2024)

S3D: A Simple and Cost-Effective Self-Speculative Decoding Scheme for Low-Memory GPUs
by: Zhong, Wei, et al.
Published: (2024)

Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning
by: Chen, Yifang, et al.
Published: (2024)

LETToT: Label-Free Evaluation of Large Language Models On Tourism Using Expert Tree-of-Thought
by: Qi, Ruiyan, et al.
Published: (2025)

Intrinsic Mutual Information as a Modulator for Preference Optimization
by: Liao, Peng, et al.
Published: (2026)

Out-of-Vocabulary Sampling Boosts Speculative Decoding
by: Timor, Nadav, et al.
Published: (2025)

Reward-Guided Speculative Decoding for Efficient LLM Reasoning
by: Liao, Baohao, et al.
Published: (2025)

Cost-Aware Diffusion Draft Trees for Speculative Decoding
by: Zhang, Shuai, et al.
Published: (2026)

Evaluating the Bias in LLMs for Surveying Opinion and Decision Making in Healthcare
by: Khaokaew, Yonchanok, et al.
Published: (2025)

Boosting Reward Model with Preference-Conditional Multi-Aspect Synthetic Data Generation
by: Shen, Jiaming, et al.
Published: (2024)

Cost-Effective Hallucination Detection for LLMs
by: Valentin, Simon, et al.
Published: (2024)

Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases
by: Qiu, Pengcheng, et al.
Published: (2025)

Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback
by: Zheng, Qinqing, et al.
Published: (2024)

Harnessing LLMs Explanations to Boost Surrogate Models in Tabular Data Classification
by: Shi, Ruxue, et al.
Published: (2025)

Self-Generated Critiques Boost Reward Modeling for Language Models
by: Yu, Yue, et al.
Published: (2024)

Reward-Shifted Speculative Sampling Is An Efficient Test-Time Weak-to-Strong Aligner
by: Li, Bolian, et al.
Published: (2025)

Haste Makes Waste: Evaluating Planning Abilities of LLMs for Efficient and Feasible Multitasking with Time Constraints Between Actions
by: Wu, Zirui, et al.
Published: (2025)

Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
by: Li, Manling, et al.
Published: (2024)

Don't Ignore Dual Logic Ability of LLMs while Privatizing: A Data-Intensive Analysis in Medical Domain
by: Du, Yanrui, et al.
Published: (2023)

Speculate Deep and Accurate: Lossless and Training-Free Acceleration for Offloaded LLMs via Substitute Speculative Decoding
by: Wang, Pei-Shuo, et al.
Published: (2025)

Cost-Efficient Estimation of General Abilities Across Benchmarks
by: Krumdick, Michael, et al.
Published: (2026)

Atomic Thinking of LLMs: Decoupling and Exploring Mathematical Reasoning Abilities
by: Kuang, Jiayi, et al.
Published: (2025)

Multimodal LLMs as Customized Reward Models for Text-to-Image Generation
by: Zhou, Shijie, et al.
Published: (2025)

Efficient Reasoning for LLMs through Speculative Chain-of-Thought
by: Wang, Jikai, et al.
Published: (2025)

Accelerating Production LLMs with Combined Token/Embedding Speculators
by: Wertheimer, Davis, et al.
Published: (2024)

Discrimination by LLMs: Cross-lingual Bias Assessment and Mitigation in Decision-Making and Summarisation
by: Huijzer, Willem, et al.
Published: (2025)

MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making
by: Kim, Yubin, et al.
Published: (2024)

MGM: Global Understanding of Audience Overlap Graphs for Predicting the Factuality and the Bias of News Media
by: Manzoor, Muhammad Arslan, et al.
Published: (2024)

Balancing Cost and Effectiveness of Synthetic Data Generation Strategies for LLMs
by: Chan, Yung-Chieh, et al.
Published: (2024)

Temperature-Centric Investigation of Speculative Decoding with Knowledge Distillation
by: Ouyang, Siru, et al.
Published: (2024)

Exploring the Sensitivity of LLMs' Decision-Making Capabilities: Insights from Prompt Variation and Hyperparameters
by: Loya, Manikanta, et al.
Published: (2023)