Saved in:
| Main Authors: | Gu, Jiawei, Liang, Shangsong |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.00396 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments
by: Huang, Jen-tse, et al.
Published: (2024)
by: Huang, Jen-tse, et al.
Published: (2024)
D-SCoRE: Document-Centric Segmentation and CoT Reasoning with Structured Export for QA-CoT Data Generation
by: Zhou, Weibo, et al.
Published: (2025)
by: Zhou, Weibo, et al.
Published: (2025)
On the Decision-Making Abilities in Role-Playing using Large Language Models
by: Shen, Chenglei, et al.
Published: (2024)
by: Shen, Chenglei, et al.
Published: (2024)
CLEX: Continuous Length Extrapolation for Large Language Models
by: Chen, Guanzheng, et al.
Published: (2023)
by: Chen, Guanzheng, et al.
Published: (2023)
S2J: Bridging the Gap Between Solving and Judging Ability in Generative Reward Models
by: Sun, Shaoning, et al.
Published: (2025)
by: Sun, Shaoning, et al.
Published: (2025)
Effective Distillation of Table-based Reasoning Ability from LLMs
by: Yang, Bohao, et al.
Published: (2023)
by: Yang, Bohao, et al.
Published: (2023)
Towards Cost-Effective Reward Guided Text Generation
by: Rashid, Ahmad, et al.
Published: (2025)
by: Rashid, Ahmad, et al.
Published: (2025)
Cascaded Language Models for Cost-effective Human-AI Decision-Making
by: Fanconi, Claudio, et al.
Published: (2025)
by: Fanconi, Claudio, et al.
Published: (2025)
Cognitive Bias in Decision-Making with LLMs
by: Echterhoff, Jessica, et al.
Published: (2024)
by: Echterhoff, Jessica, et al.
Published: (2024)
Skywork-Reward: Bag of Tricks for Reward Modeling in LLMs
by: Liu, Chris Yuhao, et al.
Published: (2024)
by: Liu, Chris Yuhao, et al.
Published: (2024)
S3D: A Simple and Cost-Effective Self-Speculative Decoding Scheme for Low-Memory GPUs
by: Zhong, Wei, et al.
Published: (2024)
by: Zhong, Wei, et al.
Published: (2024)
Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning
by: Chen, Yifang, et al.
Published: (2024)
by: Chen, Yifang, et al.
Published: (2024)
LETToT: Label-Free Evaluation of Large Language Models On Tourism Using Expert Tree-of-Thought
by: Qi, Ruiyan, et al.
Published: (2025)
by: Qi, Ruiyan, et al.
Published: (2025)
Intrinsic Mutual Information as a Modulator for Preference Optimization
by: Liao, Peng, et al.
Published: (2026)
by: Liao, Peng, et al.
Published: (2026)
Out-of-Vocabulary Sampling Boosts Speculative Decoding
by: Timor, Nadav, et al.
Published: (2025)
by: Timor, Nadav, et al.
Published: (2025)
Reward-Guided Speculative Decoding for Efficient LLM Reasoning
by: Liao, Baohao, et al.
Published: (2025)
by: Liao, Baohao, et al.
Published: (2025)
Cost-Aware Diffusion Draft Trees for Speculative Decoding
by: Zhang, Shuai, et al.
Published: (2026)
by: Zhang, Shuai, et al.
Published: (2026)
Evaluating the Bias in LLMs for Surveying Opinion and Decision Making in Healthcare
by: Khaokaew, Yonchanok, et al.
Published: (2025)
by: Khaokaew, Yonchanok, et al.
Published: (2025)
Boosting Reward Model with Preference-Conditional Multi-Aspect Synthetic Data Generation
by: Shen, Jiaming, et al.
Published: (2024)
by: Shen, Jiaming, et al.
Published: (2024)
Cost-Effective Hallucination Detection for LLMs
by: Valentin, Simon, et al.
Published: (2024)
by: Valentin, Simon, et al.
Published: (2024)
Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases
by: Qiu, Pengcheng, et al.
Published: (2025)
by: Qiu, Pengcheng, et al.
Published: (2025)
Online Intrinsic Rewards for Decision Making Agents from Large Language Model Feedback
by: Zheng, Qinqing, et al.
Published: (2024)
by: Zheng, Qinqing, et al.
Published: (2024)
Harnessing LLMs Explanations to Boost Surrogate Models in Tabular Data Classification
by: Shi, Ruxue, et al.
Published: (2025)
by: Shi, Ruxue, et al.
Published: (2025)
Self-Generated Critiques Boost Reward Modeling for Language Models
by: Yu, Yue, et al.
Published: (2024)
by: Yu, Yue, et al.
Published: (2024)
Reward-Shifted Speculative Sampling Is An Efficient Test-Time Weak-to-Strong Aligner
by: Li, Bolian, et al.
Published: (2025)
by: Li, Bolian, et al.
Published: (2025)
Haste Makes Waste: Evaluating Planning Abilities of LLMs for Efficient and Feasible Multitasking with Time Constraints Between Actions
by: Wu, Zirui, et al.
Published: (2025)
by: Wu, Zirui, et al.
Published: (2025)
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making
by: Li, Manling, et al.
Published: (2024)
by: Li, Manling, et al.
Published: (2024)
Don't Ignore Dual Logic Ability of LLMs while Privatizing: A Data-Intensive Analysis in Medical Domain
by: Du, Yanrui, et al.
Published: (2023)
by: Du, Yanrui, et al.
Published: (2023)
Speculate Deep and Accurate: Lossless and Training-Free Acceleration for Offloaded LLMs via Substitute Speculative Decoding
by: Wang, Pei-Shuo, et al.
Published: (2025)
by: Wang, Pei-Shuo, et al.
Published: (2025)
Cost-Efficient Estimation of General Abilities Across Benchmarks
by: Krumdick, Michael, et al.
Published: (2026)
by: Krumdick, Michael, et al.
Published: (2026)
Atomic Thinking of LLMs: Decoupling and Exploring Mathematical Reasoning Abilities
by: Kuang, Jiayi, et al.
Published: (2025)
by: Kuang, Jiayi, et al.
Published: (2025)
Multimodal LLMs as Customized Reward Models for Text-to-Image Generation
by: Zhou, Shijie, et al.
Published: (2025)
by: Zhou, Shijie, et al.
Published: (2025)
Efficient Reasoning for LLMs through Speculative Chain-of-Thought
by: Wang, Jikai, et al.
Published: (2025)
by: Wang, Jikai, et al.
Published: (2025)
Accelerating Production LLMs with Combined Token/Embedding Speculators
by: Wertheimer, Davis, et al.
Published: (2024)
by: Wertheimer, Davis, et al.
Published: (2024)
Discrimination by LLMs: Cross-lingual Bias Assessment and Mitigation in Decision-Making and Summarisation
by: Huijzer, Willem, et al.
Published: (2025)
by: Huijzer, Willem, et al.
Published: (2025)
MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making
by: Kim, Yubin, et al.
Published: (2024)
by: Kim, Yubin, et al.
Published: (2024)
MGM: Global Understanding of Audience Overlap Graphs for Predicting the Factuality and the Bias of News Media
by: Manzoor, Muhammad Arslan, et al.
Published: (2024)
by: Manzoor, Muhammad Arslan, et al.
Published: (2024)
Balancing Cost and Effectiveness of Synthetic Data Generation Strategies for LLMs
by: Chan, Yung-Chieh, et al.
Published: (2024)
by: Chan, Yung-Chieh, et al.
Published: (2024)
Temperature-Centric Investigation of Speculative Decoding with Knowledge Distillation
by: Ouyang, Siru, et al.
Published: (2024)
by: Ouyang, Siru, et al.
Published: (2024)
Exploring the Sensitivity of LLMs' Decision-Making Capabilities: Insights from Prompt Variation and Hyperparameters
by: Loya, Manikanta, et al.
Published: (2023)
by: Loya, Manikanta, et al.
Published: (2023)
Similar Items
-
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments
by: Huang, Jen-tse, et al.
Published: (2024) -
D-SCoRE: Document-Centric Segmentation and CoT Reasoning with Structured Export for QA-CoT Data Generation
by: Zhou, Weibo, et al.
Published: (2025) -
On the Decision-Making Abilities in Role-Playing using Large Language Models
by: Shen, Chenglei, et al.
Published: (2024) -
CLEX: Continuous Length Extrapolation for Large Language Models
by: Chen, Guanzheng, et al.
Published: (2023) -
S2J: Bridging the Gap Between Solving and Judging Ability in Generative Reward Models
by: Sun, Shaoning, et al.
Published: (2025)