:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Tan, Runyan, Wu, Shuang, Howard, Phillip
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence Computation and Language
Online Access:	https://arxiv.org/abs/2509.23234
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Hyperparameter-Free Approach for Faster Minimum Bayes Risk Decoding
by: Jinnai, Yuu, et al.
Published: (2024)

Hybrid Verified Decoding: Learning to Allocate Verification in Speculative Decoding
by: Su, Xin, et al.
Published: (2026)

Tuning-Free Accountable Intervention for LLM Deployment -- A Metacognitive Approach
by: Tan, Zhen, et al.
Published: (2024)

Is Your Paper Being Reviewed by an LLM? Benchmarking AI Text Detection in Peer Review
by: Yu, Sungduk, et al.
Published: (2025)

Is Your Paper Being Reviewed by an LLM? Investigating AI Text Detectability in Peer Review
by: Yu, Sungduk, et al.
Published: (2024)

Speculative Decoding for Multi-Sample Inference
by: Li, Yiwei, et al.
Published: (2025)

Human-Instruction-Free LLM Self-Alignment with Limited Samples
by: Guo, Hongyi, et al.
Published: (2024)

Robust Planning with Compound LLM Architectures: An LLM-Modulo Approach
by: Gundawar, Atharva, et al.
Published: (2024)

Cultural Awareness in Vision-Language Models: A Cross-Country Exploration
by: Madasu, Avinash, et al.
Published: (2025)

Distribution-Aligned Decoding for Efficient LLM Task Adaptation
by: Hu, Senkang, et al.
Published: (2025)

Token Constraint Decoding Improves Robustness on Question Answering for Large Language Models
by: Yao, Jui-Ming, et al.
Published: (2025)

Cross-Cultural Value Awareness in Large Vision-Language Models
by: Howard, Phillip, et al.
Published: (2026)

Stay Tuned: An Empirical Study of the Impact of Hyperparameters on LLM Tuning in Real-World Applications
by: Halfon, Alon, et al.
Published: (2024)

LLMSYS-HPOBench: Hyperparameter Optimization Benchmark Suite for Real-World LLM Systems
by: Wu, Siyu, et al.
Published: (2026)

Don't Fine-Tune, Decode: Syntax Error-Free Tool Use via Constrained Decoding
by: Zhang, Kexun, et al.
Published: (2023)

LLM-Confidence Reranker: A Training-Free Approach for Enhancing Retrieval-Augmented Generation Systems
by: Song, Zhipeng, et al.
Published: (2026)

Language Ranker: A Lightweight Ranking framework for LLM Decoding
by: Zhang, Chenheng, et al.
Published: (2025)

A Language-Guided Bayesian Optimization for Efficient LoRA Hyperparameter Search
by: Seong-Eun, Baek, et al.
Published: (2026)

A Cascade Dual-Decoder Model for Joint Entity and Relation Extraction
by: Cheng, Jian, et al.
Published: (2021)

Advancing Decoding Strategies: Enhancements in Locally Typical Sampling for LLMs
by: Sen, Jaydip, et al.
Published: (2025)

LycheeDecode: Accelerating Long-Context LLM Inference via Hybrid-Head Sparse Decoding
by: Lin, Gang, et al.
Published: (2026)

Sampling-Efficient Test-Time Scaling: Self-Estimating the Best-of-N Sampling in Early Decoding
by: Wang, Yiming, et al.
Published: (2025)

Goose: Anisotropic Speculation Trees for Training-Free Speculative Decoding
by: Jin, Tao, et al.
Published: (2026)

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
by: Xiong, Wei, et al.
Published: (2025)

Reward-Guided Speculative Decoding for Efficient LLM Reasoning
by: Liao, Baohao, et al.
Published: (2025)

Personalized LLM Decoding via Contrasting Personal Preference
by: Bu, Hyungjune, et al.
Published: (2025)

Expanding Search Space with Diverse Prompting Agents: An Efficient Sampling Approach for LLM Mathematical Reasoning
by: Lee, Gisang, et al.
Published: (2024)

Information-Theoretic Distillation for Reference-less Summarization
by: Jung, Jaehun, et al.
Published: (2024)

OptBA: Optimizing Hyperparameters with the Bees Algorithm for Improved Medical Text Classification
by: Shaaban, Mai A., et al.
Published: (2023)

Entropy-Aware Speculative Decoding Toward Improved LLM Reasoning
by: Su, Tiancheng, et al.
Published: (2025)

Collab: Controlled Decoding using Mixture of Agents for LLM Alignment
by: Chakraborty, Souradip, et al.
Published: (2025)

Copy-as-Decode: Grammar-Constrained Parallel Prefill for LLM Editing
by: Liu, Ziyang
Published: (2026)

Thinking by Subtraction: Confidence-Driven Contrastive Decoding for LLM Reasoning
by: Tang, Lexiang, et al.
Published: (2026)

Reasoning Aware Self-Consistency: Leveraging Reasoning Paths for Efficient LLM Sampling
by: Wan, Guangya, et al.
Published: (2024)

Semantic Voting: A Self-Evaluation-Free Approach for Efficient LLM Self-Improvement on Unverifiable Open-ended Tasks
by: Jiang, Chunyang, et al.
Published: (2025)

C2T: A Classifier-Based Tree Construction Method in Speculative Decoding
by: Huo, Feiye, et al.
Published: (2025)

A Patient-Doctor-NLP-System to contest inequality for less privileged
by: Dikshit, Subrit, et al.
Published: (2025)

Mixture of Decoding: An Attention-Inspired Adaptive Decoding Strategy to Mitigate Hallucinations in Large Vision-Language Models
by: Chen, Xinlong, et al.
Published: (2025)

Contrastive Decoding Mitigates Score Range Bias in LLM-as-a-Judge
by: Fujinuma, Yoshinari
Published: (2025)

Reasoning Is Not Free: Robust Adaptive Cost-Efficient Routing for LLM-as-a-Judge
by: Zhang, Wenbo, et al.
Published: (2026)