:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Chu, Xu, Tan, Zhijie, Xue, Hanlin, Wang, Guanyu, Mo, Tong, Li, Weiping
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2501.14431
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

GraphSOS: Graph Sampling and Order Selection to Help LLMs Understand Graphs Better
by: Chu, Xu, et al.
Published: (2025)

Qwen Look Again: Guiding Vision-Language Reasoning Models to Re-attention Visual Information
by: Chu, Xu, et al.
Published: (2025)

Towards Order Fairness: Mitigating LLMs Order Sensitivity through Dual Group Advantage Optimization
by: Chu, Xu, et al.
Published: (2026)

Adaptive Spatiotemporal Augmentation for Improving Dynamic Graph Learning
by: Chu, Xu, et al.
Published: (2025)

High-Stakes Personalization: Rethinking LLM Customization for Individual Investor Decision-Making
by: Sawant, Yash Ganpat
Published: (2026)

On the Effectiveness and Generalization of Race Representations for Debiasing High-Stakes Decisions
by: Nguyen, Dang, et al.
Published: (2025)

Explainable LLM Unlearning Through Reasoning
by: Liao, Junfeng, et al.
Published: (2026)

Uncertainty-Aware Answer Selection for Improved Reasoning in Multi-LLM Systems
by: Agrawal, Aakriti, et al.
Published: (2025)

Beyond Sequential Reranking: Reranker-Guided Search Improves Reasoning Intensive Retrieval
by: Xu, Haike, et al.
Published: (2025)

SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution
by: Wang, Hanlin, et al.
Published: (2025)

On the Robustness of Answer Formats in Medical Reasoning Models
by: Taveekitworachai, Pittawat, et al.
Published: (2025)

An Explainable Diagnostic Framework for Neurodegenerative Dementias via Reinforcement-Optimized LLM Reasoning
by: Zamai, Andrew, et al.
Published: (2025)

STeCa: Step-level Trajectory Calibration for LLM Agent Learning
by: Wang, Hanlin, et al.
Published: (2025)

InterrogateLLM: Zero-Resource Hallucination Detection in LLM-Generated Answers
by: Yehuda, Yakir, et al.
Published: (2024)

TemplateRL: Structured Template-Guided Reinforcement Learning for LLM Reasoning
by: Wu, Jinyang, et al.
Published: (2025)

Representation Consistency for Accurate and Coherent LLM Answer Aggregation
by: Jiang, Junqi, et al.
Published: (2025)

Curse of High Dimensionality Issue in Transformer for Long-context Modeling
by: Zhang, Shuhai, et al.
Published: (2025)

Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data
by: Wu, Xue, et al.
Published: (2024)

Human-LLM Hybrid Text Answer Aggregation for Crowd Annotations
by: Li, Jiyi
Published: (2024)

Order Matters: Exploring Order Sensitivity in Multimodal Large Language Models
by: Tan, Zhijie, et al.
Published: (2024)

Scaling Speculative Decoding with Lookahead Reasoning
by: Fu, Yichao, et al.
Published: (2025)

Unlocking the Black Box of Latent Reasoning: An Interpretability-Guided Approach to Intervention
by: Chang, Shuochen, et al.
Published: (2026)

ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge
by: Wang, Zhilin, et al.
Published: (2025)

Label Smoothing Improves Gradient Ascent in LLM Unlearning
by: Pang, Zirui, et al.
Published: (2025)

Probabilistic Soundness Guarantees in LLM Reasoning Chains
by: You, Weiqiu, et al.
Published: (2025)

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
by: Cheng, Zhoujun, et al.
Published: (2025)

RelayLLM: Efficient Reasoning via Collaborative Decoding
by: Huang, Chengsong, et al.
Published: (2026)

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
by: Xiong, Wei, et al.
Published: (2025)

Hi-ZFO: Hierarchical Zeroth- and First-Order LLM Fine-Tuning via Importance-Guided Tensor Selection
by: Jin, Feihu, et al.
Published: (2026)

Measuring and Reducing LLM Hallucination without Gold-Standard Answers
by: Wei, Jiaheng, et al.
Published: (2024)

GCC-Spam: Spam Detection via GAN, Contrastive Learning, and Character Similarity Networks
by: Wang, Zhijie, et al.
Published: (2025)

Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets
by: Younsi, Adam, et al.
Published: (2025)

Beyond Answers: Transferring Reasoning Capabilities to Smaller LLMs Using Multi-Teacher Knowledge Distillation
by: Tian, Yijun, et al.
Published: (2024)

Dual-Uncertainty Guided Policy Learning for Multimodal Reasoning
by: Liu, Rui, et al.
Published: (2025)

CRAFT: Calibrated Reasoning with Answer-Faithful Traces via Reinforcement Learning for Multi-Hop Question Answering
by: Liu, Yu, et al.
Published: (2026)

RM-R1: Reward Modeling as Reasoning
by: Chen, Xiusi, et al.
Published: (2025)

iFairy: the First 2-bit Complex LLM with All Parameters in $\{\pm1, \pm i\}$
by: Wang, Feiyu, et al.
Published: (2025)

RAVR: Reference-Answer-guided Variational Reasoning for Large Language Models
by: Lin, Tianqianjin, et al.
Published: (2025)

Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers
by: Huang, Yixiao, et al.
Published: (2025)

Bridging Internal Probability and Self-Consistency for Effective and Efficient LLM Reasoning
by: Zhou, Zhi, et al.
Published: (2025)