| Main Authors: | Chu, Xu; Tan, Zhijie; Xue, Hanlin; Wang, Guanyu; Mo, Tong; Li, Weiping |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.14431 |
Similar Items
GraphSOS: Graph Sampling and Order Selection to Help LLMs Understand Graphs Better
by: Chu, Xu, et al.
Published: (2025)
Qwen Look Again: Guiding Vision-Language Reasoning Models to Re-attention Visual Information
by: Chu, Xu, et al.
Published: (2025)
Towards Order Fairness: Mitigating LLMs Order Sensitivity through Dual Group Advantage Optimization
by: Chu, Xu, et al.
Published: (2026)
Adaptive Spatiotemporal Augmentation for Improving Dynamic Graph Learning
by: Chu, Xu, et al.
Published: (2025)
High-Stakes Personalization: Rethinking LLM Customization for Individual Investor Decision-Making
by: Sawant, Yash Ganpat
Published: (2026)
On the Effectiveness and Generalization of Race Representations for Debiasing High-Stakes Decisions
by: Nguyen, Dang, et al.
Published: (2025)
Explainable LLM Unlearning Through Reasoning
by: Liao, Junfeng, et al.
Published: (2026)
Uncertainty-Aware Answer Selection for Improved Reasoning in Multi-LLM Systems
by: Agrawal, Aakriti, et al.
Published: (2025)
Beyond Sequential Reranking: Reranker-Guided Search Improves Reasoning Intensive Retrieval
by: Xu, Haike, et al.
Published: (2025)
SPA-RL: Reinforcing LLM Agents via Stepwise Progress Attribution
by: Wang, Hanlin, et al.
Published: (2025)
On the Robustness of Answer Formats in Medical Reasoning Models
by: Taveekitworachai, Pittawat, et al.
Published: (2025)
An Explainable Diagnostic Framework for Neurodegenerative Dementias via Reinforcement-Optimized LLM Reasoning
by: Zamai, Andrew, et al.
Published: (2025)
STeCa: Step-level Trajectory Calibration for LLM Agent Learning
by: Wang, Hanlin, et al.
Published: (2025)
InterrogateLLM: Zero-Resource Hallucination Detection in LLM-Generated Answers
by: Yehuda, Yakir, et al.
Published: (2024)
TemplateRL: Structured Template-Guided Reinforcement Learning for LLM Reasoning
by: Wu, Jinyang, et al.
Published: (2025)
Representation Consistency for Accurate and Coherent LLM Answer Aggregation
by: Jiang, Junqi, et al.
Published: (2025)
Curse of High Dimensionality Issue in Transformer for Long-context Modeling
by: Zhang, Shuhai, et al.
Published: (2025)
Thinking with Knowledge Graphs: Enhancing LLM Reasoning Through Structured Data
by: Wu, Xue, et al.
Published: (2024)
Human-LLM Hybrid Text Answer Aggregation for Crowd Annotations
by: Li, Jiyi
Published: (2024)
Order Matters: Exploring Order Sensitivity in Multimodal Large Language Models
by: Tan, Zhijie, et al.
Published: (2024)
Scaling Speculative Decoding with Lookahead Reasoning
by: Fu, Yichao, et al.
Published: (2025)
Unlocking the Black Box of Latent Reasoning: An Interpretability-Guided Approach to Intervention
by: Chang, Shuochen, et al.
Published: (2026)
ProfBench: Multi-Domain Rubrics requiring Professional Knowledge to Answer and Judge
by: Wang, Zhilin, et al.
Published: (2025)
Label Smoothing Improves Gradient Ascent in LLM Unlearning
by: Pang, Zirui, et al.
Published: (2025)
Probabilistic Soundness Guarantees in LLM Reasoning Chains
by: You, Weiqiu, et al.
Published: (2025)
Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
by: Cheng, Zhoujun, et al.
Published: (2025)
RelayLLM: Efficient Reasoning via Collaborative Decoding
by: Huang, Chengsong, et al.
Published: (2026)
A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
by: Xiong, Wei, et al.
Published: (2025)
Hi-ZFO: Hierarchical Zeroth- and First-Order LLM Fine-Tuning via Importance-Guided Tensor Selection
by: Jin, Feihu, et al.
Published: (2026)
Measuring and Reducing LLM Hallucination without Gold-Standard Answers
by: Wei, Jiaheng, et al.
Published: (2024)
GCC-Spam: Spam Detection via GAN, Contrastive Learning, and Character Similarity Networks
by: Wang, Zhijie, et al.
Published: (2025)
Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets
by: Younsi, Adam, et al.
Published: (2025)
Beyond Answers: Transferring Reasoning Capabilities to Smaller LLMs Using Multi-Teacher Knowledge Distillation
by: Tian, Yijun, et al.
Published: (2024)
Dual-Uncertainty Guided Policy Learning for Multimodal Reasoning
by: Liu, Rui, et al.
Published: (2025)
CRAFT: Calibrated Reasoning with Answer-Faithful Traces via Reinforcement Learning for Multi-Hop Question Answering
by: Liu, Yu, et al.
Published: (2026)
RM-R1: Reward Modeling as Reasoning
by: Chen, Xiusi, et al.
Published: (2025)
iFairy: the First 2-bit Complex LLM with All Parameters in $\{\pm1, \pm i\}$
by: Wang, Feiyu, et al.
Published: (2025)
RAVR: Reference-Answer-guided Variational Reasoning for Large Language Models
by: Lin, Tianqianjin, et al.
Published: (2025)
Generalization or Hallucination? Understanding Out-of-Context Reasoning in Transformers
by: Huang, Yixiao, et al.
Published: (2025)
Bridging Internal Probability and Self-Consistency for Effective and Efficient LLM Reasoning
by: Zhou, Zhi, et al.
Published: (2025)