Saved in:
| Main Authors: | Deng, Jie, Liang, Shining, Li, Jun, Li, Hongzhi, Xie, Yutao |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.01472 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Beyond Rejection Sampling: Trajectory Fusion for Scaling Mathematical Reasoning
by: Deng, Jie, et al.
Published: (2026)
by: Deng, Jie, et al.
Published: (2026)
PIKA: Expert-Level Synthetic Datasets for Post-Training Alignment from Scratch
by: Yin, Shangjian, et al.
Published: (2025)
by: Yin, Shangjian, et al.
Published: (2025)
Speech LLMs are Contextual Reasoning Transcribers
by: Deng, Keqi, et al.
Published: (2026)
by: Deng, Keqi, et al.
Published: (2026)
ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning
by: Qiao, Ziqing, et al.
Published: (2025)
by: Qiao, Ziqing, et al.
Published: (2025)
Lost in Stories: Consistency Bugs in Long Story Generation by LLMs
by: Li, Junjie, et al.
Published: (2026)
by: Li, Junjie, et al.
Published: (2026)
ConMax: Confidence-Maximizing Compression for Efficient Chain-of-Thought Reasoning
by: Hu, Minda, et al.
Published: (2026)
by: Hu, Minda, et al.
Published: (2026)
ConRAG: Consensus-Driven Multi-View Retrieval for Multi-Hop Question Answering
by: Zhu, Yikai, et al.
Published: (2026)
by: Zhu, Yikai, et al.
Published: (2026)
Quantifying and Improving the Robustness of Retrieval-Augmented Language Models Against Spurious Features in Grounding Data
by: Yang, Shiping, et al.
Published: (2025)
by: Yang, Shiping, et al.
Published: (2025)
MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads
by: Liu, Weihao, et al.
Published: (2025)
by: Liu, Weihao, et al.
Published: (2025)
T$^2$: An Adaptive Test-Time Scaling Strategy for Contextual Question Answering
by: Zhao, Zhengyi, et al.
Published: (2025)
by: Zhao, Zhengyi, et al.
Published: (2025)
Self-Critique Guided Iterative Reasoning for Multi-hop Question Answering
by: Chu, Zheng, et al.
Published: (2025)
by: Chu, Zheng, et al.
Published: (2025)
ConVerse: Benchmarking Contextual Safety in Agent-to-Agent Conversations
by: Gomaa, Amr, et al.
Published: (2025)
by: Gomaa, Amr, et al.
Published: (2025)
GraphOTTER: Evolving LLM-based Graph Reasoning for Complex Table Question Answering
by: Li, Qianlong, et al.
Published: (2024)
by: Li, Qianlong, et al.
Published: (2024)
ConsPrompt: Exploiting Contrastive Samples for Fewshot Prompt Learning
by: Weng, Jinta, et al.
Published: (2022)
by: Weng, Jinta, et al.
Published: (2022)
Agentic Conversational Search with Contextualized Reasoning via Reinforcement Learning
by: Mo, Fengran, et al.
Published: (2026)
by: Mo, Fengran, et al.
Published: (2026)
HART: Data-Driven Hallucination Attribution and Evidence-Based Tracing for Large Language Models
by: Liang, Shize, et al.
Published: (2026)
by: Liang, Shize, et al.
Published: (2026)
Recent Advances of Foundation Language Models-based Continual Learning: A Survey
by: Yang, Yutao, et al.
Published: (2024)
by: Yang, Yutao, et al.
Published: (2024)
Not All Thoughts are Generated Equal: Efficient LLM Reasoning via Multi-Turn Reinforcement Learning
by: Ning, Yansong, et al.
Published: (2025)
by: Ning, Yansong, et al.
Published: (2025)
Context Reasoner: Incentivizing Reasoning Capability for Contextualized Privacy and Safety Compliance via Reinforcement Learning
by: Hu, Wenbin, et al.
Published: (2025)
by: Hu, Wenbin, et al.
Published: (2025)
Temporal-Aware Heterogeneous Graph Reasoning with Multi-View Fusion for Temporal Question Answering
by: Wen, Wuzhenghong, et al.
Published: (2026)
by: Wen, Wuzhenghong, et al.
Published: (2026)
Consistency-Aware Parameter-Preserving Knowledge Editing Framework for Multi-Hop Question Answering
by: Deng, Lingwen, et al.
Published: (2025)
by: Deng, Lingwen, et al.
Published: (2025)
BELLE: A Bi-Level Multi-Agent Reasoning Framework for Multi-Hop Question Answering
by: Zhang, Taolin, et al.
Published: (2025)
by: Zhang, Taolin, et al.
Published: (2025)
Self-Improvement Programming for Temporal Knowledge Graph Question Answering
by: Chen, Zhuo, et al.
Published: (2024)
by: Chen, Zhuo, et al.
Published: (2024)
The Essence of Contextual Understanding in Theory of Mind: A Study on Question Answering with Story Characters
by: Zhou, Chulun, et al.
Published: (2025)
by: Zhou, Chulun, et al.
Published: (2025)
REA-RL: Reflection-Aware Online Reinforcement Learning for Efficient Reasoning
by: Deng, Hexuan, et al.
Published: (2025)
by: Deng, Hexuan, et al.
Published: (2025)
Seek and Solve Reasoning for Table Question Answering
by: Jiang, Ruya, et al.
Published: (2024)
by: Jiang, Ruya, et al.
Published: (2024)
Question Classification with Deep Contextualized Transformer
by: Luo, Haozheng, et al.
Published: (2019)
by: Luo, Haozheng, et al.
Published: (2019)
FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making
by: Yu, Yangyang, et al.
Published: (2024)
by: Yu, Yangyang, et al.
Published: (2024)
LaSER: Internalizing Explicit Reasoning into Latent Space for Dense Retrieval
by: Jin, Jiajie, et al.
Published: (2026)
by: Jin, Jiajie, et al.
Published: (2026)
METER: Evaluating Multi-Level Contextual Causal Reasoning in Large Language Models
by: Li, Pengfeng, et al.
Published: (2026)
by: Li, Pengfeng, et al.
Published: (2026)
Gradual Learning: Optimizing Fine-Tuning with Partially Mastered Knowledge in Large Language Models
by: Li, Bozhou, et al.
Published: (2024)
by: Li, Bozhou, et al.
Published: (2024)
ConGA: Guidelines for Contextual Gender Annotation. A Framework for Annotating Gender in Machine Translation
by: Rescigno, Argentina Anna, et al.
Published: (2026)
by: Rescigno, Argentina Anna, et al.
Published: (2026)
Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers
by: Chen, Nuo, et al.
Published: (2023)
by: Chen, Nuo, et al.
Published: (2023)
Neural Probe-Based Hallucination Detection for Large Language Models
by: Liang, Shize, et al.
Published: (2025)
by: Liang, Shize, et al.
Published: (2025)
Comparison of Large Language Models for Generating Contextually Relevant Questions
by: Molina, Ivo Lodovico, et al.
Published: (2024)
by: Molina, Ivo Lodovico, et al.
Published: (2024)
Modeling Contextual Passage Utility for Multihop Question Answering
by: Jain, Akriti, et al.
Published: (2025)
by: Jain, Akriti, et al.
Published: (2025)
AQA: Adaptive Question Answering in a Society of LLMs via Contextual Multi-Armed Bandit
by: Hoveyda, Mohanna, et al.
Published: (2024)
by: Hoveyda, Mohanna, et al.
Published: (2024)
Question Calibration and Multi-Hop Modeling for Temporal Question Answering
by: Xue, Chao, et al.
Published: (2024)
by: Xue, Chao, et al.
Published: (2024)
To Think or Not To Think, That is The Question for Large Reasoning Models in Theory of Mind Tasks
by: Gong, Nanxu, et al.
Published: (2026)
by: Gong, Nanxu, et al.
Published: (2026)
GlobalRAG: Enhancing Global Reasoning in Multi-hop Question Answering via Reinforcement Learning
by: Luo, Jinchang, et al.
Published: (2025)
by: Luo, Jinchang, et al.
Published: (2025)
Similar Items
-
Beyond Rejection Sampling: Trajectory Fusion for Scaling Mathematical Reasoning
by: Deng, Jie, et al.
Published: (2026) -
PIKA: Expert-Level Synthetic Datasets for Post-Training Alignment from Scratch
by: Yin, Shangjian, et al.
Published: (2025) -
Speech LLMs are Contextual Reasoning Transcribers
by: Deng, Keqi, et al.
Published: (2026) -
ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning
by: Qiao, Ziqing, et al.
Published: (2025) -
Lost in Stories: Consistency Bugs in Long Story Generation by LLMs
by: Li, Junjie, et al.
Published: (2026)