Saved in:
| Main Authors: | Ji, Yixin, Li, Juntao, Xiang, Yang, Ye, Hai, Wu, Kaixin, Yao, Kai, Xu, Jia, Mo, Linjian, Zhang, Min |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.02497 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Think Before You Prune: Selective Self-Generated Calibration for Pruning Large Reasoning Models
by: Xiang, Yang, et al.
Published: (2025)
by: Xiang, Yang, et al.
Published: (2025)
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
by: Wu, Kaixin, et al.
Published: (2024)
by: Wu, Kaixin, et al.
Published: (2024)
When Is Thinking Enough? Early Exit via Sufficiency Assessment for Efficient Reasoning
by: Xiang, Yang, et al.
Published: (2026)
by: Xiang, Yang, et al.
Published: (2026)
When to Trust Tools? Adaptive Tool Trust Calibration For Tool-Integrated Math Reasoning
by: Xu, Ruotao, et al.
Published: (2026)
by: Xu, Ruotao, et al.
Published: (2026)
Taming the Titans: A Survey of Efficient LLM Inference Serving
by: Zhen, Ranran, et al.
Published: (2025)
by: Zhen, Ranran, et al.
Published: (2025)
Intuitive or Dependent? Investigating LLMs' Behavior Style to Conflicting Prompts
by: Ying, Jiahao, et al.
Published: (2023)
by: Ying, Jiahao, et al.
Published: (2023)
Adaptive Feature-based Low-Rank Compression of Large Language Models via Bayesian Optimization
by: Ji, Yixin, et al.
Published: (2024)
by: Ji, Yixin, et al.
Published: (2024)
Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation
by: Zhang, Haoran, et al.
Published: (2025)
by: Zhang, Haoran, et al.
Published: (2025)
100 Days After DeepSeek-R1: A Survey on Replication Studies and More Directions for Reasoning Language Models
by: Zhang, Chong, et al.
Published: (2025)
by: Zhang, Chong, et al.
Published: (2025)
Demonstration Augmentation for Zero-shot In-context Learning
by: Su, Yi, et al.
Published: (2024)
by: Su, Yi, et al.
Published: (2024)
MIND Your Reasoning: A Meta-Cognitive Intuitive-Reflective Network for Dual-Reasoning in Multimodal Stance Detection
by: Wang, Bingbing, et al.
Published: (2025)
by: Wang, Bingbing, et al.
Published: (2025)
Unlocking the Power of Multi-Agent LLM for Reasoning: From Lazy Agents to Deliberation
by: Zhang, Zhiwei, et al.
Published: (2025)
by: Zhang, Zhiwei, et al.
Published: (2025)
GradOT: Training-free Gradient-preserving Offsite-tuning for Large Language Models
by: Yao, Kai, et al.
Published: (2025)
by: Yao, Kai, et al.
Published: (2025)
Less Data Less Tokens: Multilingual Unification Learning for Efficient Test-Time Reasoning in LLMs
by: Chen, Kang, et al.
Published: (2025)
by: Chen, Kang, et al.
Published: (2025)
Refine Thought: A Test-Time Inference Method for Embedding Model Reasoning
by: Wang, Guangzhi, et al.
Published: (2025)
by: Wang, Guangzhi, et al.
Published: (2025)
Enhancing Language Model Rationality with Bi-Directional Deliberation Reasoning
by: Zhang, Yadong, et al.
Published: (2024)
by: Zhang, Yadong, et al.
Published: (2024)
Forest-of-Thought: Scaling Test-Time Compute for Enhancing LLM Reasoning
by: Bi, Zhenni, et al.
Published: (2024)
by: Bi, Zhenni, et al.
Published: (2024)
OmniGuard: Unified Omni-Modal Guardrails with Deliberate Reasoning
by: Zhu, Boyu, et al.
Published: (2025)
by: Zhu, Boyu, et al.
Published: (2025)
Beware of Calibration Data for Pruning Large Language Models
by: Ji, Yixin, et al.
Published: (2024)
by: Ji, Yixin, et al.
Published: (2024)
Chronos: Learning Temporal Dynamics of Reasoning Chains for Test-Time Scaling
by: Zhang, Kai, et al.
Published: (2026)
by: Zhang, Kai, et al.
Published: (2026)
From Debate to Deliberation: Structured Collective Reasoning with Typed Epistemic Acts
by: Prakash, Sunil
Published: (2026)
by: Prakash, Sunil
Published: (2026)
Textual Self-attention Network: Test-Time Preference Optimization through Textual Gradient-based Attention
by: Mo, Shibing, et al.
Published: (2025)
by: Mo, Shibing, et al.
Published: (2025)
FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving
by: Chen, Guizhen, et al.
Published: (2025)
by: Chen, Guizhen, et al.
Published: (2025)
BRIEF: Bridging Retrieval and Inference for Multi-hop Reasoning via Compression
by: Li, Yuankai, et al.
Published: (2024)
by: Li, Yuankai, et al.
Published: (2024)
Efficient Reasoning for LLMs through Speculative Chain-of-Thought
by: Wang, Jikai, et al.
Published: (2025)
by: Wang, Jikai, et al.
Published: (2025)
MemReread: Enhancing Agentic Long-Context Reasoning via Memory-Guided Rereading
by: Ji, Baibei, et al.
Published: (2026)
by: Ji, Baibei, et al.
Published: (2026)
Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute
by: Liu, Sheng, et al.
Published: (2025)
by: Liu, Sheng, et al.
Published: (2025)
Chain of Methodologies: Scaling Test Time Computation without Training
by: Liu, Cong, et al.
Published: (2025)
by: Liu, Cong, et al.
Published: (2025)
SAND: Boosting LLM Agents with Self-Taught Action Deliberation
by: Xia, Yu, et al.
Published: (2025)
by: Xia, Yu, et al.
Published: (2025)
Towards Safety Reasoning in LLMs: AI-agentic Deliberation for Policy-embedded CoT Data Creation
by: Kumarage, Tharindu, et al.
Published: (2025)
by: Kumarage, Tharindu, et al.
Published: (2025)
Test-Time Scaling of Reasoning Models for Machine Translation
by: Li, Zihao, et al.
Published: (2025)
by: Li, Zihao, et al.
Published: (2025)
Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures
by: Hu, Yi, et al.
Published: (2026)
by: Hu, Yi, et al.
Published: (2026)
Efficient Inference for Large Reasoning Models: A Survey
by: Liu, Yue, et al.
Published: (2025)
by: Liu, Yue, et al.
Published: (2025)
Multi-Agent Sampling: Scaling Inference Compute for Data Synthesis with Tree Search-Based Agentic Collaboration
by: Ye, Hai, et al.
Published: (2024)
by: Ye, Hai, et al.
Published: (2024)
Deliberate Reasoning in Language Models as Structure-Aware Planning with an Accurate World Model
by: Xiong, Siheng, et al.
Published: (2024)
by: Xiong, Siheng, et al.
Published: (2024)
Quantum-Audit: Evaluating the Reasoning Limits of LLMs on Quantum Computing
by: Afane, Mohamed, et al.
Published: (2026)
by: Afane, Mohamed, et al.
Published: (2026)
Parallel Loop Transformer for Efficient Test-Time Computation Scaling
by: Wu, Bohong, et al.
Published: (2025)
by: Wu, Bohong, et al.
Published: (2025)
Diagnosing Memorization in Chain-of-Thought Reasoning, One Token at a Time
by: Li, Huihan, et al.
Published: (2025)
by: Li, Huihan, et al.
Published: (2025)
DeepCritic: Deliberate Critique with Large Language Models
by: Yang, Wenkai, et al.
Published: (2025)
by: Yang, Wenkai, et al.
Published: (2025)
Beyond Test-Time Compute Strategies: Advocating Energy-per-Token in LLM Inference
by: Wilhelm, Patrick, et al.
Published: (2026)
by: Wilhelm, Patrick, et al.
Published: (2026)
Similar Items
-
Think Before You Prune: Selective Self-Generated Calibration for Pruning Large Reasoning Models
by: Xiang, Yang, et al.
Published: (2025) -
CPRM: A LLM-based Continual Pre-training Framework for Relevance Modeling in Commercial Search
by: Wu, Kaixin, et al.
Published: (2024) -
When Is Thinking Enough? Early Exit via Sufficiency Assessment for Efficient Reasoning
by: Xiang, Yang, et al.
Published: (2026) -
When to Trust Tools? Adaptive Tool Trust Calibration For Tool-Integrated Math Reasoning
by: Xu, Ruotao, et al.
Published: (2026) -
Taming the Titans: A Survey of Efficient LLM Inference Serving
by: Zhen, Ranran, et al.
Published: (2025)