Saved in:
| Main Authors: | Chen, Zijie, Lin, Zhenghao, Liu, Xiao, Lan, Zhenzhong, Gong, Yeyun, Cheng, Peng |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.08321 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
by: Park, Sungjin, et al.
Published: (2024)
by: Park, Sungjin, et al.
Published: (2024)
Beyond Length: Quantifying Long-Range Information for Long-Context LLM Pretraining Data
by: Deng, Haoran, et al.
Published: (2025)
by: Deng, Haoran, et al.
Published: (2025)
Learning from the Best, Differently: A Diversity-Driven Rethinking on Data Selection
by: He, Hongyi, et al.
Published: (2025)
by: He, Hongyi, et al.
Published: (2025)
Enhancing Large Language Model Performance with Gradient-Based Parameter Selection
by: Li, Haoling, et al.
Published: (2024)
by: Li, Haoling, et al.
Published: (2024)
Process-based Self-Rewarding Language Models
by: Zhang, Shimao, et al.
Published: (2025)
by: Zhang, Shimao, et al.
Published: (2025)
LayerNorm Induces Recency Bias in Transformer Decoders
by: Kim, Junu, et al.
Published: (2025)
by: Kim, Junu, et al.
Published: (2025)
AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators
by: He, Xingwei, et al.
Published: (2023)
by: He, Xingwei, et al.
Published: (2023)
Optimizing Large Language Model Training Using FP4 Quantization
by: Wang, Ruizhe, et al.
Published: (2025)
by: Wang, Ruizhe, et al.
Published: (2025)
Overcoming Vocabulary Mismatch: Vocabulary-agnostic Teacher Guided Language Modeling
by: Shin, Haebin, et al.
Published: (2025)
by: Shin, Haebin, et al.
Published: (2025)
Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph
by: Sun, Jiashuo, et al.
Published: (2023)
by: Sun, Jiashuo, et al.
Published: (2023)
Improving Phishing Email Detection Performance of Small Large Language Models
by: Lin, Zijie, et al.
Published: (2025)
by: Lin, Zijie, et al.
Published: (2025)
DND: Boosting Large Language Models with Dynamic Nested Depth
by: Chen, Tieyuan, et al.
Published: (2025)
by: Chen, Tieyuan, et al.
Published: (2025)
Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning
by: Huang, Yiming, et al.
Published: (2024)
by: Huang, Yiming, et al.
Published: (2024)
QUBE: Enhancing Automatic Heuristic Design via Quality-Uncertainty Balanced Evolution
by: Chen, Zijie, et al.
Published: (2024)
by: Chen, Zijie, et al.
Published: (2024)
Ensuring Safe and High-Quality Outputs: A Guideline Library Approach for Language Models
by: Luo, Yi, et al.
Published: (2024)
by: Luo, Yi, et al.
Published: (2024)
Enhancing Chain-of-Thoughts Prompting with Iterative Bootstrapping in Large Language Models
by: Sun, Jiashuo, et al.
Published: (2023)
by: Sun, Jiashuo, et al.
Published: (2023)
Training LLMs for Divide-and-Conquer Reasoning Elevates Test-Time Scalability
by: Liang, Xiao, et al.
Published: (2026)
by: Liang, Xiao, et al.
Published: (2026)
Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training
by: Luo, Zheheng, et al.
Published: (2024)
by: Luo, Zheheng, et al.
Published: (2024)
Exploring the Mystery of Influential Data for Mathematical Reasoning
by: Ni, Xinzhe, et al.
Published: (2024)
by: Ni, Xinzhe, et al.
Published: (2024)
FinLMM-R1: Enhancing Financial Reasoning in LMM through Scalable Data and Reward Design
by: Lan, Kai, et al.
Published: (2025)
by: Lan, Kai, et al.
Published: (2025)
DeepThink: Aligning Language Models with Domain-Specific User Intents
by: Li, Yang, et al.
Published: (2025)
by: Li, Yang, et al.
Published: (2025)
Enhancing Large Language Model Reasoning with Reward Models: An Analytical Survey
by: Liu, Qiyuan, et al.
Published: (2025)
by: Liu, Qiyuan, et al.
Published: (2025)
Do Large Language Models Truly Grasp Addition? A Rule-Focused Diagnostic Using Two-Integer Arithmetic
by: Yan, Yang, et al.
Published: (2025)
by: Yan, Yang, et al.
Published: (2025)
HiCaM: A Hierarchical-Causal Modification Framework for Long-Form Text Modification
by: Shi, Yuntao, et al.
Published: (2025)
by: Shi, Yuntao, et al.
Published: (2025)
APOLLO: An Optimized Training Approach for Long-form Numerical Reasoning
by: Sun, Jiashuo, et al.
Published: (2022)
by: Sun, Jiashuo, et al.
Published: (2022)
Breaking the Data Barrier -- Building GUI Agents Through Task Generalization
by: Zhang, Junlei, et al.
Published: (2025)
by: Zhang, Junlei, et al.
Published: (2025)
Competition-Level Problems are Effective LLM Evaluators
by: Huang, Yiming, et al.
Published: (2023)
by: Huang, Yiming, et al.
Published: (2023)
Rho-1: Not All Tokens Are What You Need
by: Lin, Zhenghao, et al.
Published: (2024)
by: Lin, Zhenghao, et al.
Published: (2024)
Data Mixing Agent: Learning to Re-weight Domains for Continual Pre-training
by: Yang, Kailai, et al.
Published: (2025)
by: Yang, Kailai, et al.
Published: (2025)
CreditDecoding: Accelerating Parallel Decoding in Diffusion Large Language Models with Trace Credit
by: Wang, Kangyu, et al.
Published: (2025)
by: Wang, Kangyu, et al.
Published: (2025)
Dynamics of Instruction Fine-Tuning for Chinese Large Language Models
by: Song, Chiyu, et al.
Published: (2023)
by: Song, Chiyu, et al.
Published: (2023)
RECAP: Resistance Capture in Text-based Mental Health Counseling with Large Language Models
by: Li, Anqi, et al.
Published: (2026)
by: Li, Anqi, et al.
Published: (2026)
CRITIC: Large Language Models Can Self-Correct with Tool-Interactive Critiquing
by: Gou, Zhibin, et al.
Published: (2023)
by: Gou, Zhibin, et al.
Published: (2023)
Scientific Knowledge-driven Decoding Constraints Improving the Reliability of LLMs
by: Ma, Maotian, et al.
Published: (2026)
by: Ma, Maotian, et al.
Published: (2026)
Facilitating Pornographic Text Detection for Open-Domain Dialogue Systems via Knowledge Distillation of Large Language Models
by: Qiu, Huachuan, et al.
Published: (2024)
by: Qiu, Huachuan, et al.
Published: (2024)
Predicting the Big Five Personality Traits in Chinese Counselling Dialogues Using Large Language Models
by: Yan, Yang, et al.
Published: (2024)
by: Yan, Yang, et al.
Published: (2024)
Inclusion Arena: An Open Platform for Evaluating Large Foundation Models with Real-World Apps
by: Wang, Kangyu, et al.
Published: (2025)
by: Wang, Kangyu, et al.
Published: (2025)
LogiNumSynth: Synthesizing Joint Logical-Numerical Reasoning Problems for Language Models
by: Liu, Yiwei, et al.
Published: (2025)
by: Liu, Yiwei, et al.
Published: (2025)
Reasoning Beyond Chain-of-Thought: A Latent Computational Mode in Large Language Models
by: He, Zhenghao, et al.
Published: (2026)
by: He, Zhenghao, et al.
Published: (2026)
CARE: An Explainable Computational Framework for Assessing Client-Perceived Therapeutic Alliance Using Large Language Models
by: Li, Anqi, et al.
Published: (2026)
by: Li, Anqi, et al.
Published: (2026)
Similar Items
-
Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning
by: Park, Sungjin, et al.
Published: (2024) -
Beyond Length: Quantifying Long-Range Information for Long-Context LLM Pretraining Data
by: Deng, Haoran, et al.
Published: (2025) -
Learning from the Best, Differently: A Diversity-Driven Rethinking on Data Selection
by: He, Hongyi, et al.
Published: (2025) -
Enhancing Large Language Model Performance with Gradient-Based Parameter Selection
by: Li, Haoling, et al.
Published: (2024) -
Process-based Self-Rewarding Language Models
by: Zhang, Shimao, et al.
Published: (2025)