Library Catalog

Bibliographic Details
Main Authors:	Deng, Jie; Tong, Hanshuang; Li, Jun; Liang, Shining; Wu, Ning; Li, Hongzhi; Xie, Yutao
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2602.04391

Similar Items

ConPress: Learning Efficient Reasoning from Multi-Question Contextual Pressure
by: Deng, Jie, et al.
Published: (2026)

PIKA: Expert-Level Synthetic Datasets for Post-Training Alignment from Scratch
by: Yin, Shangjian, et al.
Published: (2025)

Ploutos: Towards interpretable stock movement prediction with financial large language model
by: Tong, Hanshuang, et al.
Published: (2024)

Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers
by: Chen, Nuo, et al.
Published: (2023)

Evaluating Mathematical Reasoning Beyond Accuracy
by: Xia, Shijie, et al.
Published: (2024)

Quantifying and Improving the Robustness of Retrieval-Augmented Language Models Against Spurious Features in Grounding Data
by: Yang, Shiping, et al.
Published: (2025)

DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving
by: Tong, Yuxuan, et al.
Published: (2024)

A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce
by: Xiong, Wei, et al.
Published: (2025)

Lost in Stories: Consistency Bugs in Long Story Generation by LLMs
by: Li, Junjie, et al.
Published: (2026)

Reasons to Reject? Aligning Language Models with Judgments
by: Xu, Weiwen, et al.
Published: (2023)

Selected Languages are All You Need for Cross-lingual Truthfulness Transfer
by: Liu, Weihao, et al.
Published: (2024)

Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling
by: Huang, Hongzhi, et al.
Published: (2025)

Scaling Flaws of Verifier-Guided Search in Mathematical Reasoning
by: Yu, Fei, et al.
Published: (2025)

Optimizing Chain-of-Thought Reasoners via Gradient Variance Minimization in Rejection Sampling and RL
by: Yao, Jiarui, et al.
Published: (2025)

Step Guided Reasoning: Improving Mathematical Reasoning using Guidance Generation and Step Reasoning
by: Cao, Lang, et al.
Published: (2024)

Toward Automated Robustness Evaluation of Mathematical Reasoning
by: Hou, Yutao, et al.
Published: (2025)

Dynamic Sampling that Adapts: Self-Aware Iterative Data Persistent Optimization for Mathematical Reasoning
by: Rao, Jun, et al.
Published: (2025)

Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations
by: Chen, Nuo, et al.
Published: (2023)

Beyond "I cannot fulfill this request": Alleviating Rigid Rejection in LLMs via Label Enhancement
by: Zhang, Ying, et al.
Published: (2026)

ScaleDiff: Scaling Difficult Problems for Advanced Mathematical Reasoning
by: Pei, Qizhi, et al.
Published: (2025)

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning
by: Zhang, Di, et al.
Published: (2024)

Rethinking Expert Trajectory Utilization in LLM Post-training for Mathematical Reasoning
by: Ding, Bowen, et al.
Published: (2025)

MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads
by: Liu, Weihao, et al.
Published: (2025)

Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning
by: Zhang, Zhihan, et al.
Published: (2024)

Reasoning Aware Self-Consistency: Leveraging Reasoning Paths for Efficient LLM Sampling
by: Wan, Guangya, et al.
Published: (2024)

Examining False Positives under Inference Scaling for Mathematical Reasoning
by: Wang, Yu, et al.
Published: (2025)

Legal Mathematical Reasoning with LLMs: Procedural Alignment through Two-Stage Reinforcement Learning
by: Zhang, Kepu, et al.
Published: (2025)

Aligning Reasoning LLMs for Materials Discovery with Physics-aware Rejection Sampling
by: Hyun, Lee, et al.
Published: (2025)

What Really Improves Mathematical Reasoning: Structured Reasoning Signals Beyond Pure Code
by: Zhao, Yuze, et al.
Published: (2026)

Statistical Rejection Sampling Improves Preference Optimization
by: Liu, Tianqi, et al.
Published: (2023)

DuaShepherd: Integrating Stepwise Correctness and Potential Rewards for Mathematical Reasoning
by: Wu, Yuanhao, et al.
Published: (2025)

AERO: Autonomous Evolutionary Reasoning Optimization via Endogenous Dual-Loop Feedback
by: Gao, Zhitao, et al.
Published: (2026)

Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning
by: Zhao, Jun, et al.
Published: (2024)

Constrained Adaptive Rejection Sampling
by: Parys, Paweł, et al.
Published: (2025)

Beyond Input Understanding: Diagnosing Multilingual Mathematical Reasoning with Directed Acyclic Trace Graphs
by: Zhang, Jiaqiao, et al.
Published: (2026)

MultiLingPoT: Enhancing Mathematical Reasoning with Multilingual Program Fine-tuning
by: Li, Nianqi, et al.
Published: (2024)

Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning
by: Wang, Yiming, et al.
Published: (2024)

Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning
by: Son, Guijin, et al.
Published: (2025)

Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning
by: Li, Zhen, et al.
Published: (2025)

Reasoning Pattern Alignment Merging for Adaptive Reasoning
by: Zhong, Zhaofeng, et al.
Published: (2026)