| Main Authors: | Luo, Yun, Wang, Futing, Cheng, Qianjia, Yu, Fangchen, Lei, Haodi, Yan, Jianhao, Li, Chenxi, Chen, Jiacheng, Zhao, Yufeng, Wan, Haiyuan, Zhang, Yuchen, Zheng, Shenghe, Yao, Junchi, Zhang, Qingyang, He, Haonan, Zeng, Wenxuan, Sheng, Li, Xie, Chengxing, Zuo, Yuxin, Li, Yizhuo, Wu, Yulun, Huang, Rui, Zhou, Dongzhan, Chen, Kai, Qiao, Yu, Bai, Lei, Cheng, Yu, Ding, Ning, Zhou, Bowen, Ye, Peng, Cui, Ganqu |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.09443 |
Similar Items
P1: Mastering Physics Olympiads with Reinforcement Learning
by: Chen, Jiacheng, et al.
Published: (2025)
HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
by: Yu, Fangchen, et al.
Published: (2025)
Scaling Physical Reasoning with the PHYSICS Dataset
by: Zheng, Shenghe, et al.
Published: (2025)
PhysicsMinions: Winning Gold Medals in the Latest Physics Olympiads with a Coevolutionary Multimodal Multi-Agent System
by: Yu, Fangchen, et al.
Published: (2025)
Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling
by: Li, Yafu, et al.
Published: (2026)
Draft-OPD: On-Policy Distillation for Speculative Draft Models
by: Lei, Haodi, et al.
Published: (2026)
Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks
by: Yang, Cheng, et al.
Published: (2025)
SCI-Verifier: Scientific Verifier with Thinking
by: Zheng, Shenghe, et al.
Published: (2025)
Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning
by: Wang, Futing, et al.
Published: (2026)
From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning
by: Yang, Cheng, et al.
Published: (2025)
Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning
by: Cheng, Qianjia, et al.
Published: (2026)
Can Knowledge-Graph-based Retrieval Augmented Generation Really Retrieve What You Need?
by: Yu, Junchi, et al.
Published: (2025)
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B
by: Zhang, Di, et al.
Published: (2024)
LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning
by: Zhang, Di, et al.
Published: (2024)
Potential and Challenges of Model Editing for Social Debiasing
by: Yan, Jianhao, et al.
Published: (2024)
Learning to Reason under Off-Policy Guidance
by: Yan, Jianhao, et al.
Published: (2025)
SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward
by: Fan, Kaixuan, et al.
Published: (2025)
LIONs: An Empirically Optimized Approach to Align Language Models
by: Yu, Xiao, et al.
Published: (2024)
Improved Laguerre Spectral Methods with Less Round-off Errors and Better Stability
by: Huang, Shenghe, et al.
Published: (2022)
OMIBench: Benchmarking Olympiad-Level Multi-Image Reasoning in Large Vision-Language Model
by: Chen, Qiguang, et al.
Published: (2026)
Keys to Robust Edits: from Theoretical Insights to Practical Advances
by: Yan, Jianhao, et al.
Published: (2024)
DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks
by: Wan, Haiyuan, et al.
Published: (2025)
ELICIT: LLM Augmentation via External In-Context Capability
by: Wang, Futing, et al.
Published: (2024)
From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning
by: Li, Yafu, et al.
Published: (2025)
Deformation-based In-Context Learning for Point Cloud Understanding
by: Lin, Chengxing, et al.
Published: (2026)
UltraIF: Advancing Instruction Following from the Wild
by: An, Kaikai, et al.
Published: (2025)
MokA: Multimodal Low-Rank Adaptation for MLLMs
by: Wei, Yake, et al.
Published: (2025)
Attention Reallocation: Towards Zero-cost and Controllable Hallucination Mitigation of MLLMs
by: Tu, Chongjun, et al.
Published: (2025)
Unveiling Attractor Cycles in Large Language Models: A Dynamical Systems View of Successive Paraphrasing
by: Wang, Zhilin, et al.
Published: (2025)
Submodular flows and extreme flows on measurable spaces
by: Yu, Jing, et al.
Published: (2026)
Impacts of Food‐Based Flock Size on Foraging Patterns, Activity Time Budget and Foraging Efficiency: Flexible Behavioral Responses of the Wintering Oriental Storks (Ciconia boyciana) to Changes in Aquaculture at Shengjin Lake, China
by: Lei Cheng, et al.
Published: (2025)
Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
by: Zhang, Xinyu, et al.
Published: (2024)
GuardTrace-VL: Detecting Unsafe Multimodel Reasoning via Iterative Safety Supervision
by: Xiang, Yuxiao, et al.
Published: (2025)
LabBuilder: Protocol-Grounded 3D Layout Generation for Interactable and Safe Laboratory
by: Cao, Jianbao, et al.
Published: (2026)
Bench2Drive-VL: Benchmarks for Closed-Loop Autonomous Driving with Vision-Language Models
by: Jia, Xiaosong, et al.
Published: (2026)
TEMPO: Scaling Test-time Training for Large Reasoning Models
by: Zhang, Qingyang, et al.
Published: (2026)
Adaptive Stopping for Multi-Turn LLM Reasoning
by: Zhou, Xiaofan, et al.
Published: (2026)
Invisible Backdoor Attacks on Diffusion Models
by: Li, Sen, et al.
Published: (2024)
Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation
by: Li, Yi-Chen, et al.
Published: (2024)
Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner
by: Zhou, Yitong, et al.
Published: (2024)