:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Luo, Yun, Wang, Futing, Cheng, Qianjia, Yu, Fangchen, Lei, Haodi, Yan, Jianhao, Li, Chenxi, Chen, Jiacheng, Zhao, Yufeng, Wan, Haiyuan, Zhang, Yuchen, Zheng, Shenghe, Yao, Junchi, Zhang, Qingyang, He, Haonan, Zeng, Wenxuan, Sheng, Li, Xie, Chengxing, Zuo, Yuxin, Li, Yizhuo, Wu, Yulun, Huang, Rui, Zhou, Dongzhan, Chen, Kai, Qiao, Yu, Bai, Lei, Cheng, Yu, Ding, Ning, Zhou, Bowen, Ye, Peng, Cui, Ganqu
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.09443
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

P1: Mastering Physics Olympiads with Reinforcement Learning
by: Chen, Jiacheng, et al.
Published: (2025)

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?
by: Yu, Fangchen, et al.
Published: (2025)

Scaling Physical Reasoning with the PHYSICS Dataset
by: Zheng, Shenghe, et al.
Published: (2025)

PhysicsMinions: Winning Gold Medals in the Latest Physics Olympiads with a Coevolutionary Multimodal Multi-Agent System
by: Yu, Fangchen, et al.
Published: (2025)

Achieving Gold-Medal-Level Olympiad Reasoning via Simple and Unified Scaling
by: Li, Yafu, et al.
Published: (2026)

Draft-OPD: On-Policy Distillation for Speculative Draft Models
by: Lei, Haodi, et al.
Published: (2026)

Reasoning via Video: The First Evaluation of Video Models' Reasoning Abilities through Maze-Solving Tasks
by: Yang, Cheng, et al.
Published: (2025)

SCI-Verifier: Scientific Verifier with Thinking
by: Zheng, Shenghe, et al.
Published: (2025)

Think Longer to Explore Deeper: Learn to Explore In-Context via Length-Incentivized Reinforcement Learning
by: Wang, Futing, et al.
Published: (2026)

From What to Why: A Multi-Agent System for Evidence-based Chemical Reaction Condition Reasoning
by: Yang, Cheng, et al.
Published: (2025)

Teaching Thinking Models to Reason with Tools: A Full-Pipeline Recipe for Tool-Integrated Reasoning
by: Cheng, Qianjia, et al.
Published: (2026)

Can Knowledge-Graph-based Retrieval Augmented Generation Really Retrieve What You Need?
by: Yu, Junchi, et al.
Published: (2025)

Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B
by: Zhang, Di, et al.
Published: (2024)

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning
by: Zhang, Di, et al.
Published: (2024)

Potential and Challenges of Model Editing for Social Debiasing
by: Yan, Jianhao, et al.
Published: (2024)

Learning to Reason under Off-Policy Guidance
by: Yan, Jianhao, et al.
Published: (2025)

SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward
by: Fan, Kaixuan, et al.
Published: (2025)

LIONs: An Empirically Optimized Approach to Align Language Models
by: Yu, Xiao, et al.
Published: (2024)

Improved Laguerre Spectral Methods with Less Round-off Errors and Better Stability
by: Huang, Shenghe, et al.
Published: (2022)

OMIBench: Benchmarking Olympiad-Level Multi-Image Reasoning in Large Vision-Language Model
by: Chen, Qiguang, et al.
Published: (2026)

Keys to Robust Edits: from Theoretical Insights to Practical Advances
by: Yan, Jianhao, et al.
Published: (2024)

DeepResearch Arena: The First Exam of LLMs' Research Abilities via Seminar-Grounded Tasks
by: Wan, Haiyuan, et al.
Published: (2025)

ELICIT: LLM Augmentation via External In-Context Capability
by: Wang, Futing, et al.
Published: (2024)

From Drafts to Answers: Unlocking LLM Potential via Aggregation Fine-Tuning
by: Li, Yafu, et al.
Published: (2025)

Deformation-based In-Context Learning for Point Cloud Understanding
by: Lin, Chengxing, et al.
Published: (2026)

UltraIF: Advancing Instruction Following from the Wild
by: An, Kaikai, et al.
Published: (2025)

MokA: Multimodal Low-Rank Adaptation for MLLMs
by: Wei, Yake, et al.
Published: (2025)

Attention Reallocation: Towards Zero-cost and Controllable Hallucination Mitigation of MLLMs
by: Tu, Chongjun, et al.
Published: (2025)

Unveiling Attractor Cycles in Large Language Models: A Dynamical Systems View of Successive Paraphrasing
by: Wang, Zhilin, et al.
Published: (2025)

Submodular flows and extreme flows on measurable spaces
by: Yu, Jing, et al.
Published: (2026)

Impacts of Food‐Based Flock Size on Foraging Patterns, Activity Time Budget and Foraging Efficiency: Flexible Behavioral Responses of the Wintering Oriental Storks (Ciconia boyciana) to Changes in Aquaculture at Shengjin Lake, China
by: Lei Cheng, et al.
Published: (2025)

Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics
by: Zhang, Xinyu, et al.
Published: (2024)

GuardTrace-VL: Detecting Unsafe Multimodel Reasoning via Iterative Safety Supervision
by: Xiang, Yuxiao, et al.
Published: (2025)

LabBuilder: Protocol-Grounded 3D Layout Generation for Interactable and Safe Laboratory
by: Cao, Jianbao, et al.
Published: (2026)

Bench2Drive-VL: Benchmarks for Closed-Loop Autonomous Driving with Vision-Language Models
by: Jia, Xiaosong, et al.
Published: (2026)

TEMPO: Scaling Test-time Training for Large Reasoning Models
by: Zhang, Qingyang, et al.
Published: (2026)

Adaptive Stopping for Multi-Turn LLM Reasoning
by: Zhou, Xiaofan, et al.
Published: (2026)

Invisible Backdoor Attacks on Diffusion Models
by: Li, Sen, et al.
Published: (2024)

Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation
by: Li, Yi-Chen, et al.
Published: (2024)

Enhancing Table Recognition with Vision LLMs: A Benchmark and Neighbor-Guided Toolchain Reasoner
by: Zhou, Yitong, et al.
Published: (2024)