Saved in:
| Main Authors: | Xie, Jian, Chu, Zhendong, Zhong, Aoxiao, Zhang, Kai, Han, Mingzhe, Fan, Xing, Shen, Jialie, Wen, Qingsong |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.08163 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
UniEDU: A Unified Language and Vision Assistant for Education Applications
by: Chu, Zhendong, et al.
Published: (2025)
by: Chu, Zhendong, et al.
Published: (2025)
ARM: Adaptive Reasoning Model
by: Wu, Siye, et al.
Published: (2025)
by: Wu, Siye, et al.
Published: (2025)
LLM Agents for Education: Advances and Applications
by: Chu, Zhendong, et al.
Published: (2025)
by: Chu, Zhendong, et al.
Published: (2025)
ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection
by: Yan, Yibo, et al.
Published: (2024)
by: Yan, Yibo, et al.
Published: (2024)
League: Leaderboard Generation on Demand
by: Wu, Jian, et al.
Published: (2025)
by: Wu, Jian, et al.
Published: (2025)
RCAgent: Cloud Root Cause Analysis by Autonomous Agents with Tool-Augmented Large Language Models
by: Wang, Zefan, et al.
Published: (2023)
by: Wang, Zefan, et al.
Published: (2023)
Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning
by: Yan, Yibo, et al.
Published: (2025)
by: Yan, Yibo, et al.
Published: (2025)
Multimodal AI Teacher: Integrating Edge Computing and Reasoning Models for Enhanced Student Error Analysis
by: Tianlong Xu, et al.
Published: (2025)
by: Tianlong Xu, et al.
Published: (2025)
CulturePark: Boosting Cross-cultural Understanding in Large Language Models
by: Li, Cheng, et al.
Published: (2024)
by: Li, Cheng, et al.
Published: (2024)
Steering Large Language Models between Code Execution and Textual Reasoning
by: Chen, Yongchao, et al.
Published: (2024)
by: Chen, Yongchao, et al.
Published: (2024)
CultureVLM: Characterizing and Improving Cultural Understanding of Vision-Language Models for over 100 Countries
by: Liu, Shudong, et al.
Published: (2025)
by: Liu, Shudong, et al.
Published: (2025)
CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding
by: Shi, Yuling, et al.
Published: (2026)
by: Shi, Yuling, et al.
Published: (2026)
VideoARM: Agentic Reasoning over Hierarchical Memory for Long-Form Video Understanding
by: Yin, Yufei, et al.
Published: (2025)
by: Yin, Yufei, et al.
Published: (2025)
NExT: Teaching Large Language Models to Reason about Code Execution
by: Ni, Ansong, et al.
Published: (2024)
by: Ni, Ansong, et al.
Published: (2024)
AI-Driven Virtual Teacher for Enhanced Educational Efficiency: Leveraging Large Pretrain Models for Autonomous Error Analysis and Correction
by: Xu, Tianlong, et al.
Published: (2024)
by: Xu, Tianlong, et al.
Published: (2024)
Large Language Model Empowered Recommendation Meets All-domain Continual Pre-Training
by: Ma, Haokai, et al.
Published: (2025)
by: Ma, Haokai, et al.
Published: (2025)
Code Execution as Grounded Supervision for LLM Reasoning
by: Jung, Dongwon, et al.
Published: (2025)
by: Jung, Dongwon, et al.
Published: (2025)
StepCodeReasoner: Aligning Code Reasoning with Stepwise Execution Traces via Reinforcement Learning
by: Wang, Hao, et al.
Published: (2026)
by: Wang, Hao, et al.
Published: (2026)
Route to Reason: Adaptive Routing for LLM and Reasoning Strategy Selection
by: Pan, Zhihong, et al.
Published: (2025)
by: Pan, Zhihong, et al.
Published: (2025)
Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning
by: Wen, Jiaxin, et al.
Published: (2024)
by: Wen, Jiaxin, et al.
Published: (2024)
CodeScope: An Execution-based Multilingual Multitask Multidimensional Benchmark for Evaluating LLMs on Code Understanding and Generation
by: Yan, Weixiang, et al.
Published: (2023)
by: Yan, Weixiang, et al.
Published: (2023)
HyperGVL: Benchmarking and Improving Large Vision-Language Models in Hypergraph Understanding and Reasoning
by: Wei, Yanbin, et al.
Published: (2026)
by: Wei, Yanbin, et al.
Published: (2026)
CodeBenchGen: Creating Scalable Execution-based Code Generation Benchmarks
by: Xie, Yiqing, et al.
Published: (2024)
by: Xie, Yiqing, et al.
Published: (2024)
Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging
by: Chen, Shiqi, et al.
Published: (2025)
by: Chen, Shiqi, et al.
Published: (2025)
SkillBrew: Multi-Objective Curation of Skill Banks for LLM Agents
by: Hu, Wentao, et al.
Published: (2026)
by: Hu, Wentao, et al.
Published: (2026)
CODA: Difficulty-Aware Compute Allocation for Adaptive Reasoning
by: Wu, Siye, et al.
Published: (2026)
by: Wu, Siye, et al.
Published: (2026)
Bridging Code Graphs and Large Language Models for Better Code Understanding
by: Chen, Zeqi, et al.
Published: (2025)
by: Chen, Zeqi, et al.
Published: (2025)
Code-Guided Reasoning for Small Language Models: Evaluating Executable MCQA Scaffolds
by: Biswas, Prateek, et al.
Published: (2026)
by: Biswas, Prateek, et al.
Published: (2026)
LLM-Virus: Evolutionary Jailbreak Attack on Large Language Models
by: Yu, Miao, et al.
Published: (2024)
by: Yu, Miao, et al.
Published: (2024)
Markov Chain of Thought for Efficient Mathematical Reasoning
by: Yang, Wen, et al.
Published: (2024)
by: Yang, Wen, et al.
Published: (2024)
ARM: Discovering Agentic Reasoning Modules for Generalizable Multi-Agent Systems
by: Yao, Bohan, et al.
Published: (2025)
by: Yao, Bohan, et al.
Published: (2025)
Execution-Verified Reinforcement Learning for Optimization Modeling
by: Guan, Runda, et al.
Published: (2026)
by: Guan, Runda, et al.
Published: (2026)
Code-Vision: Evaluating Multimodal LLMs Logic Understanding and Code Generation Capabilities
by: Wang, Hanbin, et al.
Published: (2025)
by: Wang, Hanbin, et al.
Published: (2025)
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model
by: Fan, Chenghao, et al.
Published: (2026)
by: Fan, Chenghao, et al.
Published: (2026)
\texttt{ReMind}: Understanding Deductive Code Reasoning in LLMs
by: Gao, Jun, et al.
Published: (2025)
by: Gao, Jun, et al.
Published: (2025)
A Case Study of Selected PTQ Baselines for Reasoning LLMs on Ascend NPU
by: Luo, Yuchen, et al.
Published: (2026)
by: Luo, Yuchen, et al.
Published: (2026)
CRAFTQA: A Code-Driven Adaptive Framework for Complex Structured Data Reasoning
by: Gan, Chengtao, et al.
Published: (2026)
by: Gan, Chengtao, et al.
Published: (2026)
Stop Fixating on Prompts: Reasoning Hijacking and Constraint Tightening for Red-Teaming LLM Agents
by: Mao, Yanxu, et al.
Published: (2026)
by: Mao, Yanxu, et al.
Published: (2026)
A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges
by: Yan, Yibo, et al.
Published: (2024)
by: Yan, Yibo, et al.
Published: (2024)
Case-Based Calibration of Adaptive Reasoning and Execution for LLM Tool Use
by: Pang, Renning, et al.
Published: (2026)
by: Pang, Renning, et al.
Published: (2026)
Similar Items
-
UniEDU: A Unified Language and Vision Assistant for Education Applications
by: Chu, Zhendong, et al.
Published: (2025) -
ARM: Adaptive Reasoning Model
by: Wu, Siye, et al.
Published: (2025) -
LLM Agents for Education: Advances and Applications
by: Chu, Zhendong, et al.
Published: (2025) -
ErrorRadar: Benchmarking Complex Mathematical Reasoning of Multimodal Large Language Models Via Error Detection
by: Yan, Yibo, et al.
Published: (2024) -
League: Leaderboard Generation on Demand
by: Wu, Jian, et al.
Published: (2025)