| Main authors: | Wang, Teng; Jiang, Zhangyi; He, Zhenqi; Tong, Shenyang; Yang, Wenhan; Zheng, Yanan; Li, Zeyu; He, Zifan; Gong, Hailei; Ye, Zewen; Ma, Shengjie; Zhang, Jianping |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online access: | https://arxiv.org/abs/2503.13551 |
Similar items
BPP-Search: Enhancing Tree of Thought Reasoning for Mathematical Modeling Problem Solving
by: Wang, Teng, et al.
Published: (2024)
Thought Manipulation: External Thought Can Be Efficient for Large Reasoning Models
by: Liu, Yule, et al.
Published: (2025)
Reward Reasoning Model
by: Guo, Jiaxin, et al.
Published: (2025)
Large Language Models are Good Multi-lingual Learners : When LLMs Meet Cross-lingual Prompts
by: Wang, Teng, et al.
Published: (2024)
HyperGuide: Hyperbolic Guidance for Efficient Multi-Step Reasoning in Large Language Models
by: Liu, Yuyu, et al.
Published: (2026)
SEAL: Semantic-Aware Hierarchical Learning for Generalized Category Discovery
by: He, Zhenqi, et al.
Published: (2025)
Beyond Correctness: Confidence-Aware Reward Modeling for Enhancing Large Language Model Reasoning
by: He, Qianxi, et al.
Published: (2025)
Leveraging Large Language Models for Solving Rare MIP Challenges
by: Wang, Teng, et al.
Published: (2024)
SAGE: Strategy-Adaptive Generation Engine for Query Rewriting
by: Wang, Teng, et al.
Published: (2025)
Benchmarking Multi-Step Legal Reasoning and Analyzing Chain-of-Thought Effects in Large Language Models
by: Yu, Wenhan, et al.
Published: (2025)
Enhancing Causal Reasoning in Large Language Models: A Causal Attribution Model for Precision Fine-Tuning
by: Cai, Hengrui, et al.
Published: (2023)
LUT-LLM: Efficient Large Language Model Inference with Memory-based Computations on FPGAs
by: He, Zifan, et al.
Published: (2025)
TableReasoner: Advancing Table Reasoning Framework with Large Language Models
by: Xiong, Sishi, et al.
Published: (2025)
Towards Lifelong Learning of Large Language Models: A Survey
by: Zheng, Junhao, et al.
Published: (2024)
ReasonGRM: Enhancing Generative Reward Models through Large Reasoning Models
by: Chen, Bin, et al.
Published: (2025)
AWPO: Enhancing Tool-Use of Large Language Models through Adaptive Integration of Reasoning Rewards
by: Lin, Zihan, et al.
Published: (2025)
MedPRMBench: A Fine-grained Benchmark for Process Reward Models in Medical Reasoning
by: Wu, Lingyan, et al.
Published: (2026)
Optimal scheduling of interim analyses in group sequential trials
by: He, Zhangyi, et al.
Published: (2025)
GAS: Enhancing Reward-Cost Balance of Generative Model-assisted Offline Safe RL
by: Liu, Zifan, et al.
Published: (2026)
Can Large Language Models Adequately Perform Symbolic Reasoning Over Time Series?
by: Liu, Zewen, et al.
Published: (2025)
Understand and Accelerate Memory Processing Pipeline for Large Language Model Inference
by: He, Zifan, et al.
Published: (2026)
A Hierarchical Signal Coordination and Control System Using a Hybrid Model-based and Reinforcement Learning Approach
by: Peng, Xianyue, et al.
Published: (2025)
Enhancing Large Language Model Reasoning with Reward Models: An Analytical Survey
by: Liu, Qiyuan, et al.
Published: (2025)
APR: Penalizing Structural Redundancy in Large Reasoning Models via Anchor-based Process Rewards
by: Chang, Kaiyan, et al.
Published: (2026)
VLA-R1: Enhancing Reasoning in Vision-Language-Action Models
by: Ye, Angen, et al.
Published: (2025)
Omanic: Towards Step-wise Evaluation of Multi-hop Reasoning in Large Language Models
by: Gu, Xiaojie, et al.
Published: (2026)
Role-Play Paradox in Large Language Models: Reasoning Performance Gains and Ethical Dilemmas
by: Zhao, Jinman, et al.
Published: (2024)
CORAL: COntextual Reasoning And Local Planning in A Hierarchical VLM Framework for Underwater Monitoring
by: Wu, Zhenqi, et al.
Published: (2026)
GasAgent: A Multi-Agent Framework for Automated Gas Optimization in Smart Contracts
by: Zheng, Jingyi, et al.
Published: (2025)
Curing Miracle Steps in LLM Mathematical Reasoning with Rubric Rewards
by: Yuan, Youliang, et al.
Published: (2025)
Improving Data and Reward Design for Scientific Reasoning in Large Language Models
by: Chen, Zijie, et al.
Published: (2026)
Step-by-Step Mastery: Enhancing Soft Constraint Following Ability of Large Language Models
by: Ren, Qingyu, et al.
Published: (2025)
Self Model for Embodied Intelligence: Modeling Full-Body Human Musculoskeletal System and Locomotion Control with Hierarchical Low-Dimensional Representation
by: Zuo, Chenhui, et al.
Published: (2023)
Instruct-of-Reflection: Enhancing Large Language Models Iterative Reflection Capabilities via Dynamic-Meta Instruction
by: Liu, Liping, et al.
Published: (2025)
When Do Symbolic Solvers Enhance Reasoning in Large Language Models?
by: He, Zhiyuan, et al.
Published: (2025)
Explicit Preference Optimization: No Need for an Implicit Reward Model
by: Hu, Xiangkun, et al.
Published: (2025)
Step-wise Rubric Rewards for LLM Reasoning
by: Xie, Weichu, et al.
Published: (2026)
Geo-Align: Video Generation Alignment via Metric Geometry Reward
by: Li, Zizun, et al.
Published: (2026)
Towards Robust Model-Based Reinforcement Learning Against Adversarial Corruption
by: Ye, Chenlu, et al.
Published: (2024)
H$^{2}$MT: Semantic Hierarchy-Aware Hierarchical Memory Transformer
by: Haghifam, Maryam, et al.
Published: (2026)