Saved in:
| Main Authors: | Huang, Zenan, Li, Mingwei, Zhou, Zheng, Jiang, Youxin |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.10642 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Discovery and Reinforcement of Tool-Integrated Reasoning Chains via Rollout Trees
by: Li, Kun, et al.
Published: (2026)
by: Li, Kun, et al.
Published: (2026)
The Art of Efficient Reasoning: Data, Reward, and Optimization
by: Wu, Taiqiang, et al.
Published: (2026)
by: Wu, Taiqiang, et al.
Published: (2026)
Beyond Negative Rollouts: Positive-Only Policy Optimization with Implicit Negative Gradients
by: Xu, Mingwei, et al.
Published: (2026)
by: Xu, Mingwei, et al.
Published: (2026)
MedKGI: Iterative Differential Diagnosis with Medical Knowledge Graphs and Information-Guided Inquiring
by: Wang, Qipeng, et al.
Published: (2025)
by: Wang, Qipeng, et al.
Published: (2025)
Time-Critical Multimodal Medical Transportation: Organs, Patients, and Medical Supplies
by: Varnousfaderani, Elaheh Sabziyan, et al.
Published: (2026)
by: Varnousfaderani, Elaheh Sabziyan, et al.
Published: (2026)
Adaptive Termination for Multi-round Parallel Reasoning: An Universal Semantic Entropy-Guided Framework
by: Xu, Zenan, et al.
Published: (2025)
by: Xu, Zenan, et al.
Published: (2025)
Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization
by: Wu, Weiqi, et al.
Published: (2025)
by: Wu, Weiqi, et al.
Published: (2025)
FB-Bench: A Fine-Grained Multi-Task Benchmark for Evaluating LLMs' Responsiveness to Human Feedback
by: Li, Youquan, et al.
Published: (2024)
by: Li, Youquan, et al.
Published: (2024)
EvolveSearch: An Iterative Self-Evolving Search Agent
by: Zhang, Dingchu, et al.
Published: (2025)
by: Zhang, Dingchu, et al.
Published: (2025)
SpecBlock: Block-Iterative Speculative Decoding with Dynamic Tree Drafting
by: Shi, Weijie, et al.
Published: (2026)
by: Shi, Weijie, et al.
Published: (2026)
ConMax: Confidence-Maximizing Compression for Efficient Chain-of-Thought Reasoning
by: Hu, Minda, et al.
Published: (2026)
by: Hu, Minda, et al.
Published: (2026)
BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search
by: Sun, Linzhuang, et al.
Published: (2024)
by: Sun, Linzhuang, et al.
Published: (2024)
Debatrix: Multi-dimensional Debate Judge with Iterative Chronological Analysis Based on LLM
by: Liang, Jingcong, et al.
Published: (2024)
by: Liang, Jingcong, et al.
Published: (2024)
Zero-Shot Privacy-Aware Text Rewriting via Iterative Tree Search
by: Huang, Shuo, et al.
Published: (2025)
by: Huang, Shuo, et al.
Published: (2025)
Homogeneous Keys, Heterogeneous Values: Exploiting Local KV Cache Asymmetry for Long-Context LLMs
by: Cui, Wanyun, et al.
Published: (2025)
by: Cui, Wanyun, et al.
Published: (2025)
ELITE: Embedding-Less retrieval with Iterative Text Exploration
by: Wang, Zhangyu, et al.
Published: (2025)
by: Wang, Zhangyu, et al.
Published: (2025)
AIR: Complex Instruction Generation via Automatic Iterative Refinement
by: Liu, Wei, et al.
Published: (2025)
by: Liu, Wei, et al.
Published: (2025)
PAT: Pruning-Aware Tuning for Large Language Models
by: Liu, Yijiang, et al.
Published: (2024)
by: Liu, Yijiang, et al.
Published: (2024)
AD-CDO: A Lightweight Ontology for Representing Eligibility Criteria in Alzheimer's Disease Clinical Trials
by: Sun, Zenan, et al.
Published: (2025)
by: Sun, Zenan, et al.
Published: (2025)
IterAlign: Iterative Constitutional Alignment of Large Language Models
by: Chen, Xiusi, et al.
Published: (2024)
by: Chen, Xiusi, et al.
Published: (2024)
AutoEvoEval: An Automated Framework for Evolving Close-Ended LLM Evaluation Data
by: Wu, JiaRu, et al.
Published: (2025)
by: Wu, JiaRu, et al.
Published: (2025)
DiffuseDef: Improved Robustness to Adversarial Attacks via Iterative Denoising
by: Li, Zhenhao, et al.
Published: (2024)
by: Li, Zhenhao, et al.
Published: (2024)
Data Proportion Detection for Optimized Data Management for Large Language Models
by: Liang, Hao, et al.
Published: (2024)
by: Liang, Hao, et al.
Published: (2024)
Iterative Forward Tuning Boosts In-Context Learning in Language Models
by: Yang, Jiaxi, et al.
Published: (2023)
by: Yang, Jiaxi, et al.
Published: (2023)
Iterative Data Generation with Large Language Models for Aspect-based Sentiment Analysis
by: Zhong, Qihuang, et al.
Published: (2024)
by: Zhong, Qihuang, et al.
Published: (2024)
Iterative Multilingual Spectral Attribute Erasure
by: Shao, Shun, et al.
Published: (2025)
by: Shao, Shun, et al.
Published: (2025)
ProMed: Shapley Information Gain Guided Reinforcement Learning for Proactive Medical LLMs
by: Ding, Hongxin, et al.
Published: (2025)
by: Ding, Hongxin, et al.
Published: (2025)
Tree of Reviews: A Tree-based Dynamic Iterative Retrieval Framework for Multi-hop Question Answering
by: Jiapeng, Li, et al.
Published: (2024)
by: Jiapeng, Li, et al.
Published: (2024)
Quantifying Self-diagnostic Atomic Knowledge in Chinese Medical Foundation Model: A Computational Analysis
by: Fan, Yaxin, et al.
Published: (2023)
by: Fan, Yaxin, et al.
Published: (2023)
RuozhiBench: Evaluating LLMs with Logical Fallacies and Misleading Premises
by: Zhai, Zenan, et al.
Published: (2025)
by: Zhai, Zenan, et al.
Published: (2025)
Autoformalize Mathematical Statements by Symbolic Equivalence and Semantic Consistency
by: Li, Zenan, et al.
Published: (2024)
by: Li, Zenan, et al.
Published: (2024)
RAIR: Retrieval-Augmented Iterative Refinement for Chinese Spelling Correction
by: Liang, Junhong, et al.
Published: (2025)
by: Liang, Junhong, et al.
Published: (2025)
DataSculpt: Crafting Data Landscapes for Long-Context LLMs through Multi-Objective Partitioning
by: Lu, Keer, et al.
Published: (2024)
by: Lu, Keer, et al.
Published: (2024)
Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models
by: Jiang, Songtao, et al.
Published: (2024)
by: Jiang, Songtao, et al.
Published: (2024)
RIVAL: Reinforcement Learning with Iterative and Adversarial Optimization for Machine Translation
by: Li, Tianjiao, et al.
Published: (2025)
by: Li, Tianjiao, et al.
Published: (2025)
ARTIS: Agentic Risk-Aware Test-Time Scaling via Iterative Simulation
by: Zeng, Xingshan, et al.
Published: (2026)
by: Zeng, Xingshan, et al.
Published: (2026)
Text2MDT: Extracting Medical Decision Trees from Medical Texts
by: Zhu, Wei, et al.
Published: (2024)
by: Zhu, Wei, et al.
Published: (2024)
On the (In)Effectiveness of Large Language Models for Chinese Text Correction
by: Li, Yinghui, et al.
Published: (2023)
by: Li, Yinghui, et al.
Published: (2023)
Retrieve-Refine-Calibrate: A Framework for Complex Claim Fact-Checking
by: Sun, Mingwei, et al.
Published: (2026)
by: Sun, Mingwei, et al.
Published: (2026)
MathScape: Benchmarking Multimodal Large Language Models in Real-World Mathematical Contexts
by: Liang, Hao, et al.
Published: (2024)
by: Liang, Hao, et al.
Published: (2024)
Similar Items
-
Discovery and Reinforcement of Tool-Integrated Reasoning Chains via Rollout Trees
by: Li, Kun, et al.
Published: (2026) -
The Art of Efficient Reasoning: Data, Reward, and Optimization
by: Wu, Taiqiang, et al.
Published: (2026) -
Beyond Negative Rollouts: Positive-Only Policy Optimization with Implicit Negative Gradients
by: Xu, Mingwei, et al.
Published: (2026) -
MedKGI: Iterative Differential Diagnosis with Medical Knowledge Graphs and Information-Guided Inquiring
by: Wang, Qipeng, et al.
Published: (2025) -
Time-Critical Multimodal Medical Transportation: Organs, Patients, and Medical Supplies
by: Varnousfaderani, Elaheh Sabziyan, et al.
Published: (2026)