Library Catalog

Bibliographic Details
Main Authors:	Huang, Zenan; Li, Mingwei; Zhou, Zheng; Jiang, Youxin
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2501.10642

Similar Items

Discovery and Reinforcement of Tool-Integrated Reasoning Chains via Rollout Trees
by: Li, Kun, et al.
Published: (2026)

The Art of Efficient Reasoning: Data, Reward, and Optimization
by: Wu, Taiqiang, et al.
Published: (2026)

Beyond Negative Rollouts: Positive-Only Policy Optimization with Implicit Negative Gradients
by: Xu, Mingwei, et al.
Published: (2026)

MedKGI: Iterative Differential Diagnosis with Medical Knowledge Graphs and Information-Guided Inquiring
by: Wang, Qipeng, et al.
Published: (2025)

Time-Critical Multimodal Medical Transportation: Organs, Patients, and Medical Supplies
by: Varnousfaderani, Elaheh Sabziyan, et al.
Published: (2026)

Adaptive Termination for Multi-round Parallel Reasoning: An Universal Semantic Entropy-Guided Framework
by: Xu, Zenan, et al.
Published: (2025)

Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization
by: Wu, Weiqi, et al.
Published: (2025)

FB-Bench: A Fine-Grained Multi-Task Benchmark for Evaluating LLMs' Responsiveness to Human Feedback
by: Li, Youquan, et al.
Published: (2024)

EvolveSearch: An Iterative Self-Evolving Search Agent
by: Zhang, Dingchu, et al.
Published: (2025)

SpecBlock: Block-Iterative Speculative Decoding with Dynamic Tree Drafting
by: Shi, Weijie, et al.
Published: (2026)

ConMax: Confidence-Maximizing Compression for Efficient Chain-of-Thought Reasoning
by: Hu, Minda, et al.
Published: (2026)

BEATS: Optimizing LLM Mathematical Capabilities with BackVerify and Adaptive Disambiguate based Efficient Tree Search
by: Sun, Linzhuang, et al.
Published: (2024)

Debatrix: Multi-dimensional Debate Judge with Iterative Chronological Analysis Based on LLM
by: Liang, Jingcong, et al.
Published: (2024)

Zero-Shot Privacy-Aware Text Rewriting via Iterative Tree Search
by: Huang, Shuo, et al.
Published: (2025)

Homogeneous Keys, Heterogeneous Values: Exploiting Local KV Cache Asymmetry for Long-Context LLMs
by: Cui, Wanyun, et al.
Published: (2025)

ELITE: Embedding-Less retrieval with Iterative Text Exploration
by: Wang, Zhangyu, et al.
Published: (2025)

AIR: Complex Instruction Generation via Automatic Iterative Refinement
by: Liu, Wei, et al.
Published: (2025)

PAT: Pruning-Aware Tuning for Large Language Models
by: Liu, Yijiang, et al.
Published: (2024)

AD-CDO: A Lightweight Ontology for Representing Eligibility Criteria in Alzheimer's Disease Clinical Trials
by: Sun, Zenan, et al.
Published: (2025)

IterAlign: Iterative Constitutional Alignment of Large Language Models
by: Chen, Xiusi, et al.
Published: (2024)

AutoEvoEval: An Automated Framework for Evolving Close-Ended LLM Evaluation Data
by: Wu, JiaRu, et al.
Published: (2025)

DiffuseDef: Improved Robustness to Adversarial Attacks via Iterative Denoising
by: Li, Zhenhao, et al.
Published: (2024)

Data Proportion Detection for Optimized Data Management for Large Language Models
by: Liang, Hao, et al.
Published: (2024)

Iterative Forward Tuning Boosts In-Context Learning in Language Models
by: Yang, Jiaxi, et al.
Published: (2023)

Iterative Data Generation with Large Language Models for Aspect-based Sentiment Analysis
by: Zhong, Qihuang, et al.
Published: (2024)

Iterative Multilingual Spectral Attribute Erasure
by: Shao, Shun, et al.
Published: (2025)

ProMed: Shapley Information Gain Guided Reinforcement Learning for Proactive Medical LLMs
by: Ding, Hongxin, et al.
Published: (2025)

Tree of Reviews: A Tree-based Dynamic Iterative Retrieval Framework for Multi-hop Question Answering
by: Jiapeng, Li, et al.
Published: (2024)

Quantifying Self-diagnostic Atomic Knowledge in Chinese Medical Foundation Model: A Computational Analysis
by: Fan, Yaxin, et al.
Published: (2023)

RuozhiBench: Evaluating LLMs with Logical Fallacies and Misleading Premises
by: Zhai, Zenan, et al.
Published: (2025)

Autoformalize Mathematical Statements by Symbolic Equivalence and Semantic Consistency
by: Li, Zenan, et al.
Published: (2024)

RAIR: Retrieval-Augmented Iterative Refinement for Chinese Spelling Correction
by: Liang, Junhong, et al.
Published: (2025)

DataSculpt: Crafting Data Landscapes for Long-Context LLMs through Multi-Objective Partitioning
by: Lu, Keer, et al.
Published: (2024)

Med-MoE: Mixture of Domain-Specific Experts for Lightweight Medical Vision-Language Models
by: Jiang, Songtao, et al.
Published: (2024)

RIVAL: Reinforcement Learning with Iterative and Adversarial Optimization for Machine Translation
by: Li, Tianjiao, et al.
Published: (2025)

ARTIS: Agentic Risk-Aware Test-Time Scaling via Iterative Simulation
by: Zeng, Xingshan, et al.
Published: (2026)

Text2MDT: Extracting Medical Decision Trees from Medical Texts
by: Zhu, Wei, et al.
Published: (2024)

On the (In)Effectiveness of Large Language Models for Chinese Text Correction
by: Li, Yinghui, et al.
Published: (2023)

Retrieve-Refine-Calibrate: A Framework for Complex Claim Fact-Checking
by: Sun, Mingwei, et al.
Published: (2026)

MathScape: Benchmarking Multimodal Large Language Models in Real-World Mathematical Contexts
by: Liang, Hao, et al.
Published: (2024)