| Main Authors: | Cao, Mingyu, Correia, Alvaro H. C., Louizos, Christos, Liu, Shiwei, Yin, Lu |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.10953 |
Similar Items
Guarding the Meaning: Self-Supervised Training for Semantic Robustness in Guard Models
by: Pinneri, Cristina, et al.
Published: (2025)
Why Diffusion Language Models Struggle with Truly Parallel (Non-Autoregressive) Decoding?
by: Li, Pengxiang, et al.
Published: (2026)
Diffusion Language Models Know the Answer Before Decoding
by: Li, Pengxiang, et al.
Published: (2025)
Accelerating Large Language Model Reasoning via Speculative Search
by: Wang, Zhihai, et al.
Published: (2025)
Diffusion Language Model Inference with Monte Carlo Tree Search
by: Huang, Zheng, et al.
Published: (2025)
Analyzing and Improving Chain-of-Thought Monitorability Through Information Theory
by: Anwar, Usman, et al.
Published: (2026)
Weak-to-Strong Search: Align Large Language Models via Searching over Small Language Models
by: Zhou, Zhanhui, et al.
Published: (2024)
Affective and Dynamic Beam Search for Story Generation
by: Huang, Tenghao, et al.
Published: (2023)
PropRAG: Guiding Retrieval with Beam Search over Proposition Paths
by: Wang, Jingjin, et al.
Published: (2025)
Stream of Search (SoS): Learning to Search in Language
by: Gandhi, Kanishk, et al.
Published: (2024)
ToxSearch: Evolving Prompts for Toxicity Search in Large Language Models
by: Shelar, Onkar, et al.
Published: (2025)
Continual Learning with Embedding Layer Surgery and Task-wise Beam Search using Whisper
by: Kwok, Chin Yuen, et al.
Published: (2025)
Tree Search for Language Model Agents
by: Koh, Jing Yu, et al.
Published: (2024)
Outlier-weighed Layerwise Sampling for LLM Fine-tuning
by: Li, Pengxiang, et al.
Published: (2024)
ICR: Iterative Clarification and Rewriting for Conversational Search
by: Cao, Zhiyu, et al.
Published: (2025)
Re-Search for The Truth: Multi-round Retrieval-augmented Large Language Models are Strong Fake News Detectors
by: Li, Guanghua, et al.
Published: (2024)
Robust Search with Uncertainty-Aware Value Models for Language Model Reasoning
by: Yu, Fei, et al.
Published: (2025)
SEM: Reinforcement Learning for Search-Efficient Large Language Models
by: Sha, Zeyang, et al.
Published: (2025)
CoSearchAgent: A Lightweight Collaborative Search Agent with Large Language Models
by: Gong, Peiyuan, et al.
Published: (2024)
Hypothesis Search: Inductive Reasoning with Language Models
by: Wang, Ruocheng, et al.
Published: (2023)
BIPEFT: Budget-Guided Iterative Search for Parameter Efficient Fine-Tuning of Large Pretrained Language Models
by: Chang, Aofei, et al.
Published: (2024)
The Role of Model Confidence on Bias Effects in Measured Uncertainties for Vision-Language Models
by: Liu, Xinyi, et al.
Published: (2025)
BPP-Search: Enhancing Tree of Thought Reasoning for Mathematical Modeling Problem Solving
by: Wang, Teng, et al.
Published: (2024)
DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search
by: Wu, Fang, et al.
Published: (2025)
CARE-RFT: Confidence-Anchored Reinforcement Finetuning for Reliable Reasoning in Large Language Models
by: Li, Shuozhe, et al.
Published: (2026)
Model-Document Protocol for AI Search
by: Qian, Hongjin, et al.
Published: (2025)
LRAS: Advanced Legal Reasoning with Agentic Search
by: Zhou, Yujin, et al.
Published: (2026)
CO-Bench: Benchmarking Language Model Agents in Algorithm Search for Combinatorial Optimization
by: Sun, Weiwei, et al.
Published: (2025)
Learning to Better Search with Language Models via Guided Reinforced Self-Training
by: Moon, Seungyong, et al.
Published: (2024)
CreditDecoding: Accelerating Parallel Decoding in Diffusion Large Language Models with Trace Credit
by: Wang, Kangyu, et al.
Published: (2025)
Confidence-Calibrated Small-Large Language Model Collaboration for Cost-Efficient Reasoning
by: Zhang, Chuang, et al.
Published: (2026)
Creative Beam Search: LLM-as-a-Judge For Improving Response Generation
by: Franceschelli, Giorgio, et al.
Published: (2024)
The Confidence Shortcut: A Reasoning Failure Mode of Masked Diffusion Models
by: Kim, Dueun, et al.
Published: (2026)
Stable Diffusion-based Data Augmentation for Federated Learning with Non-IID Data
by: Morafah, Mahdi, et al.
Published: (2024)
Closing the Confidence-Faithfulness Gap in Large Language Models
by: Miao, Miranda Muqing, et al.
Published: (2026)
When Quantization Affects Confidence of Large Language Models?
by: Proskurina, Irina, et al.
Published: (2024)
Dep-Search: Learning Dependency-Aware Reasoning Traces with Persistent Memory
by: Liu, Yanming, et al.
Published: (2026)
Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration
by: He, Bowei, et al.
Published: (2026)
Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering
by: Guan, Xinyan, et al.
Published: (2024)
Multi-Faceted Self-Consistent Preference Alignment for Query Rewriting in Conversational Search
by: Cao, Zhiyu, et al.
Published: (2026)