Saved in:
| Main Authors: | Li, Yuran, Wu, Di, Boulet, Benoit |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.20441 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models
by: Sun, Chongren, et al.
Published: (2025)
by: Sun, Chongren, et al.
Published: (2025)
Self-Trained Verification for Training- and Test-Time Self-Improvement
by: Wu, Chen Henry, et al.
Published: (2026)
by: Wu, Chen Henry, et al.
Published: (2026)
An Online Self-learning Graph-based Lateral Controller for Self-Driving Cars
by: Samiuddin, Jilan, et al.
Published: (2024)
by: Samiuddin, Jilan, et al.
Published: (2024)
Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives
by: Zhang, Wenqi, et al.
Published: (2024)
by: Zhang, Wenqi, et al.
Published: (2024)
MARS: Memory-Enhanced Agents with Reflective Self-improvement
by: Liang, Xuechen, et al.
Published: (2025)
by: Liang, Xuechen, et al.
Published: (2025)
Progress or Regress? Self-Improvement Reversal in Post-training
by: Wu, Ting, et al.
Published: (2024)
by: Wu, Ting, et al.
Published: (2024)
TTSR: Test-Time Self-Reflection for Continual Reasoning Improvement
by: He, Haoyang, et al.
Published: (2026)
by: He, Haoyang, et al.
Published: (2026)
Re-ReST: Reflection-Reinforced Self-Training for Language Agents
by: Dou, Zi-Yi, et al.
Published: (2024)
by: Dou, Zi-Yi, et al.
Published: (2024)
Towards Verifiable Text Generation with Evolving Memory and Self-Reflection
by: Sun, Hao, et al.
Published: (2023)
by: Sun, Hao, et al.
Published: (2023)
Enabling Language Models to Implicitly Learn Self-Improvement
by: Wang, Ziqi, et al.
Published: (2023)
by: Wang, Ziqi, et al.
Published: (2023)
FABSVer: Faster Training and Better Self-Verification for LLM Mathematical Reasoning
by: Pan, Haihui, et al.
Published: (2026)
by: Pan, Haihui, et al.
Published: (2026)
Leveraging LLMs as Meta-Judges: A Multi-Agent Framework for Evaluating LLM Judgments
by: Li, Yuran, et al.
Published: (2025)
by: Li, Yuran, et al.
Published: (2025)
WebSeer: Training Deeper Search Agents through Reinforcement Learning with Self-Reflection
by: He, Guanzhong, et al.
Published: (2025)
by: He, Guanzhong, et al.
Published: (2025)
Semantic Voting: A Self-Evaluation-Free Approach for Efficient LLM Self-Improvement on Unverifiable Open-ended Tasks
by: Jiang, Chunyang, et al.
Published: (2025)
by: Jiang, Chunyang, et al.
Published: (2025)
MetaMem: Evolving Meta-Memory for Knowledge Utilization through Self-Reflective Symbolic Optimization
by: Xin, Haidong, et al.
Published: (2026)
by: Xin, Haidong, et al.
Published: (2026)
DIVE: Diversified Iterative Self-Improvement
by: Qin, Yiwei, et al.
Published: (2025)
by: Qin, Yiwei, et al.
Published: (2025)
$V_1$: Unifying Generation and Self-Verification for Parallel Reasoners
by: Singh, Harman, et al.
Published: (2026)
by: Singh, Harman, et al.
Published: (2026)
Training LLM-Based Agents with Synthetic Self-Reflected Trajectories and Partial Masking
by: Chen, Yihan, et al.
Published: (2025)
by: Chen, Yihan, et al.
Published: (2025)
Mitigating Tail Narrowing in LLM Self-Improvement via Socratic-Guided Sampling
by: Ding, Yiwen, et al.
Published: (2024)
by: Ding, Yiwen, et al.
Published: (2024)
Generating Equivalent Representations of Code By A Self-Reflection Approach
by: Li, Jia, et al.
Published: (2024)
by: Li, Jia, et al.
Published: (2024)
The Flip Side of RLHF: On-Policy Feedback for Reward Model Self-Supervised Improvement
by: Wang, Xiaobo, et al.
Published: (2026)
by: Wang, Xiaobo, et al.
Published: (2026)
Efficient and Accurate Prompt Optimization: the Benefit of Memory in Exemplar-Guided Reflection
by: Yan, Cilin, et al.
Published: (2024)
by: Yan, Cilin, et al.
Published: (2024)
Language Self-Play For Data-Free Training
by: Kuba, Jakub Grudzien, et al.
Published: (2025)
by: Kuba, Jakub Grudzien, et al.
Published: (2025)
A Survey on LLM Inference-Time Self-Improvement
by: Dong, Xiangjue, et al.
Published: (2024)
by: Dong, Xiangjue, et al.
Published: (2024)
Reference Trustable Decoding: A Training-Free Augmentation Paradigm for Large Language Models
by: Shi, Luohe, et al.
Published: (2024)
by: Shi, Luohe, et al.
Published: (2024)
Towards Faithful Explanations for Text Classification with Robustness Improvement and Explanation Guided Training
by: Li, Dongfang, et al.
Published: (2023)
by: Li, Dongfang, et al.
Published: (2023)
DRDT3: Diffusion-Refined Decision Test-Time Training Model
by: Huang, Xingshuai, et al.
Published: (2025)
by: Huang, Xingshuai, et al.
Published: (2025)
Self-Improvement Programming for Temporal Knowledge Graph Question Answering
by: Chen, Zhuo, et al.
Published: (2024)
by: Chen, Zhuo, et al.
Published: (2024)
Continuous Self-Improvement of Large Language Models by Test-time Training with Verifier-Driven Sample Selection
by: Moradi, Mohammad Mahdi, et al.
Published: (2025)
by: Moradi, Mohammad Mahdi, et al.
Published: (2025)
Self-Reflective Generation at Test Time
by: Mu, Jian, et al.
Published: (2025)
by: Mu, Jian, et al.
Published: (2025)
SRTJ: Self-Evolving Rule-Driven Training-Free LLM Jailbreaking
by: Li, Jindong, et al.
Published: (2026)
by: Li, Jindong, et al.
Published: (2026)
MuSC: Improving Complex Instruction Following with Multi-granularity Self-Contrastive Training
by: Huang, Hui, et al.
Published: (2025)
by: Huang, Hui, et al.
Published: (2025)
Self-Improvement in Multimodal Large Language Models: A Survey
by: Deng, Shijian, et al.
Published: (2025)
by: Deng, Shijian, et al.
Published: (2025)
Class Distillation with Mahalanobis Contrast: An Efficient Training Paradigm for Pragmatic Language Understanding Tasks
by: Wang, Chenlu, et al.
Published: (2025)
by: Wang, Chenlu, et al.
Published: (2025)
Efficient Self-Evaluation for Diffusion Language Models via Sequence Regeneration
by: Zhong, Linhao, et al.
Published: (2026)
by: Zhong, Linhao, et al.
Published: (2026)
Large Language Models Can Self-Correct with Key Condition Verification
by: Wu, Zhenyu, et al.
Published: (2024)
by: Wu, Zhenyu, et al.
Published: (2024)
I-SHEEP: Self-Alignment of LLM from Scratch through an Iterative Self-Enhancement Paradigm
by: Liang, Yiming, et al.
Published: (2024)
by: Liang, Yiming, et al.
Published: (2024)
Transplant Then Regenerate: A New Paradigm for Text Data Augmentation
by: Wang, Guangzhan, et al.
Published: (2025)
by: Wang, Guangzhan, et al.
Published: (2025)
SciInstruct: a Self-Reflective Instruction Annotated Dataset for Training Scientific Language Models
by: Zhang, Dan, et al.
Published: (2024)
by: Zhang, Dan, et al.
Published: (2024)
Efficient Self-Improvement in Multimodal Large Language Models: A Model-Level Judge-Free Approach
by: Deng, Shijian, et al.
Published: (2024)
by: Deng, Shijian, et al.
Published: (2024)
Similar Items
-
OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models
by: Sun, Chongren, et al.
Published: (2025) -
Self-Trained Verification for Training- and Test-Time Self-Improvement
by: Wu, Chen Henry, et al.
Published: (2026) -
An Online Self-learning Graph-based Lateral Controller for Self-Driving Cars
by: Samiuddin, Jilan, et al.
Published: (2024) -
Self-Contrast: Better Reflection Through Inconsistent Solving Perspectives
by: Zhang, Wenqi, et al.
Published: (2024) -
MARS: Memory-Enhanced Agents with Reflective Self-improvement
by: Liang, Xuechen, et al.
Published: (2025)