:: Library Catalog

Bibliographic Details
Main Authors:	Liu, Siyuan; Liu, Wenjing; Xu, Zhiwei; Wang, Xin; Chen, Bo; Li, Tao
Format:	Preprint
Published:	2025
Subjects:	Machine Learning, Artificial Intelligence
Online Access:	https://arxiv.org/abs/2507.15903

Similar Items

Mitigating LLM Hallucination via Behaviorally Calibrated Reinforcement Learning
by: Wu, Jiayun, et al.
Published: (2025)

Mitigating LLM Hallucinations via Conformal Abstention
by: Yadkori, Yasin Abbasi, et al.
Published: (2024)

On Mitigating Code LLM Hallucinations with API Documentation
by: Jain, Nihal, et al.
Published: (2024)

ICLShield: Exploring and Mitigating In-Context Learning Backdoor Attacks
by: Ren, Zhiyao, et al.
Published: (2025)

REX: Rapid Exploration and eXploitation for AI Agents
by: Murthy, Rithesh, et al.
Published: (2023)

Data Difficulty and the Generalization-Extrapolation Tradeoff in LLM Fine-Tuning
by: Liu, Siyuan, et al.
Published: (2026)

GoAgent: Group-of-Agents Communication Topology Generation for LLM-based Multi-Agent Systems
by: Chen, Hongjiang, et al.
Published: (2026)

Beyond Noisy-TVs: Noise-Robust Exploration Via Learning Progress Monitoring
by: Hou, Zhibo, et al.
Published: (2025)

Reliable Weak-to-Strong Monitoring of LLM Agents
by: Kale, Neil, et al.
Published: (2025)

Towards Mitigating Architecture Overfitting on Distilled Datasets
by: Zhong, Xuyang, et al.
Published: (2023)

Generation Constraint Scaling Can Mitigate Hallucination
by: Kollias, Georgios, et al.
Published: (2024)

Towards Mitigating Excessive Forgetting in LLM Unlearning via Entanglement-Guidance with Proxy Constraint
by: Liu, Zhihao, et al.
Published: (2025)

Group-in-Group Policy Optimization for LLM Agent Training
by: Feng, Lang, et al.
Published: (2025)

Adventurer: Exploration with BiGAN for Deep Reinforcement Learning
by: Liu, Yongshuai, et al.
Published: (2025)

In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation
by: Chen, Shiqi, et al.
Published: (2024)

FT-Dojo: Towards Autonomous LLM Fine-Tuning with Language Agents
by: Li, Qizheng, et al.
Published: (2026)

Awakening Dormant Experts: Counterfactual Routing to Mitigate MoE Hallucinations
by: Hu, Wentao, et al.
Published: (2026)

SHADE-Arena: Evaluating Sabotage and Monitoring in LLM Agents
by: Kutasov, Jonathan, et al.
Published: (2025)

QSIM: Mitigating Overestimation in Multi-Agent Reinforcement Learning via Action Similarity Weighted Q-Learning
by: Li, Yuanjun, et al.
Published: (2026)

Toward Efficient Exploration by Large Language Model Agents
by: Arumugam, Dilip, et al.
Published: (2025)

Regret Bounds and Reinforcement Learning Exploration of EXP-based Algorithms
by: Xu, Mengfan, et al.
Published: (2020)

How Much Thinking is Enough? Quantifying and Understanding Redundancy in LLM Reasoning
by: Zhai, Zhiyuan, et al.
Published: (2026)

APEX: Autonomous Policy Exploration for Self-Evolving LLM Agents
by: Li, Yibo, et al.
Published: (2026)

Label-free Monitoring of Self-Supervised Learning Progress
by: Xu, Isaac, et al.
Published: (2024)

Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
by: Zhou, Yang, et al.
Published: (2025)

Provably Efficient Exploration in Inverse Constrained Reinforcement Learning
by: Yue, Bo, et al.
Published: (2024)

Laplacian Score Sharpening for Mitigating Hallucination in Diffusion Models
by: C, Barath Chandran., et al.
Published: (2025)

SIRI: Self-Internalizing Reinforcement Learning with Intrinsic Skills for LLM Agent Training
by: He, Zhongyu, et al.
Published: (2026)

Identify, Isolate, and Purge: Mitigating Hallucinations in LVLMs via Self-Evolving Distillation
by: Li, Wenhao, et al.
Published: (2025)

A Unified Virtual Mixture-of-Experts Framework: Enhanced Inference and Hallucination Mitigation in Single-Model System
by: Liu, Mingyan
Published: (2025)

Structured Progressive Knowledge Activation for LLM-Driven Neural Architecture Search
by: Liu, Zhen, et al.
Published: (2026)

Mitigating spectral bias for the multiscale operator learning
by: Liu, Xinliang, et al.
Published: (2022)

ML-Master: Towards AI-for-AI via Integration of Exploration and Reasoning
by: Liu, Zexi, et al.
Published: (2025)

Beyond the Dirac Delta: Mitigating Diversity Collapse in Reinforcement Fine-Tuning for Versatile Image Generation
by: Liu, Jinmei, et al.
Published: (2026)

Mitigating Hallucinated Translations in Large Language Models with Hallucination-focused Preference Optimization
by: Tang, Zilu, et al.
Published: (2025)

Anomaly Detection and Early Warning Mechanism for Intelligent Monitoring Systems in Multi-Cloud Environments Based on LLM
by: Jin, Yihong, et al.
Published: (2025)

Adaptive-Boundary-Clipping GRPO: Ensuring Bounded Ratios for Stable and Generalizable Training
by: Liu, Chi, et al.
Published: (2026)

Towards Understanding Text Hallucination of Diffusion Models via Local Generation Bias
by: Lu, Rui, et al.
Published: (2025)

ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering
by: Liu, Zexi, et al.
Published: (2025)

Mitigating Geospatial Knowledge Hallucination in Large Language Models: Benchmarking and Dynamic Factuality Aligning
by: Wang, Shengyuan, et al.
Published: (2025)