:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	He, Lehan, Chen, Zeren, Shi, Zhelun, Yu, Tianyu, Shao, Jing, Sheng, Lu
Format:	Preprint
Veröffentlicht:	2024
Schlagworte:	Computation and Language Computer Vision and Pattern Recognition
Online-Zugang:	https://arxiv.org/abs/2411.17265
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

RH20T-P: A Primitive-Level Robotic Dataset Towards Composable Generalization Agents
von: Chen, Zeren, et al.
Veröffentlicht: (2024)

Octavius: Mitigating Task Interference in MLLMs via LoRA-MoE
von: Chen, Zeren, et al.
Veröffentlicht: (2023)

IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks
von: Lu, Xiaoya, et al.
Veröffentlicht: (2025)

Detecting and Mitigating Hallucination in Large Vision Language Models via Fine-Grained AI Feedback
von: Xiao, Wenyi, et al.
Veröffentlicht: (2024)

Geometrically-Constrained Agent for Spatial Reasoning
von: Chen, Zeren, et al.
Veröffentlicht: (2025)

Learning from Fine-Grained Visual Discrepancies: Mitigating Multimodal Hallucinations via In-Context Visual Contrastive Optimization
von: Deng, Haolin, et al.
Veröffentlicht: (2026)

HomeGuard: VLM-based Embodied Safeguard for Identifying Contextual Risk in Household Task
von: Lu, Xiaoya, et al.
Veröffentlicht: (2026)

Mitigating Hallucinations in Large Vision-Language Models by Self-Injecting Hallucinations
von: Lu, Yifan, et al.
Veröffentlicht: (2025)

Mitigating Object Hallucination via Concentric Causal Attention
von: Xing, Yun, et al.
Veröffentlicht: (2024)

Mitigating Multimodal Hallucination via Phase-wise Self-reward
von: Zhang, Yu, et al.
Veröffentlicht: (2026)

OViP: Online Vision-Language Preference Learning for VLM Hallucination
von: Liu, Shujun, et al.
Veröffentlicht: (2025)

A Unified Hallucination Mitigation Framework for Large Vision-Language Models
von: Chang, Yue, et al.
Veröffentlicht: (2024)

Mitigating Object Hallucination via Robust Local Perception Search
von: Gao, Zixian, et al.
Veröffentlicht: (2025)

VLLaVO: Mitigating Visual Gap through LLMs
von: Chen, Shuhao, et al.
Veröffentlicht: (2024)

Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
von: Zhao, Zhiyuan, et al.
Veröffentlicht: (2023)

EnsemHalDet: Robust VLM Hallucination Detection via Ensemble of Internal State Detectors
von: Miyazato, Ryuhei, et al.
Veröffentlicht: (2026)

Mitigating Hallucinations in Large Vision-Language Models via Entity-Centric Multimodal Preference Optimization
von: Wu, Jiulong, et al.
Veröffentlicht: (2025)

Generalizable Entity Grounding via Assistance of Large Language Model
von: Qi, Lu, et al.
Veröffentlicht: (2024)

Instruction-Aligned Visual Attention for Mitigating Hallucinations in Large Vision-Language Models
von: Li, Bin, et al.
Veröffentlicht: (2025)

URPO: A Unified Reward & Policy Optimization Framework for Large Language Models
von: Lu, Songshuo, et al.
Veröffentlicht: (2025)

Mitigating Hallucinations in Multimodal Spatial Relations through Constraint-Aware Prompting
von: Wu, Jiarui, et al.
Veröffentlicht: (2025)

Mitigating Hallucination in Multimodal Large Language Model via Hallucination-targeted Direct Preference Optimization
von: Fu, Yuhan, et al.
Veröffentlicht: (2024)

Mitigating Multilingual Hallucination in Large Vision-Language Models
von: Qu, Xiaoye, et al.
Veröffentlicht: (2024)

Analyzing and Mitigating Object Hallucination: A Training Bias Perspective
von: Li, Yifan, et al.
Veröffentlicht: (2025)

Mitigating Multimodal Hallucinations via Gradient-based Self-Reflection
von: Wang, Shan, et al.
Veröffentlicht: (2025)

Watch Closely: Mitigating Object Hallucinations in Large Vision-Language Models with Disentangled Decoding
von: Ma, Ruiqi, et al.
Veröffentlicht: (2025)

Token Preference Optimization with Self-Calibrated Visual-Anchored Rewards for Hallucination Mitigation
von: Gu, Jihao, et al.
Veröffentlicht: (2024)

Do Vision Encoders Truly Explain Object Hallucination?: Mitigating Object Hallucination via Simple Fine-Grained CLIPScore
von: Oh, Hongseok, et al.
Veröffentlicht: (2025)

Praxis-VLM: Vision-Grounded Decision Making via Text-Driven Reinforcement Learning
von: Hu, Zhe, et al.
Veröffentlicht: (2025)

Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective
von: Yue, Zihao, et al.
Veröffentlicht: (2024)

ESREAL: Exploiting Semantic Reconstruction to Mitigate Hallucinations in Vision-Language Models
von: Kim, Minchan, et al.
Veröffentlicht: (2024)

Volcano: Mitigating Multimodal Hallucination through Self-Feedback Guided Revision
von: Lee, Seongyun, et al.
Veröffentlicht: (2023)

VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs
von: Liu, Peng, et al.
Veröffentlicht: (2025)

StreamingVLM: Real-Time Understanding for Infinite Video Streams
von: Xu, Ruyi, et al.
Veröffentlicht: (2025)

Reassessing the Role of Supervised Fine-Tuning: An Empirical Study in VLM Reasoning
von: Yu, Yongcan, et al.
Veröffentlicht: (2025)

Mitigating Hallucinations in Multimodal LLMs via Object-aware Preference Optimization
von: Compagnoni, Alberto, et al.
Veröffentlicht: (2025)

EFUF: Efficient Fine-grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models
von: Xing, Shangyu, et al.
Veröffentlicht: (2024)

Mitigating Object Hallucinations in MLLMs via Multi-Frequency Perturbations
von: Li, Shuo, et al.
Veröffentlicht: (2025)

Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
von: An, Wenbin, et al.
Veröffentlicht: (2024)

Modality Bias in LVLMs: Analyzing and Mitigating Object Hallucination via Attention Lens
von: Zheng, Haohan, et al.
Veröffentlicht: (2025)