:: Library Catalog

Copertina

Salvato in:

Dettagli Bibliografici
Autori principali:	Yang, Dingchen, Cao, Bowen, Chen, Guang, Jiang, Changjun
Natura:	Preprint
Pubblicazione:	2024
Soggetti:	Computer Vision and Pattern Recognition
Accesso online:	https://arxiv.org/abs/2403.14401
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

Documenti analoghi

Beyond Intermediate States: Explaining Visual Redundancy through Language
di: Yang, Dingchen, et al.
Pubblicazione: (2025)

Global Context or Local Detail? Adaptive Visual Grounding for Hallucination Mitigation
di: Jiang, Yubo, et al.
Pubblicazione: (2026)

LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis
di: Zheng, Zehan, et al.
Pubblicazione: (2024)

Visual Attention Drifts,but Anchors Hold:Mitigating Hallucination in Multimodal Large Language Models via Cross-Layer Visual Anchors
di: Yang, Chengxu, et al.
Pubblicazione: (2026)

Token Preference Optimization with Self-Calibrated Visual-Anchored Rewards for Hallucination Mitigation
di: Gu, Jihao, et al.
Pubblicazione: (2024)

Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos
di: Wang, Ruoyu, et al.
Pubblicazione: (2025)

Mitigating Hallucination in Visual-Language Models via Re-Balancing Contrastive Decoding
di: Liang, Xiaoyu, et al.
Pubblicazione: (2024)

Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow
di: Yu, Xinlei, et al.
Pubblicazione: (2025)

Mitigating Object Hallucinations via Sentence-Level Early Intervention
di: Peng, Shangpin, et al.
Pubblicazione: (2025)

Mitigating Low-Level Visual Hallucinations Requires Self-Awareness: Database, Model and Training Strategy
di: Sun, Yinan, et al.
Pubblicazione: (2025)

Poison as Cure: Visual Noise for Mitigating Object Hallucinations in LVMs
di: Zhang, Kejia, et al.
Pubblicazione: (2025)

RCDN: Towards Robust Camera-Insensitivity Collaborative Perception via Dynamic Feature-based 3D Neural Modeling
di: Wang, Tianhang, et al.
Pubblicazione: (2024)

VORD: Visual Ordinal Calibration for Mitigating Object Hallucinations in Large Vision-Language Models
di: Neo, Dexter, et al.
Pubblicazione: (2024)

SAVER: Mitigating Hallucinations in Large Vision-Language Models via Style-Aware Visual Early Revision
di: Li, Zhaoxu, et al.
Pubblicazione: (2025)

VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification
di: Zhuang, Xianwei, et al.
Pubblicazione: (2025)

See Different, Think Better: Visual Variations Mitigating Hallucinations in LVLMs
di: Dai, Ziyun, et al.
Pubblicazione: (2025)

Urban Architect: Steerable 3D Urban Scene Generation with Layout Prior
di: Lu, Fan, et al.
Pubblicazione: (2024)

Instruction-Aligned Visual Attention for Mitigating Hallucinations in Large Vision-Language Models
di: Li, Bin, et al.
Pubblicazione: (2025)

Extracting Visual Facts from Intermediate Layers for Mitigating Hallucinations in Multimodal Large Language Models
di: Zhou, Haoran, et al.
Pubblicazione: (2025)

Mitigating Hallucination in Large Vision-Language Models via Adaptive Attention Calibration
di: Fazli, Mehrdad, et al.
Pubblicazione: (2025)

GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields
di: Xue, Weiyi, et al.
Pubblicazione: (2024)

Learning from Fine-Grained Visual Discrepancies: Mitigating Multimodal Hallucinations via In-Context Visual Contrastive Optimization
di: Deng, Haolin, et al.
Pubblicazione: (2026)

CAST: Mitigating Object Hallucination in Large Vision-Language Models via Caption-Guided Visual Attention Steering
di: Li, Qiming, et al.
Pubblicazione: (2026)

MM-Snowball: Evaluating and Mitigating Hallucination Snowballing in Multimodal Multi-Turn Dialogue
di: Jiang, Yue, et al.
Pubblicazione: (2026)

Revealing and Enhancing Core Visual Regions: Harnessing Internal Attention Dynamics for Hallucination Mitigation in LVLMs
di: Lyu, Guangtao, et al.
Pubblicazione: (2026)

InEx: Hallucination Mitigation via Introspection and Cross-Modal Multi-Agent Collaboration
di: Yang, Zhongyu, et al.
Pubblicazione: (2025)

HGL: Hierarchical Geometry Learning for Test-time Adaptation in 3D Point Cloud Segmentation
di: Zou, Tianpei, et al.
Pubblicazione: (2024)

Mitigating Action-Relation Hallucinations in LVLMs via Relation-aware Visual Enhancement
di: Qin, Zhenxin, et al.
Pubblicazione: (2026)

Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models
di: Zou, Xin, et al.
Pubblicazione: (2024)

MIHBench: Benchmarking and Mitigating Multi-Image Hallucinations in Multimodal Large Language Models
di: Li, Jiale, et al.
Pubblicazione: (2025)

KVSmooth: Mitigating Hallucination in Multi-modal Large Language Models through Key-Value Smoothing
di: Jiang, Siyu, et al.
Pubblicazione: (2026)

Look Carefully: Adaptive Visual Reinforcements in Multimodal Large Language Models for Hallucination Mitigation
di: Zhu, Xingyu, et al.
Pubblicazione: (2026)

AVCD: Mitigating Hallucinations in Audio-Visual Large Language Models through Contrastive Decoding
di: Jung, Chaeyoung, et al.
Pubblicazione: (2025)

SAVAA: Mitigating Hallucinations in LVLMs via Step-wise Adaptive Visual Attention Amplification
di: Zhang, Jiacheng, et al.
Pubblicazione: (2026)

Mitigating Multimodal Hallucination via Phase-wise Self-reward
di: Zhang, Yu, et al.
Pubblicazione: (2026)

Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Attention Lens
di: Jiang, Zhangqi, et al.
Pubblicazione: (2024)

Hallucination Score: Towards Mitigating Hallucinations in Generative Image Super-Resolution
di: Ren, Weiming, et al.
Pubblicazione: (2025)

One-shot Optimized Steering Vector for Hallucination Mitigation for VLMs
di: Shi, Youxu, et al.
Pubblicazione: (2026)

Delve into Visual Contrastive Decoding for Hallucination Mitigation of Large Vision-Language Models
di: Lee, Yi-Lun, et al.
Pubblicazione: (2024)

Thinking Before Looking: Improving Multimodal LLM Reasoning via Mitigating Visual Hallucination
di: Zheng, Haojie, et al.
Pubblicazione: (2024)