:: Library Catalog

Bibliographic Details
Main Authors:	Morbiato, Filippo; Romano, Luca; Persona, Alessandro
Format:	Preprint
Published:	2025
Subjects:	Computation and Language; Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2511.10671

Similar Items

LLM-Bootstrapped Targeted Finding Guidance for Factual MLLM-based Medical Report Generation
by: Yang, Cunyuan, et al.
Published: (2026)

Can VLMs Recall Factual Associations From Visual References?
by: Ashok, Dhananjay, et al.
Published: (2025)

Towards Statistical Factuality Guarantee for Large Vision-Language Models
by: Li, Zhuohang, et al.
Published: (2025)

VC-Inspector: Advancing Reference-free Evaluation of Video Captions with Factual Analysis
by: Dipta, Shubhashis Roy, et al.
Published: (2025)

"See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models
by: Gu, Jihao, et al.
Published: (2025)

LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts
by: Wang, Zhuhao, et al.
Published: (2024)

KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
by: Ren, Baochang, et al.
Published: (2025)

RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
by: Xia, Peng, et al.
Published: (2024)

T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts
by: Huang, Ziwei, et al.
Published: (2024)

The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention
by: Wan, Yixin, et al.
Published: (2024)

OraPO: Oracle-educated Reinforcement Learning for Data-efficient and Factual Radiology Report Generation
by: Chen, Zhuoxiao, et al.
Published: (2025)

FAGER: Factually Grounded Evaluation and Refinement of Text-to-Image Models
by: Lim, Youngsun, et al.
Published: (2026)

Factuality Matters: When Image Generation and Editing Meet Structured Visuals
by: Zhuo, Le, et al.
Published: (2025)

The Curious Case of Factuality Finetuning: Models' Internal Beliefs Can Improve Factuality
by: Newman, Benjamin, et al.
Published: (2025)

Revolutionizing Radiology Workflow with Factual and Efficient CXR Report Generation
by: Sukjai, Pimchanok, et al.
Published: (2025)

OVFact: Measuring and Improving Open-Vocabulary Factuality for Long Caption Models
by: Wysoczańska, Monika, et al.
Published: (2025)

Understanding Finetuning for Factual Knowledge Extraction
by: Ghosal, Gaurav, et al.
Published: (2024)

Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models
by: Cao, Meng, et al.
Published: (2025)

AFTER: Mitigating the Object Hallucination of LVLM via Adaptive Factual-Guided Activation Editing
by: Wang, Tianbo, et al.
Published: (2026)

FACE-net: Factual Calibration and Emotion Augmentation for Retrieval-enhanced Emotional Video Captioning
by: Chen, Weidong, et al.
Published: (2026)

MedFact-R1: Towards Factual Medical Reasoning via Pseudo-Label Augmentation
by: Li, Gengliang, et al.
Published: (2025)

Beyond Detection: Multi-Scale Hidden-Code for Natural Image Deepfake Recovery and Factual Retrieval
by: Chen, Yuan-Chih, et al.
Published: (2026)

Similarity over Factuality: Are we making progress on multimodal out-of-context misinformation detection?
by: Papadopoulos, Stefanos-Iordanis, et al.
Published: (2024)

INFACT: A Diagnostic Benchmark for Induced Faithfulness and Factuality Hallucinations in Video-LLMs
by: Yang, Junqi, et al.
Published: (2026)

Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval
by: Lim, Youngsun, et al.
Published: (2024)

Open Multimodal Retrieval-Augmented Factual Image Generation
by: Tian, Yang, et al.
Published: (2025)

Multi-Modal Hallucination Control by Visual Information Grounding
by: Favero, Alessandro, et al.
Published: (2024)

Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage
by: Lee, Saehyung, et al.
Published: (2024)

Beyond Generation: Multi-Hop Reasoning for Factual Accuracy in Vision-Language Models
by: Hossain, Shamima
Published: (2025)

Visual Puns from Idioms: An Iterative LLM-T2IM-MLLM Framework
by: Xiao, Kelaiti, et al.
Published: (2025)

Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey
by: Kuang, Jiayi, et al.
Published: (2024)

Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations
by: Yang, Ziyan, et al.
Published: (2022)

GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images
by: Lan, Xiang, et al.
Published: (2025)

Using Similarity to Evaluate Factual Consistency in Summaries
by: Ye, Yuxuan, et al.
Published: (2024)

Watermarking for Factuality: Guiding Vision-Language Models Toward Truth via Tri-layer Contrastive Decoding
by: Back, Kyungryul, et al.
Published: (2025)

AnchorOPT: Towards Optimizing Dynamic Anchors for Adaptive Prompt Learning
by: Li, Zheng, et al.
Published: (2025)

HKD4VLM: A Progressive Hybrid Knowledge Distillation Framework for Robust Multimodal Hallucination and Factuality Detection in VLMs
by: Zhang, Zijian, et al.
Published: (2025)

Factual Serialization Enhancement: A Key Innovation for Chest X-ray Report Generation
by: Liu, Kang, et al.
Published: (2024)

Learning to Reason for Factuality
by: Chen, Xilun, et al.
Published: (2025)

Factual Consistency of Multilingual Pretrained Language Models
by: Fierro, Constanza, et al.
Published: (2022)