| Main Authors: | Morbiato, Filippo, Romano, Luca, Persona, Alessandro |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.10671 |
Similar Items
LLM-Bootstrapped Targeted Finding Guidance for Factual MLLM-based Medical Report Generation
by: Yang, Cunyuan, et al.
Published: (2026)
Can VLMs Recall Factual Associations From Visual References?
by: Ashok, Dhananjay, et al.
Published: (2025)
Towards Statistical Factuality Guarantee for Large Vision-Language Models
by: Li, Zhuohang, et al.
Published: (2025)
VC-Inspector: Advancing Reference-free Evaluation of Video Captions with Factual Analysis
by: Dipta, Shubhashis Roy, et al.
Published: (2025)
"See the World, Discover Knowledge": A Chinese Factuality Evaluation for Large Vision Language Models
by: Gu, Jihao, et al.
Published: (2025)
LLM-RG4: Flexible and Factual Radiology Report Generation across Diverse Input Contexts
by: Wang, Zhuhao, et al.
Published: (2024)
KnowRL: Exploring Knowledgeable Reinforcement Learning for Factuality
by: Ren, Baochang, et al.
Published: (2025)
RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
by: Xia, Peng, et al.
Published: (2024)
T2I-FactualBench: Benchmarking the Factuality of Text-to-Image Models with Knowledge-Intensive Concepts
by: Huang, Ziwei, et al.
Published: (2024)
The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention
by: Wan, Yixin, et al.
Published: (2024)
OraPO: Oracle-educated Reinforcement Learning for Data-efficient and Factual Radiology Report Generation
by: Chen, Zhuoxiao, et al.
Published: (2025)
FAGER: Factually Grounded Evaluation and Refinement of Text-to-Image Models
by: Lim, Youngsun, et al.
Published: (2026)
Factuality Matters: When Image Generation and Editing Meet Structured Visuals
by: Zhuo, Le, et al.
Published: (2025)
The Curious Case of Factuality Finetuning: Models' Internal Beliefs Can Improve Factuality
by: Newman, Benjamin, et al.
Published: (2025)
Revolutionizing Radiology Workflow with Factual and Efficient CXR Report Generation
by: Sukjai, Pimchanok, et al.
Published: (2025)
OVFact: Measuring and Improving Open-Vocabulary Factuality for Long Caption Models
by: Wysoczańska, Monika, et al.
Published: (2025)
Understanding Finetuning for Factual Knowledge Extraction
by: Ghosal, Gaurav, et al.
Published: (2024)
Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models
by: Cao, Meng, et al.
Published: (2025)
AFTER: Mitigating the Object Hallucination of LVLM via Adaptive Factual-Guided Activation Editing
by: Wang, Tianbo, et al.
Published: (2026)
FACE-net: Factual Calibration and Emotion Augmentation for Retrieval-enhanced Emotional Video Captioning
by: Chen, Weidong, et al.
Published: (2026)
MedFact-R1: Towards Factual Medical Reasoning via Pseudo-Label Augmentation
by: Li, Gengliang, et al.
Published: (2025)
Beyond Detection: Multi-Scale Hidden-Code for Natural Image Deepfake Recovery and Factual Retrieval
by: Chen, Yuan-Chih, et al.
Published: (2026)
Similarity over Factuality: Are we making progress on multimodal out-of-context misinformation detection?
by: Papadopoulos, Stefanos-Iordanis, et al.
Published: (2024)
INFACT: A Diagnostic Benchmark for Induced Faithfulness and Factuality Hallucinations in Video-LLMs
by: Yang, Junqi, et al.
Published: (2026)
Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval
by: Lim, Youngsun, et al.
Published: (2024)
Open Multimodal Retrieval-Augmented Factual Image Generation
by: Tian, Yang, et al.
Published: (2025)
Multi-Modal Hallucination Control by Visual Information Grounding
by: Favero, Alessandro, et al.
Published: (2024)
Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage
by: Lee, Saehyung, et al.
Published: (2024)
Beyond Generation: Multi-Hop Reasoning for Factual Accuracy in Vision-Language Models
by: Hossain, Shamima
Published: (2025)
Visual Puns from Idioms: An Iterative LLM-T2IM-MLLM Framework
by: Xiao, Kelaiti, et al.
Published: (2025)
Natural Language Understanding and Inference with MLLM in Visual Question Answering: A Survey
by: Kuang, Jiayi, et al.
Published: (2024)
Improving Visual Grounding by Encouraging Consistent Gradient-based Explanations
by: Yang, Ziyan, et al.
Published: (2022)
GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images
by: Lan, Xiang, et al.
Published: (2025)
Using Similarity to Evaluate Factual Consistency in Summaries
by: Ye, Yuxuan, et al.
Published: (2024)
Watermarking for Factuality: Guiding Vision-Language Models Toward Truth via Tri-layer Contrastive Decoding
by: Back, Kyungryul, et al.
Published: (2025)
AnchorOPT: Towards Optimizing Dynamic Anchors for Adaptive Prompt Learning
by: Li, Zheng, et al.
Published: (2025)
HKD4VLM: A Progressive Hybrid Knowledge Distillation Framework for Robust Multimodal Hallucination and Factuality Detection in VLMs
by: Zhang, Zijian, et al.
Published: (2025)
Factual Serialization Enhancement: A Key Innovation for Chest X-ray Report Generation
by: Liu, Kang, et al.
Published: (2024)
Learning to Reason for Factuality
by: Chen, Xilun, et al.
Published: (2025)
Factual Consistency of Multilingual Pretrained Language Models
by: Fierro, Constanza, et al.
Published: (2022)