Saved in:
| Main Authors: | Chen, Boqi, Liu, Xudong, Ao, Yunke, Qiu, Jianing |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.23443 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Mask What Matters: Mitigating Object Hallucinations in Multimodal Large Language Models with Object-Aligned Visual Contrastive Decoding
by: Chen, Boqi, et al.
Published: (2026)
by: Chen, Boqi, et al.
Published: (2026)
MEDSYN: Benchmarking Multi-EviDence SYNthesis in Complex Clinical Cases for Multimodal Large Language Models
by: Chen, Boqi, et al.
Published: (2026)
by: Chen, Boqi, et al.
Published: (2026)
GCoT-Decoding: Unlocking Deep Reasoning Paths for Universal Question Answering
by: Luo, Guanran, et al.
Published: (2026)
by: Luo, Guanran, et al.
Published: (2026)
Question Calibration and Multi-Hop Modeling for Temporal Question Answering
by: Xue, Chao, et al.
Published: (2024)
by: Xue, Chao, et al.
Published: (2024)
On the Calibration of Multilingual Question Answering LLMs
by: Yang, Yahan, et al.
Published: (2023)
by: Yang, Yahan, et al.
Published: (2023)
Code-Style In-Context Learning for Knowledge-Based Question Answering
by: Nie, Zhijie, et al.
Published: (2023)
by: Nie, Zhijie, et al.
Published: (2023)
NovelQA: Benchmarking Question Answering on Documents Exceeding 200K Tokens
by: Wang, Cunxiang, et al.
Published: (2024)
by: Wang, Cunxiang, et al.
Published: (2024)
Selectively Answering Visual Questions
by: Eisenschlos, Julian Martin, et al.
Published: (2024)
by: Eisenschlos, Julian Martin, et al.
Published: (2024)
Relevance-aware Multi-context Contrastive Decoding for Retrieval-augmented Visual Question Answering
by: Kim, Jongha, et al.
Published: (2026)
by: Kim, Jongha, et al.
Published: (2026)
An Entity Linking Agent for Question Answering
by: Luo, Yajie, et al.
Published: (2025)
by: Luo, Yajie, et al.
Published: (2025)
Calibrated Large Language Models for Binary Question Answering
by: Giovannotti, Patrizio, et al.
Published: (2024)
by: Giovannotti, Patrizio, et al.
Published: (2024)
Efficient Multimodal Planning Agent for Visual Question-Answering
by: Chen, Zhuo, et al.
Published: (2026)
by: Chen, Zhuo, et al.
Published: (2026)
AIM: Asymmetric Information Masking for Visual Question Answering Continual Learning
by: Zhang, Peifeng, et al.
Published: (2026)
by: Zhang, Peifeng, et al.
Published: (2026)
EVJVQA Challenge: Multilingual Visual Question Answering
by: Nguyen, Ngan Luu-Thuy, et al.
Published: (2023)
by: Nguyen, Ngan Luu-Thuy, et al.
Published: (2023)
QLSC: A Query Latent Semantic Calibrator for Robust Extractive Question Answering
by: Ouyang, Sheng, et al.
Published: (2024)
by: Ouyang, Sheng, et al.
Published: (2024)
Calibrated Confidence Estimation for Tabular Question Answering
by: Voss, Lukas
Published: (2026)
by: Voss, Lukas
Published: (2026)
Beyond Static: Related Questions Retrieval Through Conversations in Community Question Answering
by: Ao, Xiao, et al.
Published: (2026)
by: Ao, Xiao, et al.
Published: (2026)
Knowledge-Based Counterfactual Queries for Visual Question Answering
by: Stoikou, Theodoti, et al.
Published: (2023)
by: Stoikou, Theodoti, et al.
Published: (2023)
LIVE: Learnable In-Context Vector for Visual Question Answering
by: Peng, Yingzhe, et al.
Published: (2024)
by: Peng, Yingzhe, et al.
Published: (2024)
DARE: Diverse Visual Question Answering with Robustness Evaluation
by: Sterz, Hannah, et al.
Published: (2024)
by: Sterz, Hannah, et al.
Published: (2024)
Rationale-guided Prompting for Knowledge-based Visual Question Answering
by: Hu, Zhongjian, et al.
Published: (2024)
by: Hu, Zhongjian, et al.
Published: (2024)
Token Constraint Decoding Improves Robustness on Question Answering for Large Language Models
by: Yao, Jui-Ming, et al.
Published: (2025)
by: Yao, Jui-Ming, et al.
Published: (2025)
Memory-Augmented Knowledge Fusion with Safety-Aware Decoding for Domain-Adaptive Question Answering
by: Fu, Lei, et al.
Published: (2025)
by: Fu, Lei, et al.
Published: (2025)
A Gaze-grounded Visual Question Answering Dataset for Clarifying Ambiguous Japanese Questions
by: Inadumi, Shun, et al.
Published: (2024)
by: Inadumi, Shun, et al.
Published: (2024)
Compositional Consistency-Guided Decoding for Three-Way Logical Question Answering
by: Huang, Tianyi, et al.
Published: (2026)
by: Huang, Tianyi, et al.
Published: (2026)
Hallucination Benchmark in Medical Visual Question Answering
by: Wu, Jinge, et al.
Published: (2024)
by: Wu, Jinge, et al.
Published: (2024)
Computed Tomography Visual Question Answering with Cross-modal Feature Graphing
by: Tian, Yuanhe, et al.
Published: (2025)
by: Tian, Yuanhe, et al.
Published: (2025)
Rethinking Information Synthesis in Multimodal Question Answering A Multi-Agent Perspective
by: Rajput, Krishna Singh, et al.
Published: (2025)
by: Rajput, Krishna Singh, et al.
Published: (2025)
A Comprehensive Evaluation of GPT-4V on Knowledge-Intensive Visual Question Answering
by: Li, Yunxin, et al.
Published: (2023)
by: Li, Yunxin, et al.
Published: (2023)
An Analysis of Decoding Methods for LLM-based Agents for Faithful Multi-Hop Question Answering
by: Murphy, Alexander, et al.
Published: (2025)
by: Murphy, Alexander, et al.
Published: (2025)
Context Quality Matters in Training Fusion-in-Decoder for Extractive Open-Domain Question Answering
by: Akimoto, Kosuke, et al.
Published: (2024)
by: Akimoto, Kosuke, et al.
Published: (2024)
OWLViz: An Open-World Benchmark for Visual Question Answering
by: Nguyen, Thuy, et al.
Published: (2025)
by: Nguyen, Thuy, et al.
Published: (2025)
Multimodal Reranking for Knowledge-Intensive Visual Question Answering
by: Wen, Haoyang, et al.
Published: (2024)
by: Wen, Haoyang, et al.
Published: (2024)
Multimodal Commonsense Knowledge Distillation for Visual Question Answering
by: Yang, Shuo, et al.
Published: (2024)
by: Yang, Shuo, et al.
Published: (2024)
Benchmarking Uncertainty Calibration in Large Language Model Long-Form Question Answering
by: Müller, Philip, et al.
Published: (2026)
by: Müller, Philip, et al.
Published: (2026)
Knowing or Guessing? Robust Medical Visual Question Answering via Joint Consistency and Contrastive Learning
by: Jiang, Songtao, et al.
Published: (2025)
by: Jiang, Songtao, et al.
Published: (2025)
A Semantic-Sampling Framework for Evaluating Calibration in Open-Ended Question Answering
by: Wang, Zhanliang, et al.
Published: (2026)
by: Wang, Zhanliang, et al.
Published: (2026)
BERAG: Bayesian Ensemble Retrieval-Augmented Generation for Knowledge-based Visual Question Answering
by: Chen, Jinghong, et al.
Published: (2026)
by: Chen, Jinghong, et al.
Published: (2026)
Design as Desired: Utilizing Visual Question Answering for Multimodal Pre-training
by: Su, Tongkun, et al.
Published: (2024)
by: Su, Tongkun, et al.
Published: (2024)
GeReA: Question-Aware Prompt Captions for Knowledge-based Visual Question Answering
by: Ma, Ziyu, et al.
Published: (2024)
by: Ma, Ziyu, et al.
Published: (2024)
Similar Items
-
Mask What Matters: Mitigating Object Hallucinations in Multimodal Large Language Models with Object-Aligned Visual Contrastive Decoding
by: Chen, Boqi, et al.
Published: (2026) -
MEDSYN: Benchmarking Multi-EviDence SYNthesis in Complex Clinical Cases for Multimodal Large Language Models
by: Chen, Boqi, et al.
Published: (2026) -
GCoT-Decoding: Unlocking Deep Reasoning Paths for Universal Question Answering
by: Luo, Guanran, et al.
Published: (2026) -
Question Calibration and Multi-Hop Modeling for Temporal Question Answering
by: Xue, Chao, et al.
Published: (2024) -
On the Calibration of Multilingual Question Answering LLMs
by: Yang, Yahan, et al.
Published: (2023)