Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Liu, Qihao, Mao, Chengzhi, Liu, Yaojie, Yuille, Alan, Chu, Wen-Sheng
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition Artificial Intelligence
Online Access:	https://arxiv.org/abs/2512.16921
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917154969878528
author	Liu, Qihao Mao, Chengzhi Liu, Yaojie Yuille, Alan Chu, Wen-Sheng
author_facet	Liu, Qihao Mao, Chengzhi Liu, Yaojie Yuille, Alan Chu, Wen-Sheng
contents	Conventional evaluation methods for multimodal LLMs (MLLMs) lack interpretability and are often insufficient to fully disclose significant capability gaps across models. To address this, we introduce AuditDM, an automated framework that actively discovers and rectifies MLLM failure modes by auditing their divergence. AuditDM fine-tunes an MLLM as an auditor via reinforcement learning to generate challenging questions and counterfactual images that maximize disagreement among target models. Once trained, the auditor uncovers diverse, interpretable exemplars that reveal model weaknesses and serve as annotation-free data for rectification. When applied to SoTA models like Gemma-3 and PaliGemma-2, AuditDM discovers more than 20 distinct failure types. Fine-tuning on these discoveries consistently improves all models across 16 benchmarks, and enables a 3B model to surpass its 28B counterpart. Our results suggest that as data scaling hits diminishing returns, targeted model auditing offers an effective path to model diagnosis and improvement.
format	Preprint
id	arxiv_https___arxiv_org_abs_2512_16921
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification Liu, Qihao Mao, Chengzhi Liu, Yaojie Yuille, Alan Chu, Wen-Sheng Computer Vision and Pattern Recognition Artificial Intelligence Conventional evaluation methods for multimodal LLMs (MLLMs) lack interpretability and are often insufficient to fully disclose significant capability gaps across models. To address this, we introduce AuditDM, an automated framework that actively discovers and rectifies MLLM failure modes by auditing their divergence. AuditDM fine-tunes an MLLM as an auditor via reinforcement learning to generate challenging questions and counterfactual images that maximize disagreement among target models. Once trained, the auditor uncovers diverse, interpretable exemplars that reveal model weaknesses and serve as annotation-free data for rectification. When applied to SoTA models like Gemma-3 and PaliGemma-2, AuditDM discovers more than 20 distinct failure types. Fine-tuning on these discoveries consistently improves all models across 16 benchmarks, and enables a 3B model to surpass its 28B counterpart. Our results suggest that as data scaling hits diminishing returns, targeted model auditing offers an effective path to model diagnosis and improvement.
title	Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification
topic	Computer Vision and Pattern Recognition Artificial Intelligence
url	https://arxiv.org/abs/2512.16921

Similar Items