| Main Authors: | Sui, Xiangjie, Li, Songyang, Zhu, Hanwei, Chen, Baoliang, Fang, Yuming, Sun, Xin |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.19032 |
Similar Items
2AFC Prompting of Large Multimodal Models for Image Quality Assessment
by: Zhu, Hanwei, et al.
Published: (2024)
Mitigating Perception Bias: A Training-Free Approach to Enhance LMM for Image Quality Assessment
by: Chen, Baoliang, et al.
Published: (2024)
MVI-Bench: A Comprehensive Benchmark for Evaluating Robustness to Misleading Visual Inputs in LVLMs
by: Chen, Huiyi, et al.
Published: (2025)
Q-Doc: Benchmarking Document Image Quality Assessment Capabilities in Multi-modal Large Language Models
by: Huang, Jiaxi, et al.
Published: (2025)
Benchmarking the Robustness of UAV Tracking Against Common Corruptions
by: Liu, Xiaoqiong, et al.
Published: (2024)
From Global to Granular: Revealing IQA Model Performance via Correlation Surface
by: Chen, Baoliang, et al.
Published: (2026)
EduVQA: Towards Concept-Aware Assessment of Educational AI-Generated Videos
by: Chen, Baoliang, et al.
Published: (2026)
Deep Feature Statistics Mapping for Generalized Screen Content Image Quality Assessment
by: Chen, Baoliang, et al.
Published: (2022)
Benchmarking the Robustness of Temporal Action Detection Models Against Temporal Corruptions
by: Zeng, Runhao, et al.
Published: (2024)
Benchmarking the Robustness of Optical Flow Estimation to Corruptions
by: Yi, Zhonghua, et al.
Published: (2024)
RobustSpring: Benchmarking Robustness to Image Corruptions for Optical Flow, Scene Flow and Stereo
by: Oei, Victor, et al.
Published: (2025)
Adaptive Image Quality Assessment via Teaching Large Multimodal Model to Compare
by: Zhu, Hanwei, et al.
Published: (2024)
Benchmarking Robustness of Endoscopic Depth Estimation with Synthetically Corrupted Data
by: Wang, An, et al.
Published: (2024)
Benchmarking the Spatial Robustness of DNNs via Natural and Adversarial Localized Corruptions
by: Pietrosanti, Giulia Marchiori, et al.
Published: (2025)
PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions
by: Ma, Sihan, et al.
Published: (2024)
AI-generated Image Quality Assessment in Visual Communication
by: Tian, Yu, et al.
Published: (2024)
Simple Lines, Big Ideas: Towards Interpretable Assessment of Human Creativity from Drawings
by: Lin, Zihao, et al.
Published: (2025)
Unrolled Decomposed Unpaired Learning for Controllable Low-Light Video Enhancement
by: Zhu, Lingyu, et al.
Published: (2024)
SHALE: A Scalable Benchmark for Fine-grained Hallucination Evaluation in LVLMs
by: Yan, Bei, et al.
Published: (2025)
Dysca: A Dynamic and Scalable Benchmark for Evaluating Perception Ability of LVLMs
by: Zhang, Jie, et al.
Published: (2024)
AgenticIQA: An Agentic Framework for Adaptive and Interpretable Image Quality Assessment
by: Zhu, Hanwei, et al.
Published: (2025)
ResLPR: A LiDAR Data Restoration Network and Benchmark for Robust Place Recognition Against Weather Corruptions
by: Kuang, Wenqing, et al.
Published: (2025)
Plug In, Grade Right: Psychology-Inspired AGIQA
by: Liao, Zhicheng, et al.
Published: (2025)
Gap-closing Matters: Perceptual Quality Evaluation and Optimization of Low-Light Image Enhancement
by: Chen, Baoliang, et al.
Published: (2023)
The Loop Game: Quality Assessment and Optimization for Low-Light Image Enhancement
by: Huang, Danni, et al.
Published: (2022)
Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs
by: Liu, Xuannan, et al.
Published: (2025)
MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs
by: Liu, Xuannan, et al.
Published: (2024)
OODFace: Benchmarking Robustness of Face Recognition under Common Corruptions and Appearance Variations
by: Kang, Caixin, et al.
Published: (2024)
Robust Anti-Backdoor Instruction Tuning in LVLMs
by: Xun, Yuan, et al.
Published: (2025)
Perceptual Quality Assessment of Virtual Reality Videos in the Wild
by: Wen, Wen, et al.
Published: (2022)
Can LVLMs Obtain a Driver's License? A Benchmark Towards Reliable AGI for Autonomous Driving
by: Lu, Yuhang, et al.
Published: (2024)
VladVA: Discriminative Fine-tuning of LVLMs
by: Ouali, Yassine, et al.
Published: (2024)
Investigating Calibration and Corruption Robustness of Post-hoc Pruned Perception CNNs: An Image Classification Benchmark Study
by: Mitra, Pallavi, et al.
Published: (2024)
REOBench: Benchmarking Robustness of Earth Observation Foundation Models
by: Li, Xiang, et al.
Published: (2025)
Beyond Cosine Similarity: Magnitude-Aware CLIP for No-Reference Image Quality Assessment
by: Liao, Zhicheng, et al.
Published: (2025)
DD-RobustBench: An Adversarial Robustness Benchmark for Dataset Distillation
by: Wu, Yifan, et al.
Published: (2024)
MedFM-Robust: Benchmarking Robustness of Medical Foundation Models
by: Cui, Xiangxiang, et al.
Published: (2026)
Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift
by: Qiu, Jielin, et al.
Published: (2022)
Improving Alignment in LVLMs with Debiased Self-Judgment
by: Yang, Sihan, et al.
Published: (2025)
RobustSora: De-Watermarked Benchmark for Robust AI-Generated Video Detection
by: Wang, Zhuo, et al.
Published: (2025)