| Main Authors: | Wu, Sijing, Li, Yunhao, Xu, Ziwen, Gao, Yixuan, Duan, Huiyu, Sun, Wei, Zhai, Guangtao |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2504.09255 |
Similar Items
VTONQA: A Multi-Dimensional Quality Assessment Dataset for Virtual Try-on
by: Wei, Xinyi, et al.
Published: (2026)
SFQA: A Comprehensive Perceptual Quality Assessment Dataset for Singing Face Generation
by: Gao, Zhilin, et al.
Published: (2026)
Exploring Instruction Data Quality for Explainable Image Quality Assessment
by: Li, Yunhao, et al.
Published: (2025)
DHQA-4D: Perceptual Quality Assessment of Dynamic 4D Digital Human
by: Li, Yunhao, et al.
Published: (2025)
LMM-VQA: Advancing Video Quality Assessment with Large Multimodal Models
by: Ge, Qihang, et al.
Published: (2024)
AIGV-Assessor: Benchmarking and Evaluating the Perceptual Quality of Text-to-Video Generation with LMM
by: Wang, Jiarui, et al.
Published: (2024)
Ges-QA: A Multidimensional Quality Assessment Dataset for Audio-to-3D Gesture Generation
by: Gao, Zhilin, et al.
Published: (2025)
RGC-VQA: An Exploration Database for Robotic-Generated Video Quality Assessment
by: Jin, Jianing, et al.
Published: (2025)
VideoAesBench: Benchmarking the Video Aesthetics Perception Capabilities of Large Multimodal Models
by: Li, Yunhao, et al.
Published: (2026)
LMM4LMM: Benchmarking and Evaluating Large-multimodal Image Generation with LMMs
by: Wang, Jiarui, et al.
Published: (2025)
Perceptual Video Quality Assessment: A Survey
by: Min, Xiongkuo, et al.
Published: (2024)
Q-Bench-Portrait: Benchmarking Multimodal Large Language Models on Portrait Image Quality Perception
by: Wu, Sijing, et al.
Published: (2026)
MMHead: Towards Fine-grained Multi-modal 3D Facial Animation
by: Wu, Sijing, et al.
Published: (2024)
MVBIND: Self-Supervised Music Recommendation For Videos Via Embedding Space Binding
by: Teng, Jiajie, et al.
Published: (2024)
AGHI-QA: A Subjective-Aligned Dataset and Metric for AI-Generated Human Images
by: Li, Yunhao, et al.
Published: (2025)
LMM-PCQA: Assisting Point Cloud Quality Assessment with LMM
by: Zhang, Zicheng, et al.
Published: (2024)
UniProcessor: A Text-induced Unified Low-level Image Processor
by: Duan, Huiyu, et al.
Published: (2024)
Quality Assessment for AI Generated Images with Instruction Tuning
by: Wang, Jiarui, et al.
Published: (2024)
BMPCQA: Bioinspired Metaverse Point Cloud Quality Assessment Based on Large Multimodal Models
by: Duan, Huiyu, et al.
Published: (2025)
Surveillance Facial Image Quality Assessment: A Multi-dimensional Dataset and Lightweight Model
by: Jiang, Yanwei, et al.
Published: (2026)
AGAV-Rater: Adapting Large Multimodal Model for AI-Generated Audio-Visual Quality Assessment
by: Cao, Yuqin, et al.
Published: (2025)
EEmo-Logic: A Unified Dataset and Multi-Stage Framework for Comprehensive Image-Evoked Emotion Assessment
by: Gao, Lancheng, et al.
Published: (2026)
Preference-Guided Debiasing for No-Reference Enhancement Image Quality Assessment
by: Gao, Shiqi, et al.
Published: (2026)
Multi-Dimensional Quality Assessment for Text-to-3D Assets: Dataset and Model
by: Fu, Kang, et al.
Published: (2025)
ESVQA: Perceptual Quality Assessment of Egocentric Spatial Videos
by: Zhu, Xilei, et al.
Published: (2024)
Exploring Image Quality Assessment from a New Perspective: Pupil Size
by: Gao, Yixuan, et al.
Published: (2025)
TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs
by: Wang, Juntong, et al.
Published: (2025)
ODI-Bench: Can MLLMs Understand Immersive Omnidirectional Environments?
by: Yang, Liu, et al.
Published: (2025)
ESIQA: Perceptual Quality Assessment of Vision-Pro-based Egocentric Spatial Images
by: Zhu, Xilei, et al.
Published: (2024)
SingingHead: A Large-scale 4D Dataset for Singing Head Animation
by: Wu, Sijing, et al.
Published: (2023)
LMME3DHF: Benchmarking and Evaluating Multimodal 3D Human Face Generation with LMMs
by: Yang, Woo Yi, et al.
Published: (2025)
LMM4Edit: Benchmarking and Evaluating Multimodal Image Editing with LMMs
by: Xu, Zitong, et al.
Published: (2025)
How is Visual Attention Influenced by Text Guidance? Database and Model
by: Sun, Yinan, et al.
Published: (2024)
Embodied Image Quality Assessment for Robotic Intelligence
by: Zhang, Jianbo, et al.
Published: (2024)
Analysis of Video Quality Datasets via Design of Minimalistic Video Quality Models
by: Sun, Wei, et al.
Published: (2023)
Life-IQA: Boosting Blind Image Quality Assessment through GCN-enhanced Layer Interaction and MoE-based Feature Decoupling
by: Tang, Long, et al.
Published: (2025)
Exploring Rich Subjective Quality Information for Image Quality Assessment in the Wild
by: Min, Xiongkuo, et al.
Published: (2024)
Subjective-Aligned Dataset and Metric for Text-to-Video Quality Assessment
by: Kou, Tengchuan, et al.
Published: (2024)
HarmonyIQA: Pioneering Benchmark and Model for Image Harmonization Quality Assessment
by: Xu, Zitong, et al.
Published: (2025)
Steering and Rectifying Latent Representation Manifolds in Frozen Multi-modal LLMs for Video Anomaly Detection
by: Cai, Zhaolin, et al.
Published: (2026)