Saved in:
| Main Authors: | Luo, Wei, Lu, Yiting, Li, Xin, Li, Haoran, Guan, Fengbin, Gao, Chen, Jin, Xin, Li, Yong, Chen, Zhibo, Wu, Sijing, Fu, Kang, Li, Yunhao, Xiao, Ziang, Duan, Huiyu, Liu, Jing, Hu, Qiang, Min, Xiongkuo, Zhai, Guangtao, Sun, Manxi, Guo, Zixuan, Li, Yun, Chen, Ziyang, Tsukada, Manabu, Li, Zhengyang, Du, Zhenglin, Wen, Yi, Jiao, Licheng, Liu, Fang, Li, Lingling, Ren, Yiwen, Song, Zhilong, Chen, Dubing, Zhou, Yucheng, Yan, Tianyi, Zheng, Huan |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.05187 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Video Quality Assessment Based on Swin TransformerV2 and Coarse to Fine Strategy
by: Yu, Zihao, et al.
Published: (2024)
by: Yu, Zihao, et al.
Published: (2024)
InternVQA: Advancing Compressed Video Quality Assessment with Distilling Large Foundation Model
by: Guan, Fengbin, et al.
Published: (2025)
by: Guan, Fengbin, et al.
Published: (2025)
QMamba: On First Exploration of Vision Mamba for Image Quality Assessment
by: Guan, Fengbin, et al.
Published: (2024)
by: Guan, Fengbin, et al.
Published: (2024)
VTONQA: A Multi-Dimensional Quality Assessment Dataset for Virtual Try-on
by: Wei, Xinyi, et al.
Published: (2026)
by: Wei, Xinyi, et al.
Published: (2026)
Q-Bench-Portrait: Benchmarking Multimodal Large Language Models on Portrait Image Quality Perception
by: Wu, Sijing, et al.
Published: (2026)
by: Wu, Sijing, et al.
Published: (2026)
VideoAesBench: Benchmarking the Video Aesthetics Perception Capabilities of Large Multimodal Models
by: Li, Yunhao, et al.
Published: (2026)
by: Li, Yunhao, et al.
Published: (2026)
BMPCQA: Bioinspired Metaverse Point Cloud Quality Assessment Based on Large Multimodal Models
by: Huiyu Duan, et al.
Published: (2025)
by: Huiyu Duan, et al.
Published: (2025)
LossAgent: Towards Any Optimization Objectives for Image Processing with LLM Agents
by: Li, Bingchen, et al.
Published: (2024)
by: Li, Bingchen, et al.
Published: (2024)
Hybrid Agents for Image Restoration
by: Li, Bingchen, et al.
Published: (2025)
by: Li, Bingchen, et al.
Published: (2025)
SFQA: A Comprehensive Perceptual Quality Assessment Dataset for Singing Face Generation
by: Gao, Zhilin, et al.
Published: (2026)
by: Gao, Zhilin, et al.
Published: (2026)
MMHead: Towards Fine-grained Multi-modal 3D Facial Animation
by: Wu, Sijing, et al.
Published: (2024)
by: Wu, Sijing, et al.
Published: (2024)
DHQA-4D: Perceptual Quality Assessment of Dynamic 4D Digital Human
by: Li, Yunhao, et al.
Published: (2025)
by: Li, Yunhao, et al.
Published: (2025)
Exploring Instruction Data Quality for Explainable Image Quality Assessment
by: Li, Yunhao, et al.
Published: (2025)
by: Li, Yunhao, et al.
Published: (2025)
Ges-QA: A Multidimensional Quality Assessment Dataset for Audio-to-3D Gesture Generation
by: Gao, Zhilin, et al.
Published: (2025)
by: Gao, Zhilin, et al.
Published: (2025)
RGC-VQA: An Exploration Database for Robotic-Generated Video Quality Assessment
by: Jin, Jianing, et al.
Published: (2025)
by: Jin, Jianing, et al.
Published: (2025)
AGHI-QA: A Subjective-Aligned Dataset and Metric for AI-Generated Human Images
by: Li, Yunhao, et al.
Published: (2025)
by: Li, Yunhao, et al.
Published: (2025)
ODI-Bench: Can MLLMs Understand Immersive Omnidirectional Environments?
by: Yang, Liu, et al.
Published: (2025)
by: Yang, Liu, et al.
Published: (2025)
IQA-Spider: Unifying Multi-Granularity Image Quality Assessment with Reasoning, Grounding and Referring
by: Peng, Xinge, et al.
Published: (2026)
by: Peng, Xinge, et al.
Published: (2026)
UniProcessor: A Text-induced Unified Low-level Image Processor
by: Duan, Huiyu, et al.
Published: (2024)
by: Duan, Huiyu, et al.
Published: (2024)
FVQ: A Large-Scale Dataset and an LMM-based Method for Face Video Quality Assessment
by: Wu, Sijing, et al.
Published: (2025)
by: Wu, Sijing, et al.
Published: (2025)
Q-Adapt: Adapting LMM for Visual Quality Assessment with Progressive Instruction Tuning
by: Lu, Yiting, et al.
Published: (2025)
by: Lu, Yiting, et al.
Published: (2025)
Priorformer: A UGC-VQA Method with content and distortion priors
by: Pei, Yajing, et al.
Published: (2024)
by: Pei, Yajing, et al.
Published: (2024)
SpongeBob: Sync-Aware Harmonious Audio-Visual Generative Editing
by: Liang, Sen, et al.
Published: (2026)
by: Liang, Sen, et al.
Published: (2026)
ELIQ: A Label-Free Framework for Quality Assessment of Evolving AI-Generated Images
by: Li, Xinyue, et al.
Published: (2026)
by: Li, Xinyue, et al.
Published: (2026)
LiftVSR: Lifting Image Diffusion to Video Super-Resolution via Hybrid Temporal Modeling with Only 4$\times$RTX 4090s
by: Wang, Xijun, et al.
Published: (2025)
by: Wang, Xijun, et al.
Published: (2025)
CoNo: Consistency Noise Injection for Tuning-free Long Video Diffusion
by: Wang, Xingrui, et al.
Published: (2024)
by: Wang, Xingrui, et al.
Published: (2024)
DynT2I-Eval: A Dynamic Evaluation Framework for Text-to-Image Models
by: Wang, Juntong, et al.
Published: (2026)
by: Wang, Juntong, et al.
Published: (2026)
LoViF 2026 The First Challenge on Weather Removal in Videos
by: Qian, Chenghao, et al.
Published: (2026)
by: Qian, Chenghao, et al.
Published: (2026)
LoViF 2026 Challenge on Real-World All-in-One Image Restoration: Methods and Results
by: Chen, Xiang, et al.
Published: (2026)
by: Chen, Xiang, et al.
Published: (2026)
PromptCIR: Blind Compressed Image Restoration with Prompt Learning
by: Li, Bingchen, et al.
Published: (2024)
by: Li, Bingchen, et al.
Published: (2024)
Q&C: When Quantization Meets Cache in Efficient Image Generation
by: Ding, Xin, et al.
Published: (2025)
by: Ding, Xin, et al.
Published: (2025)
EditRefiner: A Human-Aligned Agentic Framework for Image Editing Refinement
by: Xu, Zitong, et al.
Published: (2026)
by: Xu, Zitong, et al.
Published: (2026)
Facial Attractiveness Prediction in Live Streaming: A New Benchmark and Multi-modal Method
by: Li, Hui, et al.
Published: (2025)
by: Li, Hui, et al.
Published: (2025)
Progress in Studies on the Equilibrium Shape of Headland-bay Shoreline.
by: Li, Zhilong, et al.
Published: (2007)
by: Li, Zhilong, et al.
Published: (2007)
Towards Defining an Efficient and Expandable File Format for AI-Generated Contents
by: Gao, Yixin, et al.
Published: (2024)
by: Gao, Yixin, et al.
Published: (2024)
Is Vanilla MLP in Neural Radiance Field Enough for Few-shot View Synthesis?
by: Zhu, Hanxin, et al.
Published: (2024)
by: Zhu, Hanxin, et al.
Published: (2024)
ReasonEdit: Towards Interpretable Image Editing Evaluation via Reinforcement Learning
by: Chen, Honghua, et al.
Published: (2026)
by: Chen, Honghua, et al.
Published: (2026)
LoViF 2026 Challenge on Human-oriented Semantic Image Quality Assessment: Methods and Results
by: Li, Xin, et al.
Published: (2026)
by: Li, Xin, et al.
Published: (2026)
How is Visual Attention Influenced by Text Guidance? Database and Model
by: Sun, Yinan, et al.
Published: (2024)
by: Sun, Yinan, et al.
Published: (2024)
Quality Assessment for AI Generated Images with Instruction Tuning
by: Wang, Jiarui, et al.
Published: (2024)
by: Wang, Jiarui, et al.
Published: (2024)
Similar Items
-
Video Quality Assessment Based on Swin TransformerV2 and Coarse to Fine Strategy
by: Yu, Zihao, et al.
Published: (2024) -
InternVQA: Advancing Compressed Video Quality Assessment with Distilling Large Foundation Model
by: Guan, Fengbin, et al.
Published: (2025) -
QMamba: On First Exploration of Vision Mamba for Image Quality Assessment
by: Guan, Fengbin, et al.
Published: (2024) -
VTONQA: A Multi-Dimensional Quality Assessment Dataset for Virtual Try-on
by: Wei, Xinyi, et al.
Published: (2026) -
Q-Bench-Portrait: Benchmarking Multimodal Large Language Models on Portrait Image Quality Perception
by: Wu, Sijing, et al.
Published: (2026)