Saved in:
| Main Authors: | Chen, Wenzhi, Hu, Bo, Li, Leida, He, Lihuo, Lu, Wen, Gao, Xinbo |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.04614 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
AI-Generated Image Quality Assessment Based on Task-Specific Prompt and Multi-Granularity Similarity
by: Xia, Jili, et al.
Published: (2024)
by: Xia, Jili, et al.
Published: (2024)
A Multi-annotated and Multi-modal Dataset for Wide-angle Video Quality Assessment
by: Hu, Bo, et al.
Published: (2025)
by: Hu, Bo, et al.
Published: (2025)
HyperAlign: Hypernetwork for Efficient Test-Time Alignment of Diffusion Models
by: Xie, Xin, et al.
Published: (2026)
by: Xie, Xin, et al.
Published: (2026)
EyeSim-VQA: A Free-Energy-Guided Eye Simulation Framework for Video Quality Assessment
by: Wang, Zhaoyang, et al.
Published: (2025)
by: Wang, Zhaoyang, et al.
Published: (2025)
Boosting Temporal Sentence Grounding via Causal Inference
by: Tang, Kefan, et al.
Published: (2025)
by: Tang, Kefan, et al.
Published: (2025)
Towards Adaptive Open-Set Object Detection via Category-Level Collaboration Knowledge Mining
by: Ji, Yuqi, et al.
Published: (2026)
by: Ji, Yuqi, et al.
Published: (2026)
HYPE: Hyperbolic Entailment Filtering for Underspecified Images and Texts
by: Kim, Wonjae, et al.
Published: (2024)
by: Kim, Wonjae, et al.
Published: (2024)
FCA2: Frame Compression-Aware Autoencoder for Modular and Fast Compressed Video Super-Resolution
by: Wang, Zhaoyang, et al.
Published: (2025)
by: Wang, Zhaoyang, et al.
Published: (2025)
CognitionCapturer: Decoding Visual Stimuli From Human EEG Signal With Multimodal Information
by: Zhang, Kaifan, et al.
Published: (2024)
by: Zhang, Kaifan, et al.
Published: (2024)
Diffusion Model Based Visual Compensation Guidance and Visual Difference Analysis for No-Reference Image Quality Assessment
by: Wang, Zhaoyang, et al.
Published: (2024)
by: Wang, Zhaoyang, et al.
Published: (2024)
CognitionCapturerPro: Towards High-Fidelity Visual Decoding from EEG/MEG via Multi-modal Information and Asymmetric Alignment
by: Zhang, Kaifan, et al.
Published: (2026)
by: Zhang, Kaifan, et al.
Published: (2026)
Omni-I2C: A Holistic Benchmark for High-Fidelity Image-to-Code Generation
by: Zhou, Jiawei, et al.
Published: (2026)
by: Zhou, Jiawei, et al.
Published: (2026)
DELST: Dual Entailment Learning for Hyperbolic Image-Gene Pretraining in Spatial Transcriptomics
by: Chen, Xulin, et al.
Published: (2025)
by: Chen, Xulin, et al.
Published: (2025)
Content-Adaptive Image Retouching Guided by Attribute-Based Text Representation
by: Zhu, Hancheng, et al.
Published: (2025)
by: Zhu, Hancheng, et al.
Published: (2025)
Language-Guided Visual Perception Disentanglement for Image Quality Assessment and Conditional Image Generation
by: Yang, Zhichao, et al.
Published: (2025)
by: Yang, Zhichao, et al.
Published: (2025)
Shrinking the Teacher: An Adaptive Teaching Paradigm for Asymmetric EEG-Vision Alignment
by: Wu, Lukun, et al.
Published: (2025)
by: Wu, Lukun, et al.
Published: (2025)
Towards Improved Text-Aligned Codebook Learning: Multi-Hierarchical Codebook-Text Alignment with Long Text
by: Liang, Guotao, et al.
Published: (2025)
by: Liang, Guotao, et al.
Published: (2025)
Semantic-Aligned Learning with Collaborative Refinement for Unsupervised VI-ReID
by: Cheng, De, et al.
Published: (2025)
by: Cheng, De, et al.
Published: (2025)
YOLOA: Real-Time Affordance Detection via LLM Adapter
by: Ji, Yuqi, et al.
Published: (2025)
by: Ji, Yuqi, et al.
Published: (2025)
Fine-grained Image Quality Assessment for Perceptual Image Restoration
by: Sheng, Xiangfei, et al.
Published: (2025)
by: Sheng, Xiangfei, et al.
Published: (2025)
HyperST: Hierarchical Hyperbolic Learning for Spatial Transcriptomics Prediction
by: Zhang, Chen, et al.
Published: (2025)
by: Zhang, Chen, et al.
Published: (2025)
Compositional Entailment Learning for Hyperbolic Vision-Language Models
by: Pal, Avik, et al.
Published: (2024)
by: Pal, Avik, et al.
Published: (2024)
Align Beyond Prompts: Evaluating World Knowledge Alignment in Text-to-Image Generation
by: Zhang, Wenchao, et al.
Published: (2025)
by: Zhang, Wenchao, et al.
Published: (2025)
InstructEngine: Instruction-driven Text-to-Image Alignment
by: Lu, Xingyu, et al.
Published: (2025)
by: Lu, Xingyu, et al.
Published: (2025)
AlignIT: Enhancing Prompt Alignment in Customization of Text-to-Image Models
by: Agarwal, Aishwarya, et al.
Published: (2024)
by: Agarwal, Aishwarya, et al.
Published: (2024)
Hyperbolic Hierarchical Alignment Reasoning Network for Text-3D Retrieval
by: Li, Wenrui, et al.
Published: (2025)
by: Li, Wenrui, et al.
Published: (2025)
Hyperbolic Cycle Alignment for Infrared-Visible Image Fusion
by: Li, Timing, et al.
Published: (2025)
by: Li, Timing, et al.
Published: (2025)
Fine-grained Image Aesthetic Assessment: Learning Discriminative Scores from Relative Ranks
by: Yang, Zhichao, et al.
Published: (2026)
by: Yang, Zhichao, et al.
Published: (2026)
PopAlign: Population-Level Alignment for Fair Text-to-Image Generation
by: Li, Shufan, et al.
Published: (2024)
by: Li, Shufan, et al.
Published: (2024)
LongT2IBench: A Benchmark for Evaluating Long Text-to-Image Generation with Graph-structured Annotations
by: Yang, Zhichao, et al.
Published: (2025)
by: Yang, Zhichao, et al.
Published: (2025)
RAISE: Requirement-Adaptive Evolutionary Refinement for Training-Free Text-to-Image Alignment
by: Jiang, Liyao, et al.
Published: (2026)
by: Jiang, Liyao, et al.
Published: (2026)
Bridging the Synthetic-to-Authentic Gap: Distortion-Guided Unsupervised Domain Adaptation for Blind Image Quality Assessment
by: Li, Aobo, et al.
Published: (2024)
by: Li, Aobo, et al.
Published: (2024)
AlignGuard: Scalable Safety Alignment for Text-to-Image Generation
by: Liu, Runtao, et al.
Published: (2024)
by: Liu, Runtao, et al.
Published: (2024)
TextAlign: Preference Alignment for Text Rendering with Hierarchical Rewards
by: Cui, Mingxuan, et al.
Published: (2026)
by: Cui, Mingxuan, et al.
Published: (2026)
HyperPath: Knowledge-Guided Hyperbolic Semantic Hierarchy Modeling for WSI Analysis
by: Huang, Peixiang, et al.
Published: (2025)
by: Huang, Peixiang, et al.
Published: (2025)
TIQA: Human-Aligned Perceptual Text Quality Assessment in Generated Images
by: Koltsov, Kirill, et al.
Published: (2026)
by: Koltsov, Kirill, et al.
Published: (2026)
Ranking-based Adaptive Query Generation for DETRs in Crowded Pedestrian Detection
by: Gao, Feng, et al.
Published: (2023)
by: Gao, Feng, et al.
Published: (2023)
DAMSDet: Dynamic Adaptive Multispectral Detection Transformer with Competitive Query Selection and Adaptive Feature Fusion
by: Guo, Junjie, et al.
Published: (2024)
by: Guo, Junjie, et al.
Published: (2024)
VEglue: Testing Visual Entailment Systems via Object-Aligned Joint Erasing
by: Chang, Zhiyuan, et al.
Published: (2024)
by: Chang, Zhiyuan, et al.
Published: (2024)
StructAlign: Structured Cross-Modal Alignment for Continual Text-to-Video Retrieval
by: Wang, Shaokun, et al.
Published: (2026)
by: Wang, Shaokun, et al.
Published: (2026)
Similar Items
-
AI-Generated Image Quality Assessment Based on Task-Specific Prompt and Multi-Granularity Similarity
by: Xia, Jili, et al.
Published: (2024) -
A Multi-annotated and Multi-modal Dataset for Wide-angle Video Quality Assessment
by: Hu, Bo, et al.
Published: (2025) -
HyperAlign: Hypernetwork for Efficient Test-Time Alignment of Diffusion Models
by: Xie, Xin, et al.
Published: (2026) -
EyeSim-VQA: A Free-Energy-Guided Eye Simulation Framework for Video Quality Assessment
by: Wang, Zhaoyang, et al.
Published: (2025) -
Boosting Temporal Sentence Grounding via Causal Inference
by: Tang, Kefan, et al.
Published: (2025)