Saved in:
| Main Authors: | Wu, Kaiqun, Jiang, Xiaoling, Yu, Rui, Luo, Yonggang, Jiang, Tian, Wu, Xi, Wei, Peng |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.06992 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
TransDAE: Dual Attention Mechanism in a Hierarchical Transformer for Efficient Medical Image Segmentation
by: Azad, Bobby, et al.
Published: (2024)
by: Azad, Bobby, et al.
Published: (2024)
Local and Global Feature Attention Fusion Network for Face Recognition
by: Yu, Wang, et al.
Published: (2024)
by: Yu, Wang, et al.
Published: (2024)
Beyond Cosine Similarity: Magnitude-Aware CLIP for No-Reference Image Quality Assessment
by: Liao, Zhicheng, et al.
Published: (2025)
by: Liao, Zhicheng, et al.
Published: (2025)
Free Lunch for Unified Multimodal Models: Enhancing Generation via Reflective Rectification with Inherent Understanding
by: Jiang, Yibo, et al.
Published: (2026)
by: Jiang, Yibo, et al.
Published: (2026)
No-Reference Point Cloud Quality Assessment via Graph Convolutional Network
by: Chen, Wu, et al.
Published: (2024)
by: Chen, Wu, et al.
Published: (2024)
AI-generated Image Quality Assessment in Visual Communication
by: Tian, Yu, et al.
Published: (2024)
by: Tian, Yu, et al.
Published: (2024)
MapFusion: A Novel BEV Feature Fusion Network for Multi-modal Map Construction
by: Hao, Xiaoshuai, et al.
Published: (2025)
by: Hao, Xiaoshuai, et al.
Published: (2025)
WithAnyone: Towards Controllable and ID Consistent Image Generation
by: Xu, Hengyuan, et al.
Published: (2025)
by: Xu, Hengyuan, et al.
Published: (2025)
Fully $1\times1$ Convolutional Network for Lightweight Image Super-Resolution
by: Wu, Gang, et al.
Published: (2023)
by: Wu, Gang, et al.
Published: (2023)
Vision-Language Consistency Guided Multi-modal Prompt Learning for Blind AI Generated Image Quality Assessment
by: Fu, Jun, et al.
Published: (2024)
by: Fu, Jun, et al.
Published: (2024)
DEFNet: Multitasks-based Deep Evidential Fusion Network for Blind Image Quality Assessment
by: Lou, Yiwei, et al.
Published: (2025)
by: Lou, Yiwei, et al.
Published: (2025)
Fusion to Enhance: Fusion Visual Encoder to Enhance Multimodal Language Model
by: She, Yifei, et al.
Published: (2025)
by: She, Yifei, et al.
Published: (2025)
Residual Kolmogorov-Arnold Network for Enhanced Deep Learning
by: Yu, Ray Congrui, et al.
Published: (2024)
by: Yu, Ray Congrui, et al.
Published: (2024)
Large Language Model Guided Progressive Feature Alignment for Multimodal UAV Object Detection
by: Wu, Wentao, et al.
Published: (2025)
by: Wu, Wentao, et al.
Published: (2025)
UniPCB: A Unified Vision-Language Benchmark for Open-Ended PCB Quality Inspection
by: Sun, Fuxiang, et al.
Published: (2026)
by: Sun, Fuxiang, et al.
Published: (2026)
GPF-Net: Gated Progressive Fusion Learning for Polyp Re-Identification
by: Xiang, Suncheng, et al.
Published: (2025)
by: Xiang, Suncheng, et al.
Published: (2025)
Focal Modulation and Bidirectional Feature Fusion Network for Medical Image Segmentation
by: Safdar, Moin, et al.
Published: (2025)
by: Safdar, Moin, et al.
Published: (2025)
Image-Feature Weak-to-Strong Consistency: An Enhanced Paradigm for Semi-Supervised Learning
by: Wu, Zhiyu, et al.
Published: (2024)
by: Wu, Zhiyu, et al.
Published: (2024)
MaeFuse: Transferring Omni Features with Pretrained Masked Autoencoders for Infrared and Visible Image Fusion via Guided Training
by: Li, Jiayang, et al.
Published: (2024)
by: Li, Jiayang, et al.
Published: (2024)
MedIQA: A Scalable Foundation Model for Prompt-Driven Medical Image Quality Assessment
by: Xun, Siyi, et al.
Published: (2025)
by: Xun, Siyi, et al.
Published: (2025)
MDDFNet: Mamba-based Dynamic Dual Fusion Network for Traffic Sign Detection
by: Yu, TianYi
Published: (2025)
by: Yu, TianYi
Published: (2025)
Realism Control One-step Diffusion for Real-World Image Super-Resolution
by: Wu, Zongliang, et al.
Published: (2025)
by: Wu, Zongliang, et al.
Published: (2025)
MoVL:Exploring Fusion Strategies for the Domain-Adaptive Application of Pretrained Models in Medical Imaging Tasks
by: Tian, Haijiang, et al.
Published: (2024)
by: Tian, Haijiang, et al.
Published: (2024)
Towards Unified Semantic and Controllable Image Fusion: A Diffusion Transformer Approach
by: Li, Jiayang, et al.
Published: (2025)
by: Li, Jiayang, et al.
Published: (2025)
Viewport-Unaware Blind Omnidirectional Image Quality Assessment: A Unified and Generalized Approach
by: Yan, Jiebin, et al.
Published: (2026)
by: Yan, Jiebin, et al.
Published: (2026)
Enhancing Image Quality Assessment Ability of LMMs via Retrieval-Augmented Generation
by: Fu, Kang, et al.
Published: (2026)
by: Fu, Kang, et al.
Published: (2026)
V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation
by: Wang, Cong, et al.
Published: (2024)
by: Wang, Cong, et al.
Published: (2024)
VisJudge-Bench: Aesthetics and Quality Assessment of Visualizations
by: Xie, Yupeng, et al.
Published: (2025)
by: Xie, Yupeng, et al.
Published: (2025)
Progressive Image Restoration via Text-Conditioned Video Generation
by: Kang, Peng, et al.
Published: (2025)
by: Kang, Peng, et al.
Published: (2025)
Hyperspectral Imaging-Based Grain Quality Assessment With Limited Labelled Data
by: Karmakar, Priyabrata, et al.
Published: (2024)
by: Karmakar, Priyabrata, et al.
Published: (2024)
NTIRE 2025 challenge on Text to Image Generation Model Quality Assessment
by: Han, Shuhao, et al.
Published: (2025)
by: Han, Shuhao, et al.
Published: (2025)
TIER: Text-Image Encoder-based Regression for AIGC Image Quality Assessment
by: Yuan, Jiquan, et al.
Published: (2024)
by: Yuan, Jiquan, et al.
Published: (2024)
Depth Map Denoising Network and Lightweight Fusion Network for Enhanced 3D Face Recognition
by: Xu, Ruizhuo, et al.
Published: (2024)
by: Xu, Ruizhuo, et al.
Published: (2024)
XiHeFusion: Harnessing Large Language Models for Science Communication in Nuclear Fusion
by: Wang, Xiao, et al.
Published: (2025)
by: Wang, Xiao, et al.
Published: (2025)
Frequency-aware Feature Fusion for Dense Image Prediction
by: Chen, Linwei, et al.
Published: (2024)
by: Chen, Linwei, et al.
Published: (2024)
Feature Fusion Attention Network with CycleGAN for Image Dehazing, De-Snowing and De-Raining
by: Jain, Akshat
Published: (2025)
by: Jain, Akshat
Published: (2025)
DHGS: Decoupled Hybrid Gaussian Splatting for Driving Scene
by: Shi, Xi, et al.
Published: (2024)
by: Shi, Xi, et al.
Published: (2024)
Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction
by: Tian, Keyu, et al.
Published: (2024)
by: Tian, Keyu, et al.
Published: (2024)
Multimodal Feature Fusion Network with Text Difference Enhancement for Remote Sensing Change Detection
by: Zhou, Yijun, et al.
Published: (2025)
by: Zhou, Yijun, et al.
Published: (2025)
Epistemic Uncertainty for Generated Image Detection
by: Nie, Jun, et al.
Published: (2024)
by: Nie, Jun, et al.
Published: (2024)
Similar Items
-
TransDAE: Dual Attention Mechanism in a Hierarchical Transformer for Efficient Medical Image Segmentation
by: Azad, Bobby, et al.
Published: (2024) -
Local and Global Feature Attention Fusion Network for Face Recognition
by: Yu, Wang, et al.
Published: (2024) -
Beyond Cosine Similarity: Magnitude-Aware CLIP for No-Reference Image Quality Assessment
by: Liao, Zhicheng, et al.
Published: (2025) -
Free Lunch for Unified Multimodal Models: Enhancing Generation via Reflective Rectification with Inherent Understanding
by: Jiang, Yibo, et al.
Published: (2026) -
No-Reference Point Cloud Quality Assessment via Graph Convolutional Network
by: Chen, Wu, et al.
Published: (2024)