:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Jiang, Yangbo, Jiang, Zhiwei, Han, Le, Huang, Zenan, Zheng, Nenggan
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2403.01713
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

PIDNet: Progressive Implicit Decouple Network for Multimodal Action Quality Assessment
by: Li, Qiqi, et al.
Published: (2026)

DeepBranchTracer: A Generally-Applicable Approach to Curvilinear Structure Reconstruction Using Multi-Feature Learning
by: Liu, Chao, et al.
Published: (2024)

MCA-LLaVA: Manhattan Causal Attention for Reducing Hallucination in Large Vision-Language Models
by: Zhao, Qiyan, et al.
Published: (2025)

Efficient Pyramid Channel Attention Network for Pathological Myopia Recognition
by: Zhang, Xiaoqing, et al.
Published: (2023)

Revisiting the Ordering of Channel and Spatial Attention: A Comprehensive Study on Sequential and Parallel Designs
by: Liu, Zhongming, et al.
Published: (2026)

Spiking Meets Attention: Efficient Remote Sensing Image Super-Resolution with Attention Spiking Neural Networks
by: Xiao, Yi, et al.
Published: (2025)

MedAtlas: Evaluating LLMs for Multi-Round, Multi-Task Medical Reasoning Across Diverse Imaging Modalities and Clinical Text
by: Xu, Ronghao, et al.
Published: (2025)

MCA-RG: Enhancing LLMs with Medical Concept Alignment for Radiology Report Generation
by: Xing, Qilong, et al.
Published: (2025)

SC-HVPPNet: Spatial and Channel Hybrid-Attention Video Post-Processing Network with CNN and Transformer
by: Zhang, Tong, et al.
Published: (2024)

Recursive Deformable Image Registration Network with Mutual Attention
by: Zheng, Jian-Qing, et al.
Published: (2022)

MCA: 2D-3D Retrieval with Noisy Labels via Multi-level Adaptive Correction and Alignment
by: Zou, Gui, et al.
Published: (2025)

Graph Network for Sign Language Tasks
by: Gan, Shiwei, et al.
Published: (2025)

MoCha-Stereo: Motif Channel Attention Network for Stereo Matching
by: Chen, Ziyang, et al.
Published: (2024)

WaveNets: Wavelet Channel Attention Networks
by: Salman, Hadi, et al.
Published: (2022)

Exploring Graph-based Knowledge: Multi-Level Feature Distillation via Channels Relational Graph
by: Wang, Zhiwei, et al.
Published: (2024)

Text-Video Multi-Grained Integration for Video Moment Montage
by: Yin, Zhihui, et al.
Published: (2024)

MCA-Bench: A Multimodal Benchmark for Evaluating CAPTCHA Robustness Against VLM-based Attacks
by: Wu, Zonglin, et al.
Published: (2025)

H3DE-Net: Efficient and Accurate 3D Landmark Detection in Medical Imaging
by: Huang, Zhen, et al.
Published: (2025)

Lightweight Channel Attention for Efficient CNNs
by: Kanaparthi, Prem Babu, et al.
Published: (2026)

High-Fidelity Mural Restoration via a Unified Hybrid Mask-Aware Transformer
by: Jiang, Jincheng, et al.
Published: (2026)

Physically-Guided Optical Inversion Enable Non-Contact Side-Channel Attack on Isolated Screens
by: Zheng, Zhiwen, et al.
Published: (2026)

DSwinIR: Rethinking Window-based Attention for Image Restoration
by: Wu, Gang, et al.
Published: (2025)

Covariance-corrected Whitening Alleviates Network Degeneration on Imbalanced Classification
by: Zhang, Zhiwei
Published: (2024)

Veda: Scalable Video Diffusion via Distilled Sparse Attention
by: Han, Shihao, et al.
Published: (2026)

Chanel-Orderer: A Channel-Ordering Predictor for Tri-Channel Natural Images
by: Li, Shen, et al.
Published: (2024)

Moment Quantization for Video Temporal Grounding
by: Sun, Xiaolong, et al.
Published: (2025)

Johnson-Lindenstrauss Lemma Guided Network for Efficient 3D Medical Segmentation
by: Lu, Jinpeng, et al.
Published: (2025)

Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts
by: Lin, Zhiwei, et al.
Published: (2024)

Learning to Infer Unseen Single-/Multi-Attribute-Object Compositions with Graph Networks
by: Chen, Hui, et al.
Published: (2020)

RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network
by: Lin, Zhiwei, et al.
Published: (2024)

Agent Attention: On the Integration of Softmax and Linear Attention
by: Han, Dongchen, et al.
Published: (2023)

ReGLA: Efficient Receptive-Field Modeling with Gated Linear Attention Network
by: Li, Junzhou, et al.
Published: (2026)

Decoding with Structured Awareness: Integrating Directional, Frequency-Spatial, and Structural Attention for Medical Image Segmentation
by: Zhang, Fan, et al.
Published: (2025)

Design of an Expression Recognition Solution Based on the Global Channel-Spatial Attention Mechanism and Proportional Criterion Fusion
by: Yu, Jun, et al.
Published: (2025)

Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
by: Jiang, Yiyang, et al.
Published: (2024)

Exploring Latent Cross-Channel Embedding for Accurate 3D Human Pose Reconstruction in a Diffusion Framework
by: Jiang, Junkun, et al.
Published: (2024)

Demystify Mamba in Vision: A Linear Attention Perspective
by: Han, Dongchen, et al.
Published: (2024)

Multi-proposal Collaboration and Multi-task Training for Weakly-supervised Video Moment Retrieval
by: Zhang, Bolin, et al.
Published: (2026)

Inversion-Free Video Style Transfer with Trajectory Reset Attention Control and Content-Style Bridging
by: Lin, Jiang, et al.
Published: (2025)

Self-Parameterization Based Multi-Resolution Mesh Convolution Networks
by: Hezi, Shi, et al.
Published: (2024)