Saved in:
| Main Authors: | Jiang, Yangbo, Jiang, Zhiwei, Han, Le, Huang, Zenan, Zheng, Nenggan |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.01713 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
PIDNet: Progressive Implicit Decouple Network for Multimodal Action Quality Assessment
by: Li, Qiqi, et al.
Published: (2026)
by: Li, Qiqi, et al.
Published: (2026)
DeepBranchTracer: A Generally-Applicable Approach to Curvilinear Structure Reconstruction Using Multi-Feature Learning
by: Liu, Chao, et al.
Published: (2024)
by: Liu, Chao, et al.
Published: (2024)
MCA-LLaVA: Manhattan Causal Attention for Reducing Hallucination in Large Vision-Language Models
by: Zhao, Qiyan, et al.
Published: (2025)
by: Zhao, Qiyan, et al.
Published: (2025)
Efficient Pyramid Channel Attention Network for Pathological Myopia Recognition
by: Zhang, Xiaoqing, et al.
Published: (2023)
by: Zhang, Xiaoqing, et al.
Published: (2023)
Revisiting the Ordering of Channel and Spatial Attention: A Comprehensive Study on Sequential and Parallel Designs
by: Liu, Zhongming, et al.
Published: (2026)
by: Liu, Zhongming, et al.
Published: (2026)
Spiking Meets Attention: Efficient Remote Sensing Image Super-Resolution with Attention Spiking Neural Networks
by: Xiao, Yi, et al.
Published: (2025)
by: Xiao, Yi, et al.
Published: (2025)
MedAtlas: Evaluating LLMs for Multi-Round, Multi-Task Medical Reasoning Across Diverse Imaging Modalities and Clinical Text
by: Xu, Ronghao, et al.
Published: (2025)
by: Xu, Ronghao, et al.
Published: (2025)
MCA-RG: Enhancing LLMs with Medical Concept Alignment for Radiology Report Generation
by: Xing, Qilong, et al.
Published: (2025)
by: Xing, Qilong, et al.
Published: (2025)
SC-HVPPNet: Spatial and Channel Hybrid-Attention Video Post-Processing Network with CNN and Transformer
by: Zhang, Tong, et al.
Published: (2024)
by: Zhang, Tong, et al.
Published: (2024)
Recursive Deformable Image Registration Network with Mutual Attention
by: Zheng, Jian-Qing, et al.
Published: (2022)
by: Zheng, Jian-Qing, et al.
Published: (2022)
MCA: 2D-3D Retrieval with Noisy Labels via Multi-level Adaptive Correction and Alignment
by: Zou, Gui, et al.
Published: (2025)
by: Zou, Gui, et al.
Published: (2025)
Graph Network for Sign Language Tasks
by: Gan, Shiwei, et al.
Published: (2025)
by: Gan, Shiwei, et al.
Published: (2025)
MoCha-Stereo: Motif Channel Attention Network for Stereo Matching
by: Chen, Ziyang, et al.
Published: (2024)
by: Chen, Ziyang, et al.
Published: (2024)
WaveNets: Wavelet Channel Attention Networks
by: Salman, Hadi, et al.
Published: (2022)
by: Salman, Hadi, et al.
Published: (2022)
Exploring Graph-based Knowledge: Multi-Level Feature Distillation via Channels Relational Graph
by: Wang, Zhiwei, et al.
Published: (2024)
by: Wang, Zhiwei, et al.
Published: (2024)
Text-Video Multi-Grained Integration for Video Moment Montage
by: Yin, Zhihui, et al.
Published: (2024)
by: Yin, Zhihui, et al.
Published: (2024)
MCA-Bench: A Multimodal Benchmark for Evaluating CAPTCHA Robustness Against VLM-based Attacks
by: Wu, Zonglin, et al.
Published: (2025)
by: Wu, Zonglin, et al.
Published: (2025)
H3DE-Net: Efficient and Accurate 3D Landmark Detection in Medical Imaging
by: Huang, Zhen, et al.
Published: (2025)
by: Huang, Zhen, et al.
Published: (2025)
Lightweight Channel Attention for Efficient CNNs
by: Kanaparthi, Prem Babu, et al.
Published: (2026)
by: Kanaparthi, Prem Babu, et al.
Published: (2026)
High-Fidelity Mural Restoration via a Unified Hybrid Mask-Aware Transformer
by: Jiang, Jincheng, et al.
Published: (2026)
by: Jiang, Jincheng, et al.
Published: (2026)
Physically-Guided Optical Inversion Enable Non-Contact Side-Channel Attack on Isolated Screens
by: Zheng, Zhiwen, et al.
Published: (2026)
by: Zheng, Zhiwen, et al.
Published: (2026)
DSwinIR: Rethinking Window-based Attention for Image Restoration
by: Wu, Gang, et al.
Published: (2025)
by: Wu, Gang, et al.
Published: (2025)
Covariance-corrected Whitening Alleviates Network Degeneration on Imbalanced Classification
by: Zhang, Zhiwei
Published: (2024)
by: Zhang, Zhiwei
Published: (2024)
Veda: Scalable Video Diffusion via Distilled Sparse Attention
by: Han, Shihao, et al.
Published: (2026)
by: Han, Shihao, et al.
Published: (2026)
Chanel-Orderer: A Channel-Ordering Predictor for Tri-Channel Natural Images
by: Li, Shen, et al.
Published: (2024)
by: Li, Shen, et al.
Published: (2024)
Moment Quantization for Video Temporal Grounding
by: Sun, Xiaolong, et al.
Published: (2025)
by: Sun, Xiaolong, et al.
Published: (2025)
Johnson-Lindenstrauss Lemma Guided Network for Efficient 3D Medical Segmentation
by: Lu, Jinpeng, et al.
Published: (2025)
by: Lu, Jinpeng, et al.
Published: (2025)
Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts
by: Lin, Zhiwei, et al.
Published: (2024)
by: Lin, Zhiwei, et al.
Published: (2024)
Learning to Infer Unseen Single-/Multi-Attribute-Object Compositions with Graph Networks
by: Chen, Hui, et al.
Published: (2020)
by: Chen, Hui, et al.
Published: (2020)
RCBEVDet++: Toward High-accuracy Radar-Camera Fusion 3D Perception Network
by: Lin, Zhiwei, et al.
Published: (2024)
by: Lin, Zhiwei, et al.
Published: (2024)
Agent Attention: On the Integration of Softmax and Linear Attention
by: Han, Dongchen, et al.
Published: (2023)
by: Han, Dongchen, et al.
Published: (2023)
ReGLA: Efficient Receptive-Field Modeling with Gated Linear Attention Network
by: Li, Junzhou, et al.
Published: (2026)
by: Li, Junzhou, et al.
Published: (2026)
Decoding with Structured Awareness: Integrating Directional, Frequency-Spatial, and Structural Attention for Medical Image Segmentation
by: Zhang, Fan, et al.
Published: (2025)
by: Zhang, Fan, et al.
Published: (2025)
Design of an Expression Recognition Solution Based on the Global Channel-Spatial Attention Mechanism and Proportional Criterion Fusion
by: Yu, Jun, et al.
Published: (2025)
by: Yu, Jun, et al.
Published: (2025)
Prior Knowledge Integration via LLM Encoding and Pseudo Event Regulation for Video Moment Retrieval
by: Jiang, Yiyang, et al.
Published: (2024)
by: Jiang, Yiyang, et al.
Published: (2024)
Exploring Latent Cross-Channel Embedding for Accurate 3D Human Pose Reconstruction in a Diffusion Framework
by: Jiang, Junkun, et al.
Published: (2024)
by: Jiang, Junkun, et al.
Published: (2024)
Demystify Mamba in Vision: A Linear Attention Perspective
by: Han, Dongchen, et al.
Published: (2024)
by: Han, Dongchen, et al.
Published: (2024)
Multi-proposal Collaboration and Multi-task Training for Weakly-supervised Video Moment Retrieval
by: Zhang, Bolin, et al.
Published: (2026)
by: Zhang, Bolin, et al.
Published: (2026)
Inversion-Free Video Style Transfer with Trajectory Reset Attention Control and Content-Style Bridging
by: Lin, Jiang, et al.
Published: (2025)
by: Lin, Jiang, et al.
Published: (2025)
Self-Parameterization Based Multi-Resolution Mesh Convolution Networks
by: Hezi, Shi, et al.
Published: (2024)
by: Hezi, Shi, et al.
Published: (2024)
Similar Items
-
PIDNet: Progressive Implicit Decouple Network for Multimodal Action Quality Assessment
by: Li, Qiqi, et al.
Published: (2026) -
DeepBranchTracer: A Generally-Applicable Approach to Curvilinear Structure Reconstruction Using Multi-Feature Learning
by: Liu, Chao, et al.
Published: (2024) -
MCA-LLaVA: Manhattan Causal Attention for Reducing Hallucination in Large Vision-Language Models
by: Zhao, Qiyan, et al.
Published: (2025) -
Efficient Pyramid Channel Attention Network for Pathological Myopia Recognition
by: Zhang, Xiaoqing, et al.
Published: (2023) -
Revisiting the Ordering of Channel and Spatial Attention: A Comprehensive Study on Sequential and Parallel Designs
by: Liu, Zhongming, et al.
Published: (2026)