:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Shen, Shu, Chen, C. L. Philip, Zhang, Tong
Format:	Preprint
Published:	2024
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2412.14489
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Reliable Multimodal Learning Via Multi-Level Adaptive DeConfusion
by: Zhang, Tong, et al.
Published: (2025)

Test-time Adaptive Hierarchical Co-enhanced Denoising Network for Reliable Multimodal Classification
by: Shen, Shu, et al.
Published: (2026)

QuAD: Query-based Interpretable Neural Motion Planning for Autonomous Driving
by: Biswas, Sourav, et al.
Published: (2024)

AIM: Adaptive Intra-Network Modulation for Balanced Multimodal Learning
by: Shen, Shu, et al.
Published: (2025)

Contextual AD Narration with Interleaved Multimodal Sequence
by: Wang, Hanlin, et al.
Published: (2024)

Unlabeled Action Quality Assessment Based on Multi-dimensional Adaptive Constrained Dynamic Time Warping
by: Chen, Renguang, et al.
Published: (2024)

Multi-Level Correlation Network For Few-Shot Image Classification
by: Dang, Yunkai, et al.
Published: (2024)

M3-AGIQA: Multimodal, Multi-Round, Multi-Aspect AI-Generated Image Quality Assessment
by: Cui, Chuan, et al.
Published: (2025)

Reliable Representation Learning for Incomplete Multi-View Missing Multi-Label Classification
by: Liu, Chengliang, et al.
Published: (2023)

MAPLE: Multi-Path Adaptive Propagation with Level-Aware Embeddings for Hierarchical Multi-Label Image Classification
by: Koloski, Boshko, et al.
Published: (2026)

Adaptive Multi-step Refinement Network for Robust Point Cloud Registration
by: Chen, Zhi, et al.
Published: (2023)

MultiPull: Detailing Signed Distance Functions by Pulling Multi-Level Queries at Multi-Step
by: Noda, Takeshi, et al.
Published: (2024)

Pose-Aware Multi-Level Motion Parsing for Action Quality Assessment
by: Zhu, Shuaikang, et al.
Published: (2025)

MMTL-UniAD: A Unified Framework for Multimodal and Multi-Task Learning in Assistive Driving Perception
by: Liu, Wenzhuo, et al.
Published: (2025)

Multi-Level Feature Fusion for Continual Learning in Visual Quality Inspection
by: Bauer, Johannes C., et al.
Published: (2026)

Omni-AD: Learning to Reconstruct Global and Local Features for Multi-class Anomaly Detection
by: Quan, Jiajie, et al.
Published: (2025)

QuARI: Query Adaptive Retrieval Improvement
by: Xing, Eric, et al.
Published: (2025)

Exploring Multi-Timestep Multi-Stage Diffusion Features for Hyperspectral Image Classification
by: Zhou, Jingyi, et al.
Published: (2023)

Dynamic Policy-Driven Adaptive Multi-Instance Learning for Whole Slide Image Classification
by: Zheng, Tingting, et al.
Published: (2024)

Phantom-Insight: Adaptive Multi-cue Fusion for Video Camouflaged Object Detection with Multimodal LLM
by: Zhang, Hua, et al.
Published: (2025)

SIQA: Toward Reliable Scientific Image Quality Assessment
by: Li, Wenzhe, et al.
Published: (2026)

Graph Attention Transformer Network for Multi-Label Image Classification
by: Yuan, Jin, et al.
Published: (2022)

MultiFair: Multimodal Balanced Fairness-Aware Medical Classification with Dual-Level Gradient Modulation
by: Zubair, Md, et al.
Published: (2025)

MLANet: Multi-Level Attention Network with Sub-instruction for Continuous Vision-and-Language Navigation
by: He, Zongtao, et al.
Published: (2023)

AD-FM: Multimodal LLMs for Anomaly Detection via Multi-Stage Reasoning and Fine-Grained Reward Optimization
by: Liao, Jingyi, et al.
Published: (2025)

Reliable Reasoning in SVG-LLMs via Multi-Task Multi-Reward Reinforcement Learning
by: Wang, Haomin, et al.
Published: (2026)

Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following
by: Xiong, Tianyi, et al.
Published: (2025)

QuEPT: Quantized Elastic Precision Transformers with One-Shot Calibration for Multi-Bit Switching
by: Xu, Ke, et al.
Published: (2026)

UltraAD: Fine-Grained Ultrasound Anomaly Classification via Few-Shot CLIP Adaptation
by: Zhou, Yue, et al.
Published: (2025)

Multi-scale Unified Network for Image Classification
by: Liu, Wenzhuo, et al.
Published: (2024)

MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection
by: He, Haoyang, et al.
Published: (2024)

Deep But Reliable: Advancing Multi-turn Reasoning for Thinking with Images
by: Yang, Wenhao, et al.
Published: (2025)

TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification
by: Liu, Qinying, et al.
Published: (2023)

Learned Rate Control for Frame-Level Adaptive Neural Video Compression via Dynamic Neural Network
by: Zhang, Chenhao, et al.
Published: (2025)

MECFormer: Multi-task Whole Slide Image Classification with Expert Consultation Network
by: Bui, Doanh C., et al.
Published: (2024)

Multi-Level CLS Token Fusion for Contrastive Learning in Endoscopy Image Classification
by: Nguyen, Y Hop, et al.
Published: (2025)

Low-Level Matters: An Efficient Hybrid Architecture for Robust Multi-frame Infrared Small Target Detection
by: Shen, Zhihua, et al.
Published: (2025)

Adaptive and Temporally Consistent Gaussian Surfels for Multi-view Dynamic Reconstruction
by: Chen, Decai, et al.
Published: (2024)

MMD-Thinker: Adaptive Multi-Dimensional Thinking for Multimodal Misinformation Detection
by: Wu, Junjie, et al.
Published: (2025)

MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset
by: Shen, Xin, et al.
Published: (2024)