| Main Authors: | Shen, Shu, Chen, C. L. Philip, Zhang, Tong |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2412.14489 |
Similar Items
Reliable Multimodal Learning Via Multi-Level Adaptive DeConfusion
by: Zhang, Tong, et al.
Published: (2025)
Test-time Adaptive Hierarchical Co-enhanced Denoising Network for Reliable Multimodal Classification
by: Shen, Shu, et al.
Published: (2026)
QuAD: Query-based Interpretable Neural Motion Planning for Autonomous Driving
by: Biswas, Sourav, et al.
Published: (2024)
AIM: Adaptive Intra-Network Modulation for Balanced Multimodal Learning
by: Shen, Shu, et al.
Published: (2025)
Contextual AD Narration with Interleaved Multimodal Sequence
by: Wang, Hanlin, et al.
Published: (2024)
Unlabeled Action Quality Assessment Based on Multi-dimensional Adaptive Constrained Dynamic Time Warping
by: Chen, Renguang, et al.
Published: (2024)
Multi-Level Correlation Network For Few-Shot Image Classification
by: Dang, Yunkai, et al.
Published: (2024)
M3-AGIQA: Multimodal, Multi-Round, Multi-Aspect AI-Generated Image Quality Assessment
by: Cui, Chuan, et al.
Published: (2025)
Reliable Representation Learning for Incomplete Multi-View Missing Multi-Label Classification
by: Liu, Chengliang, et al.
Published: (2023)
MAPLE: Multi-Path Adaptive Propagation with Level-Aware Embeddings for Hierarchical Multi-Label Image Classification
by: Koloski, Boshko, et al.
Published: (2026)
Adaptive Multi-step Refinement Network for Robust Point Cloud Registration
by: Chen, Zhi, et al.
Published: (2023)
MultiPull: Detailing Signed Distance Functions by Pulling Multi-Level Queries at Multi-Step
by: Noda, Takeshi, et al.
Published: (2024)
Pose-Aware Multi-Level Motion Parsing for Action Quality Assessment
by: Zhu, Shuaikang, et al.
Published: (2025)
MMTL-UniAD: A Unified Framework for Multimodal and Multi-Task Learning in Assistive Driving Perception
by: Liu, Wenzhuo, et al.
Published: (2025)
Multi-Level Feature Fusion for Continual Learning in Visual Quality Inspection
by: Bauer, Johannes C., et al.
Published: (2026)
Omni-AD: Learning to Reconstruct Global and Local Features for Multi-class Anomaly Detection
by: Quan, Jiajie, et al.
Published: (2025)
QuARI: Query Adaptive Retrieval Improvement
by: Xing, Eric, et al.
Published: (2025)
Exploring Multi-Timestep Multi-Stage Diffusion Features for Hyperspectral Image Classification
by: Zhou, Jingyi, et al.
Published: (2023)
Dynamic Policy-Driven Adaptive Multi-Instance Learning for Whole Slide Image Classification
by: Zheng, Tingting, et al.
Published: (2024)
Phantom-Insight: Adaptive Multi-cue Fusion for Video Camouflaged Object Detection with Multimodal LLM
by: Zhang, Hua, et al.
Published: (2025)
SIQA: Toward Reliable Scientific Image Quality Assessment
by: Li, Wenzhe, et al.
Published: (2026)
Graph Attention Transformer Network for Multi-Label Image Classification
by: Yuan, Jin, et al.
Published: (2022)
MultiFair: Multimodal Balanced Fairness-Aware Medical Classification with Dual-Level Gradient Modulation
by: Zubair, Md, et al.
Published: (2025)
MLANet: Multi-Level Attention Network with Sub-instruction for Continuous Vision-and-Language Navigation
by: He, Zongtao, et al.
Published: (2023)
AD-FM: Multimodal LLMs for Anomaly Detection via Multi-Stage Reasoning and Fine-Grained Reward Optimization
by: Liao, Jingyi, et al.
Published: (2025)
Reliable Reasoning in SVG-LLMs via Multi-Task Multi-Reward Reinforcement Learning
by: Wang, Haomin, et al.
Published: (2026)
Multi-Crit: Benchmarking Multimodal Judges on Pluralistic Criteria-Following
by: Xiong, Tianyi, et al.
Published: (2025)
QuEPT: Quantized Elastic Precision Transformers with One-Shot Calibration for Multi-Bit Switching
by: Xu, Ke, et al.
Published: (2026)
UltraAD: Fine-Grained Ultrasound Anomaly Classification via Few-Shot CLIP Adaptation
by: Zhou, Yue, et al.
Published: (2025)
Multi-scale Unified Network for Image Classification
by: Liu, Wenzhuo, et al.
Published: (2024)
MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection
by: He, Haoyang, et al.
Published: (2024)
Deep But Reliable: Advancing Multi-turn Reasoning for Thinking with Images
by: Yang, Wenhao, et al.
Published: (2025)
TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification
by: Liu, Qinying, et al.
Published: (2023)
Learned Rate Control for Frame-Level Adaptive Neural Video Compression via Dynamic Neural Network
by: Zhang, Chenhao, et al.
Published: (2025)
MECFormer: Multi-task Whole Slide Image Classification with Expert Consultation Network
by: Bui, Doanh C., et al.
Published: (2024)
Multi-Level CLS Token Fusion for Contrastive Learning in Endoscopy Image Classification
by: Nguyen, Y Hop, et al.
Published: (2025)
Low-Level Matters: An Efficient Hybrid Architecture for Robust Multi-frame Infrared Small Target Detection
by: Shen, Zhihua, et al.
Published: (2025)
Adaptive and Temporally Consistent Gaussian Surfels for Multi-view Dynamic Reconstruction
by: Chen, Decai, et al.
Published: (2024)
MMD-Thinker: Adaptive Multi-Dimensional Thinking for Multimodal Misinformation Detection
by: Wu, Junjie, et al.
Published: (2025)
MM-WLAuslan: Multi-View Multi-Modal Word-Level Australian Sign Language Recognition Dataset
by: Shen, Xin, et al.
Published: (2024)