:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Radevski, Gorjan
Format:	Preprint
Published:	2025
Subjects:	Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2512.20501
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Eff-GRot: Efficient and Generalizable Rotation Estimation with Transformers
by: Mathioulakis, Fanis, et al.
Published: (2025)

DAVE: Diagnostic benchmark for Audio Visual Evaluation
by: Radevski, Gorjan, et al.
Published: (2025)

Classifying Novel 3D-Printed Objects without Retraining: Towards Post-Production Automation in Additive Manufacturing
by: Mathioulakis, Fanis, et al.
Published: (2026)

Multimodal Knowledge Distillation for Egocentric Action Recognition Robust to Missing Modalities
by: Santos-Villafranca, Maria, et al.
Published: (2025)

Learning Modality Knowledge Alignment for Cross-Modality Transfer
by: Ma, Wenxuan, et al.
Published: (2024)

SkeFi: Cross-Modal Knowledge Transfer for Wireless Skeleton-Based Action Recognition
by: Huang, Shunyu, et al.
Published: (2026)

Enhancing Zero-Shot Facial Expression Recognition by LLM Knowledge Transfer
by: Zhao, Zengqun, et al.
Published: (2024)

Reading, Not Thinking: Understanding and Bridging the Modality Gap When Text Becomes Pixels in Multimodal LLMs
by: Sun, Kaiser, et al.
Published: (2026)

Cross-Modality Gait Recognition: Bridging LiDAR and Camera Modalities for Human Identification
by: Wang, Rui, et al.
Published: (2024)

Asymmetric Cross-Modal Knowledge Distillation: Bridging Modalities with Weak Semantic Consistency
by: Wei, Riling, et al.
Published: (2025)

Percept, Chat, and then Adapt: Multimodal Knowledge Transfer of Foundation Models for Open-World Video Recognition
by: Chen, Boyu, et al.
Published: (2024)

Bridging the Skeleton-Text Modality Gap: Diffusion-Powered Modality Alignment for Zero-shot Skeleton-based Action Recognition
by: Do, Jeonghyeok, et al.
Published: (2024)

Multimodal Emotion Recognition via Causal-Diffusion Bridge (Affect-Diff)
by: Sanjyal, Ankit
Published: (2026)

LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition
by: Li, Jinyuan, et al.
Published: (2024)

Towards Robust and Realible Multimodal Misinformation Recognition with Incomplete Modality
by: Zhou, Hengyang, et al.
Published: (2025)

Enhancing Modal Fusion by Alignment and Label Matching for Multimodal Emotion Recognition
by: Li, Qifei, et al.
Published: (2024)

Understanding Multimodal Procedural Knowledge by Sequencing Multimodal Instructional Manuals
by: Wu, Te-Lin, et al.
Published: (2021)

UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation
by: Li, Teng, et al.
Published: (2025)

ECMF: Enhanced Cross-Modal Fusion for Multimodal Emotion Recognition in MER-SEMI Challenge
by: Hu, Juewen, et al.
Published: (2025)

Robust Brain Tumor Segmentation with Incomplete MRI Modalities Using Hölder Divergence and Mutual Information-Enhanced Knowledge Transfer
by: Cheng, Runze, et al.
Published: (2025)

Correlation-Decoupled Knowledge Distillation for Multimodal Sentiment Analysis with Incomplete Modalities
by: Li, Mingcheng, et al.
Published: (2024)

Multi Teacher Privileged Knowledge Distillation for Multimodal Expression Recognition
by: Aslam, Muhammad Haseeb, et al.
Published: (2024)

Multimodal Prompt Learning with Missing Modalities for Sentiment Analysis and Emotion Recognition
by: Guo, Zirun, et al.
Published: (2024)

Bridging the Intent Gap: Knowledge-Enhanced Visual Generation
by: Cheng, Yi, et al.
Published: (2024)

HBridge: H-Shape Bridging of Heterogeneous Experts for Unified Multimodal Understanding and Generation
by: Wang, Xiang, et al.
Published: (2025)

Semantic-Enhanced Cross-Modal Place Recognition for Robust Robot Localization
by: Lin, Yujia, et al.
Published: (2025)

Category-Adaptive Cross-Modal Semantic Refinement and Transfer for Open-Vocabulary Multi-Label Recognition
by: Liu, Haijing, et al.
Published: (2024)

Bridging the Gap in Missing Modalities: Leveraging Knowledge Distillation and Style Matching for Brain Tumor Segmentation
by: Zhu, Shenghao, et al.
Published: (2025)

VKnowU: Evaluating Visual Knowledge Understanding in Multimodal LLMs
by: Jiang, Tianxiang, et al.
Published: (2025)

Multimodal Emotion Recognition with Vision-language Prompting and Modality Dropout
by: QI, Anbin, et al.
Published: (2024)

Beyond CLIP: Knowledge-Enhanced Multimodal Transformers for Cross-Modal Alignment in Diabetic Retinopathy Diagnosis
by: Samanta, Argha Kamal, et al.
Published: (2025)

Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding
by: Zhang, Le, et al.
Published: (2023)

IMUGPT 2.0: Language-Based Cross Modality Transfer for Sensor-Based Human Activity Recognition
by: Leng, Zikang, et al.
Published: (2024)

NocPlace: Nocturnal Visual Place Recognition via Generative and Inherited Knowledge Transfer
by: Liu, Bingxi, et al.
Published: (2024)

Enhancing Meme Emotion Understanding with Multi-Level Modality Enhancement and Dual-Stage Modal Fusion
by: Shi, Yi, et al.
Published: (2025)

CAE-DFKD: Bridging the Transferability Gap in Data-Free Knowledge Distillation
by: Zhang, Zherui, et al.
Published: (2025)

Knowledge-Enhanced Facial Expression Recognition with Emotional-to-Neutral Transformation
by: Li, Hangyu, et al.
Published: (2024)

Bridging Modalities via Progressive Re-alignment for Multimodal Test-Time Adaptation
by: Li, Jiacheng, et al.
Published: (2025)

Towards Unified Multimodal Editing with Enhanced Knowledge Collaboration
by: Pan, Kaihang, et al.
Published: (2024)

Enhancing Knowledge Transfer in Hyperspectral Image Classification via Cross-scene Knowledge Integration
by: Huo, Lu, et al.
Published: (2025)