:: Library Catalog

Bibliographic Details
Main Authors:	Shi, Jinghao; Shen, Xiang; Zhao, Kaili; Wang, Xuedong; Wen, Vera; Wang, Zixuan; Wu, Yifan; Zhang, Zhixin
Format:	Preprint
Published:	2024
Subjects:	Machine Learning; Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2410.03038

Similar Items

Embedding-based Retrieval in Multimodal Content Moderation
by: Liang, Hanzhong, et al.
Published: (2025)

Confidence-aware Contrastive Learning for Selective Classification
by: Wu, Yu-Chang, et al.
Published: (2024)

JDCNet: Confidence-Gated Privileged-Modality Distillation for Cost-Preserving X-ray Inference
by: Ma, Bo, et al.
Published: (2026)

Confidence-aware multi-modality learning for eye disease screening
by: Zou, Ke, et al.
Published: (2024)

VideoDistill: Language-aware Vision Distillation for Video Question Answering
by: Zou, Bo, et al.
Published: (2024)

Consistency-aware Fake Videos Detection on Short Video Platforms
by: Wang, Junxi, et al.
Published: (2025)

Filter-And-Refine: A MLLM Based Cascade System for Industrial-Scale Video Content Moderation
by: Wang, Zixuan, et al.
Published: (2025)

Reasoning-Enhanced Domain-Adaptive Pretraining of Multimodal Large Language Models for Short Video Content Governance
by: Wang, Zixuan, et al.
Published: (2025)

Deterministic Object Pose Confidence Region Estimation
by: Wang, Jinghao, et al.
Published: (2025)

MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
by: Zhang, Yuang, et al.
Published: (2024)

When Rules Fall Short: Agent-Driven Discovery of Emerging Content Issues in Short Video Platforms
by: Yu, Chenghui, et al.
Published: (2026)

Multi Teacher Privileged Knowledge Distillation for Multimodal Expression Recognition
by: Aslam, Muhammad Haseeb, et al.
Published: (2024)

High-Order Progressive Trajectory Matching for Medical Image Dataset Distillation
by: Dong, Le, et al.
Published: (2025)

Efficient Multi-Slide Visual-Language Feature Fusion for Placental Disease Classification
by: Guo, Hang, et al.
Published: (2025)

OSA: Echocardiography Video Segmentation via Orthogonalized State Update and Anatomical Prior-aware Feature Enhancement
by: Wang, Rui, et al.
Published: (2026)

VividCam: Learning Unconventional Camera Motions from Virtual Synthetic Videos
by: Wu, Qiucheng, et al.
Published: (2025)

Distill Video Datasets into Images
by: Zhao, Zhenghao, et al.
Published: (2025)

AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status
by: Zhang, Jinghao, et al.
Published: (2024)

BiCoR-Seg: Bidirectional Co-Refinement Framework for High-Resolution Remote Sensing Image Segmentation
by: Shi, Jinghao, et al.
Published: (2025)

An Ensemble Approach to Short-form Video Quality Assessment Using Multimodal LLM
by: Wen, Wen, et al.
Published: (2024)

Beyond the Last Frame: Process-aware Evaluation for Generative Video Reasoning
by: Li, Yifan, et al.
Published: (2025)

Towards Metric-Aware Multi-Person Mesh Recovery by Jointly Optimizing Human Crowd in Camera Space
by: Wang, Kaiwen, et al.
Published: (2025)

Distilling Privileged Multimodal Information for Expression Recognition using Optimal Transport
by: Aslam, Muhammad Haseeb, et al.
Published: (2024)

Knowledge Distillation via the Target-aware Transformer
by: Lin, Sihao, et al.
Published: (2022)

WaDi: Weight Direction-aware Distillation for One-step Image Synthesis
by: Wang, Lei, et al.
Published: (2026)

Video Set Distillation: Information Diversification and Temporal Densification
by: Zhao, Yinjie, et al.
Published: (2024)

A Survey on Backbones for Deep Video Action Recognition
by: Tang, Zixuan, et al.
Published: (2024)

Uncertainty-aware Medical Diagnostic Phrase Identification and Grounding
by: Zou, Ke, et al.
Published: (2024)

InsectMamba: Insect Pest Classification with State Space Model
by: Wang, Qianning, et al.
Published: (2024)

OmniMem: Scalable and Adaptive Memory Retrieval for Long Video Generation
by: Zhao, Lin, et al.
Published: (2026)

GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting
by: Peng, Yuning, et al.
Published: (2024)

SGAD: Semantic and Geometric-aware Descriptor for Local Feature Matching
by: Liu, Xiangzeng, et al.
Published: (2025)

Aerial View River Landform Video segmentation: A Weakly Supervised Context-aware Temporal Consistency Distillation Approach
by: Chen, Chi-Han, et al.
Published: (2025)

Knowledge Guided Entity-aware Video Captioning and A Basketball Benchmark
by: Xi, Zeyu, et al.
Published: (2024)

Towards Adversarially Robust Dataset Distillation by Curvature Regularization
by: Xue, Eric, et al.
Published: (2024)

Confidence-Aware RGB-D Face Recognition via Virtual Depth Synthesis
by: Chen, Zijian, et al.
Published: (2024)

Dynamic-Aware Video Distillation: Optimizing Temporal Resolution Based on Video Semantics
by: Zhao, Yinjie, et al.
Published: (2025)

BoxComm: Benchmarking Category-Aware Commentary Generation and Narration Rhythm in Boxing
by: Wang, Kaiwen, et al.
Published: (2026)

SimCast: Enhancing Precipitation Nowcasting with Short-to-Long Term Knowledge Distillation
by: Yin, Yifang, et al.
Published: (2025)

Interpretable Medical Image Classification using Prototype Learning and Privileged Information
by: Gallee, Luisa, et al.
Published: (2023)