Saved in:
| Main Authors: | Shi, Jinghao, Shen, Xiang, Zhao, Kaili, Wang, Xuedong, Wen, Vera, Wang, Zixuan, Wu, Yifan, Zhang, Zhixin |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.03038 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Embedding-based Retrieval in Multimodal Content Moderation
by: Liang, Hanzhong, et al.
Published: (2025)
by: Liang, Hanzhong, et al.
Published: (2025)
Confidence-aware Contrastive Learning for Selective Classification
by: Wu, Yu-Chang, et al.
Published: (2024)
by: Wu, Yu-Chang, et al.
Published: (2024)
JDCNet: Confidence-Gated Privileged-Modality Distillation for Cost-Preserving X-ray Inference
by: Ma, Bo, et al.
Published: (2026)
by: Ma, Bo, et al.
Published: (2026)
Confidence-aware multi-modality learning for eye disease screening
by: Zou, Ke, et al.
Published: (2024)
by: Zou, Ke, et al.
Published: (2024)
VideoDistill: Language-aware Vision Distillation for Video Question Answering
by: Zou, Bo, et al.
Published: (2024)
by: Zou, Bo, et al.
Published: (2024)
Consistency-aware Fake Videos Detection on Short Video Platforms
by: Wang, Junxi, et al.
Published: (2025)
by: Wang, Junxi, et al.
Published: (2025)
Filter-And-Refine: A MLLM Based Cascade System for Industrial-Scale Video Content Moderation
by: Wang, Zixuan, et al.
Published: (2025)
by: Wang, Zixuan, et al.
Published: (2025)
Reasoning-Enhanced Domain-Adaptive Pretraining of Multimodal Large Language Models for Short Video Content Governance
by: Wang, Zixuan, et al.
Published: (2025)
by: Wang, Zixuan, et al.
Published: (2025)
Deterministic Object Pose Confidence Region Estimation
by: Wang, Jinghao, et al.
Published: (2025)
by: Wang, Jinghao, et al.
Published: (2025)
MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
by: Zhang, Yuang, et al.
Published: (2024)
by: Zhang, Yuang, et al.
Published: (2024)
When Rules Fall Short: Agent-Driven Discovery of Emerging Content Issues in Short Video Platforms
by: Yu, Chenghui, et al.
Published: (2026)
by: Yu, Chenghui, et al.
Published: (2026)
Multi Teacher Privileged Knowledge Distillation for Multimodal Expression Recognition
by: Aslam, Muhammad Haseeb, et al.
Published: (2024)
by: Aslam, Muhammad Haseeb, et al.
Published: (2024)
High-Order Progressive Trajectory Matching for Medical Image Dataset Distillation
by: Dong, Le, et al.
Published: (2025)
by: Dong, Le, et al.
Published: (2025)
Efficient Multi-Slide Visual-Language Feature Fusion for Placental Disease Classification
by: Guo, Hang, et al.
Published: (2025)
by: Guo, Hang, et al.
Published: (2025)
OSA: Echocardiography Video Segmentation via Orthogonalized State Update and Anatomical Prior-aware Feature Enhancement
by: Wang, Rui, et al.
Published: (2026)
by: Wang, Rui, et al.
Published: (2026)
VividCam: Learning Unconventional Camera Motions from Virtual Synthetic Videos
by: Wu, Qiucheng, et al.
Published: (2025)
by: Wu, Qiucheng, et al.
Published: (2025)
Distill Video Datasets into Images
by: Zhao, Zhenghao, et al.
Published: (2025)
by: Zhao, Zhenghao, et al.
Published: (2025)
AnyLogo: Symbiotic Subject-Driven Diffusion System with Gemini Status
by: Zhang, Jinghao, et al.
Published: (2024)
by: Zhang, Jinghao, et al.
Published: (2024)
BiCoR-Seg: Bidirectional Co-Refinement Framework for High-Resolution Remote Sensing Image Segmentation
by: Shi, Jinghao, et al.
Published: (2025)
by: Shi, Jinghao, et al.
Published: (2025)
An Ensemble Approach to Short-form Video Quality Assessment Using Multimodal LLM
by: Wen, Wen, et al.
Published: (2024)
by: Wen, Wen, et al.
Published: (2024)
Beyond the Last Frame: Process-aware Evaluation for Generative Video Reasoning
by: Li, Yifan, et al.
Published: (2025)
by: Li, Yifan, et al.
Published: (2025)
Towards Metric-Aware Multi-Person Mesh Recovery by Jointly Optimizing Human Crowd in Camera Space
by: Wang, Kaiwen, et al.
Published: (2025)
by: Wang, Kaiwen, et al.
Published: (2025)
Distilling Privileged Multimodal Information for Expression Recognition using Optimal Transport
by: Aslam, Muhammad Haseeb, et al.
Published: (2024)
by: Aslam, Muhammad Haseeb, et al.
Published: (2024)
Knowledge Distillation via the Target-aware Transformer
by: Lin, Sihao, et al.
Published: (2022)
by: Lin, Sihao, et al.
Published: (2022)
WaDi: Weight Direction-aware Distillation for One-step Image Synthesis
by: Wang, Lei, et al.
Published: (2026)
by: Wang, Lei, et al.
Published: (2026)
Video Set Distillation: Information Diversification and Temporal Densification
by: Zhao, Yinjie, et al.
Published: (2024)
by: Zhao, Yinjie, et al.
Published: (2024)
A Survey on Backbones for Deep Video Action Recognition
by: Tang, Zixuan, et al.
Published: (2024)
by: Tang, Zixuan, et al.
Published: (2024)
Uncertainty-aware Medical Diagnostic Phrase Identification and Grounding
by: Zou, Ke, et al.
Published: (2024)
by: Zou, Ke, et al.
Published: (2024)
InsectMamba: Insect Pest Classification with State Space Model
by: Wang, Qianning, et al.
Published: (2024)
by: Wang, Qianning, et al.
Published: (2024)
OmniMem: Scalable and Adaptive Memory Retrieval for Long Video Generation
by: Zhao, Lin, et al.
Published: (2026)
by: Zhao, Lin, et al.
Published: (2026)
GAGS: Granularity-Aware Feature Distillation for Language Gaussian Splatting
by: Peng, Yuning, et al.
Published: (2024)
by: Peng, Yuning, et al.
Published: (2024)
SGAD: Semantic and Geometric-aware Descriptor for Local Feature Matching
by: Liu, Xiangzeng, et al.
Published: (2025)
by: Liu, Xiangzeng, et al.
Published: (2025)
Aerial View River Landform Video segmentation: A Weakly Supervised Context-aware Temporal Consistency Distillation Approach
by: Chen, Chi-Han, et al.
Published: (2025)
by: Chen, Chi-Han, et al.
Published: (2025)
Knowledge Guided Entity-aware Video Captioning and A Basketball Benchmark
by: Xi, Zeyu, et al.
Published: (2024)
by: Xi, Zeyu, et al.
Published: (2024)
Towards Adversarially Robust Dataset Distillation by Curvature Regularization
by: Xue, Eric, et al.
Published: (2024)
by: Xue, Eric, et al.
Published: (2024)
Confidence-Aware RGB-D Face Recognition via Virtual Depth Synthesis
by: Chen, Zijian, et al.
Published: (2024)
by: Chen, Zijian, et al.
Published: (2024)
Dynamic-Aware Video Distillation: Optimizing Temporal Resolution Based on Video Semantics
by: Zhao, Yinjie, et al.
Published: (2025)
by: Zhao, Yinjie, et al.
Published: (2025)
BoxComm: Benchmarking Category-Aware Commentary Generation and Narration Rhythm in Boxing
by: Wang, Kaiwen, et al.
Published: (2026)
by: Wang, Kaiwen, et al.
Published: (2026)
SimCast: Enhancing Precipitation Nowcasting with Short-to-Long Term Knowledge Distillation
by: Yin, Yifang, et al.
Published: (2025)
by: Yin, Yifang, et al.
Published: (2025)
Interpretable Medical Image Classification using Prototype Learning and Privileged Information
by: Gallee, Luisa, et al.
Published: (2023)
by: Gallee, Luisa, et al.
Published: (2023)
Similar Items
-
Embedding-based Retrieval in Multimodal Content Moderation
by: Liang, Hanzhong, et al.
Published: (2025) -
Confidence-aware Contrastive Learning for Selective Classification
by: Wu, Yu-Chang, et al.
Published: (2024) -
JDCNet: Confidence-Gated Privileged-Modality Distillation for Cost-Preserving X-ray Inference
by: Ma, Bo, et al.
Published: (2026) -
Confidence-aware multi-modality learning for eye disease screening
by: Zou, Ke, et al.
Published: (2024) -
VideoDistill: Language-aware Vision Distillation for Video Question Answering
by: Zou, Bo, et al.
Published: (2024)