Saved in:
| Main Authors: | Liufu, Xing, Tan, Chaolei, Lin, Xiaotong, Qi, Yonggang, Li, Jinxuan, Hu, Jian-Fang |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.12892 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Image-to-Video Transfer Learning based on Image-Language Foundation Models: A Comprehensive Survey
by: Li, Jinxuan, et al.
Published: (2025)
by: Li, Jinxuan, et al.
Published: (2025)
TubeRMC: Tube-conditioned Reconstruction with Mutual Constraints for Weakly-supervised Spatio-Temporal Video Grounding
by: Li, Jinxuan, et al.
Published: (2025)
by: Li, Jinxuan, et al.
Published: (2025)
Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding
by: Tan, Chaolei, et al.
Published: (2024)
by: Tan, Chaolei, et al.
Published: (2024)
Epistemic Uncertainty for Generated Image Detection
by: Nie, Jun, et al.
Published: (2024)
by: Nie, Jun, et al.
Published: (2024)
Weakly Supervised Camouflaged Object Detection Based on the SAM Model and Mask Guidance
by: Li, Xia, et al.
Published: (2026)
by: Li, Xia, et al.
Published: (2026)
PG-SAM: Prior-Guided SAM with Medical for Multi-organ Segmentation
by: Zhong, Yiheng, et al.
Published: (2025)
by: Zhong, Yiheng, et al.
Published: (2025)
Ranking Distillation for Open-Ended Video Question Answering with Insufficient Labels
by: Liang, Tianming, et al.
Published: (2024)
by: Liang, Tianming, et al.
Published: (2024)
MGHFT: Multi-Granularity Hierarchical Fusion Transformer for Cross-Modal Sticker Emotion Recognition
by: Chen, Jian, et al.
Published: (2025)
by: Chen, Jian, et al.
Published: (2025)
ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations
by: Liang, Tianming, et al.
Published: (2025)
by: Liang, Tianming, et al.
Published: (2025)
Temporal Continual Learning with Prior Compensation for Human Motion Prediction
by: Tang, Jianwei, et al.
Published: (2025)
by: Tang, Jianwei, et al.
Published: (2025)
Challenge Summary U-MedSAM: Uncertainty-aware MedSAM for Medical Image Segmentation
by: Wang, Xin, et al.
Published: (2024)
by: Wang, Xin, et al.
Published: (2024)
Privacy-Preserving SAM Quantization for Efficient Edge Intelligence in Healthcare
by: Li, Zhikai, et al.
Published: (2024)
by: Li, Zhikai, et al.
Published: (2024)
SAM-Sode: Towards Faithful Explanations for Tiny Bacteria Detection
by: Tan, Wanying, et al.
Published: (2026)
by: Tan, Wanying, et al.
Published: (2026)
SynopGround: A Large-Scale Dataset for Multi-Paragraph Video Grounding from TV Dramas and Synopses
by: Tan, Chaolei, et al.
Published: (2024)
by: Tan, Chaolei, et al.
Published: (2024)
Uncertainty-Encoded Multi-Modal Fusion for Robust Object Detection in Autonomous Driving
by: Lou, Yang, et al.
Published: (2023)
by: Lou, Yang, et al.
Published: (2023)
SAM-COD: SAM-guided Unified Framework for Weakly-Supervised Camouflaged Object Detection
by: Chen, Huafeng, et al.
Published: (2024)
by: Chen, Huafeng, et al.
Published: (2024)
3DTeethSAM: Taming SAM2 for 3D Teeth Segmentation
by: Lu, Zhiguo, et al.
Published: (2025)
by: Lu, Zhiguo, et al.
Published: (2025)
GeoSAM: Fine-tuning SAM with Multi-Modal Prompts for Mobility Infrastructure Segmentation
by: Sultan, Rafi Ibn, et al.
Published: (2023)
by: Sultan, Rafi Ibn, et al.
Published: (2023)
AutoProSAM: Automated Prompting SAM for 3D Multi-Organ Segmentation
by: Li, Chengyin, et al.
Published: (2023)
by: Li, Chengyin, et al.
Published: (2023)
Long-RVOS: A Comprehensive Benchmark for Long-term Referring Video Object Segmentation
by: Liang, Tianming, et al.
Published: (2025)
by: Liang, Tianming, et al.
Published: (2025)
OVS-DINO: Open-Vocabulary Segmentation via Structure-Aligned SAM-DINO with Language Guidance
by: Zeng, Haoxi, et al.
Published: (2026)
by: Zeng, Haoxi, et al.
Published: (2026)
SDCoNet: Saliency-Driven Multi-Task Collaborative Network for Remote Sensing Object Detection
by: Qi, Ruo, et al.
Published: (2026)
by: Qi, Ruo, et al.
Published: (2026)
Generalizable Sensor-Based Activity Recognition via Categorical Concept Invariant Learning
by: Xiong, Di, et al.
Published: (2024)
by: Xiong, Di, et al.
Published: (2024)
Progressive Pretext Task Learning for Human Trajectory Prediction
by: Lin, Xiaotong, et al.
Published: (2024)
by: Lin, Xiaotong, et al.
Published: (2024)
MedSAM-U: Uncertainty-Guided Auto Multi-Prompt Adaptation for Reliable MedSAM
by: Zhou, Nan, et al.
Published: (2024)
by: Zhou, Nan, et al.
Published: (2024)
AMA-SAM: Adversarial Multi-Domain Alignment of Segment Anything Model for High-Fidelity Histology Nuclei Segmentation
by: Qian, Jiahe, et al.
Published: (2025)
by: Qian, Jiahe, et al.
Published: (2025)
SAM 2++: Tracking Anything at Any Granularity
by: Zhang, Jiaming, et al.
Published: (2025)
by: Zhang, Jiaming, et al.
Published: (2025)
Null-LoRA: Low-Rank Adaptation on Null Space
by: Zhang, Yi, et al.
Published: (2025)
by: Zhang, Yi, et al.
Published: (2025)
Taming Diffusion Probabilistic Models for Character Control
by: Chen, Rui, et al.
Published: (2024)
by: Chen, Rui, et al.
Published: (2024)
AVGen-Bench: A Task-Driven Benchmark for Multi-Granular Evaluation of Text-to-Audio-Video Generation
by: Zhou, Ziwei, et al.
Published: (2026)
by: Zhou, Ziwei, et al.
Published: (2026)
Multi-Granularity Hand Action Detection
by: Zhe, Ting, et al.
Published: (2023)
by: Zhe, Ting, et al.
Published: (2023)
From Global to Granular: Revealing IQA Model Performance via Correlation Surface
by: Chen, Baoliang, et al.
Published: (2026)
by: Chen, Baoliang, et al.
Published: (2026)
EdgeSAM: Prompt-In-the-Loop Distillation for SAM
by: Zhou, Chong, et al.
Published: (2023)
by: Zhou, Chong, et al.
Published: (2023)
An Integrated Framework for Multi-Granular Explanation of Video Summarization
by: Tsigos, Konstantinos, et al.
Published: (2024)
by: Tsigos, Konstantinos, et al.
Published: (2024)
EdgeSync: Accelerating Edge-Model Updates for Data Drift through Adaptive Continuous Learning
by: Donga, Runchu, et al.
Published: (2025)
by: Donga, Runchu, et al.
Published: (2025)
MedSAM-Agent: Empowering Interactive Medical Image Segmentation with Multi-turn Agentic Reinforcement Learning
by: Liu, Shengyuan, et al.
Published: (2026)
by: Liu, Shengyuan, et al.
Published: (2026)
RayMamba: Ray-Aligned Serialization for Long-Range 3D Object Detection
by: Lu, Cheng, et al.
Published: (2026)
by: Lu, Cheng, et al.
Published: (2026)
Aligned Vector Quantization for Edge-Cloud Collabrative Vision-Language Models
by: Liu, Xiao, et al.
Published: (2024)
by: Liu, Xiao, et al.
Published: (2024)
Granular Computing-driven SAM: From Coarse-to-Fine Guidance for Prompt-Free Segmentation
by: Yu, Qiyang, et al.
Published: (2025)
by: Yu, Qiyang, et al.
Published: (2025)
OmniOVCD: Streamlining Open-Vocabulary Change Detection with SAM 3
by: Zhang, Xu, et al.
Published: (2026)
by: Zhang, Xu, et al.
Published: (2026)
Similar Items
-
Image-to-Video Transfer Learning based on Image-Language Foundation Models: A Comprehensive Survey
by: Li, Jinxuan, et al.
Published: (2025) -
TubeRMC: Tube-conditioned Reconstruction with Mutual Constraints for Weakly-supervised Spatio-Temporal Video Grounding
by: Li, Jinxuan, et al.
Published: (2025) -
Siamese Learning with Joint Alignment and Regression for Weakly-Supervised Video Paragraph Grounding
by: Tan, Chaolei, et al.
Published: (2024) -
Epistemic Uncertainty for Generated Image Detection
by: Nie, Jun, et al.
Published: (2024) -
Weakly Supervised Camouflaged Object Detection Based on the SAM Model and Mask Guidance
by: Li, Xia, et al.
Published: (2026)