Saved in:
| Main Authors: | Barreto, Jesimon, Caetano, Carlos, Araujo, André, Schwartz, William Robson |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.20994 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Rethinking Image-to-Video Adaptation: An Object-centric Perspective
by: Qian, Rui, et al.
Published: (2024)
by: Qian, Rui, et al.
Published: (2024)
Advancing Video Self-Supervised Learning via Image Foundation Models
by: Wu, Jingwei, et al.
Published: (2025)
by: Wu, Jingwei, et al.
Published: (2025)
Object-centric Video Question Answering with Visual Grounding and Referring
by: Wang, Haochen, et al.
Published: (2025)
by: Wang, Haochen, et al.
Published: (2025)
Test-Time Adaptation for Height Completion via Self-Supervised ViT Features and Monocular Foundation Models
by: Rafaeli, Osher, et al.
Published: (2026)
by: Rafaeli, Osher, et al.
Published: (2026)
EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos
by: Xu, Jilan, et al.
Published: (2025)
by: Xu, Jilan, et al.
Published: (2025)
Scalable Adaptation of 3D Geometric Foundation Models via Weak Supervision from Internet Video
by: Gao, Zihui, et al.
Published: (2026)
by: Gao, Zihui, et al.
Published: (2026)
Weakly Supervised Concept Learning for Object-centric Visual Reasoning
by: Tiwari, Sparsh, et al.
Published: (2026)
by: Tiwari, Sparsh, et al.
Published: (2026)
EgoVideo: Exploring Egocentric Foundation Model and Downstream Adaptation
by: Pei, Baoqi, et al.
Published: (2024)
by: Pei, Baoqi, et al.
Published: (2024)
Semantics Meets Temporal Correspondence: Self-supervised Object-centric Learning in Videos
by: Qian, Rui, et al.
Published: (2023)
by: Qian, Rui, et al.
Published: (2023)
InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision
by: Wang, Chenting, et al.
Published: (2025)
by: Wang, Chenting, et al.
Published: (2025)
Self-Consistent Model-based Adaptation for Visual Reinforcement Learning
by: Zhou, Xinning, et al.
Published: (2025)
by: Zhou, Xinning, et al.
Published: (2025)
Dynamic in Static: Hybrid Visual Correspondence for Self-Supervised Video Object Segmentation
by: Pei, Gensheng, et al.
Published: (2024)
by: Pei, Gensheng, et al.
Published: (2024)
AdaCropFollow: Self-Supervised Online Adaptation for Visual Under-Canopy Navigation
by: Sivakumar, Arun N., et al.
Published: (2024)
by: Sivakumar, Arun N., et al.
Published: (2024)
Enhancing License Plate Super-Resolution: A Layout-Aware and Character-Driven Approach
by: Nascimento, Valfride, et al.
Published: (2024)
by: Nascimento, Valfride, et al.
Published: (2024)
Improving the Generalization of Segmentation Foundation Model under Distribution Shift via Weakly Supervised Adaptation
by: Zhang, Haojie, et al.
Published: (2023)
by: Zhang, Haojie, et al.
Published: (2023)
Efficient Self-Supervised Adaptation for Medical Image Analysis
by: Sorkhei, Moein, et al.
Published: (2025)
by: Sorkhei, Moein, et al.
Published: (2025)
Towards More General Video-based Deepfake Detection through Facial Component Guided Adaptation for Foundation Model
by: Han, Yue-Hua, et al.
Published: (2024)
by: Han, Yue-Hua, et al.
Published: (2024)
VideoSAVi: Self-Aligned Video Language Models without Human Supervision
by: Kulkarni, Yogesh, et al.
Published: (2024)
by: Kulkarni, Yogesh, et al.
Published: (2024)
Decorrelation-based Self-Supervised Visual Representation Learning for Writer Identification
by: Maitra, Arkadip, et al.
Published: (2024)
by: Maitra, Arkadip, et al.
Published: (2024)
VideoSSR: Video Self-Supervised Reinforcement Learning
by: He, Zefeng, et al.
Published: (2025)
by: He, Zefeng, et al.
Published: (2025)
Closer to Reality: Practical Semi-Supervised Federated Learning for Foundation Model Adaptation
by: Sun, Guangyu, et al.
Published: (2025)
by: Sun, Guangyu, et al.
Published: (2025)
How Effective are Self-Supervised Models for Contact Identification in Videos
by: Gunawardhana, Malitha, et al.
Published: (2024)
by: Gunawardhana, Malitha, et al.
Published: (2024)
Robust Adaptation of Foundation Models with Black-Box Visual Prompting
by: Oh, Changdae, et al.
Published: (2024)
by: Oh, Changdae, et al.
Published: (2024)
SelfHVD: Self-Supervised Handheld Video Deblurring
by: Xu, Honglei, et al.
Published: (2025)
by: Xu, Honglei, et al.
Published: (2025)
Supervised Fine-tuning in turn Improves Visual Foundation Models
by: Jiang, Xiaohu, et al.
Published: (2024)
by: Jiang, Xiaohu, et al.
Published: (2024)
OmniGlue: Generalizable Feature Matching with Foundation Model Guidance
by: Jiang, Hanwen, et al.
Published: (2024)
by: Jiang, Hanwen, et al.
Published: (2024)
Deep Weakly-Supervised Domain Adaptation for Pain Localization in Videos
by: Praveen, R. Gnana, et al.
Published: (2019)
by: Praveen, R. Gnana, et al.
Published: (2019)
When Test-Time Adaptation Meets Self-Supervised Models
by: Han, Jisu, et al.
Published: (2025)
by: Han, Jisu, et al.
Published: (2025)
Self-Supervised Contrastive Embedding Adaptation for Endoscopic Image Matching
by: Rota, Alberto, et al.
Published: (2025)
by: Rota, Alberto, et al.
Published: (2025)
EVDI++: Event-based Video Deblurring and Interpolation via Self-Supervised Learning
by: Zhang, Chi, et al.
Published: (2025)
by: Zhang, Chi, et al.
Published: (2025)
Self-Supervised Video Desmoking for Laparoscopic Surgery
by: Wu, Renlong, et al.
Published: (2024)
by: Wu, Renlong, et al.
Published: (2024)
Self-Supervised Animal Identification for Long Videos
by: Fang, Xuyang, et al.
Published: (2026)
by: Fang, Xuyang, et al.
Published: (2026)
Enhancing Generalization of Depth Estimation Foundation Model via Weakly-Supervised Adaptation with Regularization
by: Huang, Yan, et al.
Published: (2025)
by: Huang, Yan, et al.
Published: (2025)
Reshoot-Anything: A Self-Supervised Model for In-the-Wild Video Reshooting
by: Paliwal, Avinash, et al.
Published: (2026)
by: Paliwal, Avinash, et al.
Published: (2026)
Zero-Shot Image Anomaly Detection Using Generative Foundation Models
by: Abdi, Lemar, et al.
Published: (2025)
by: Abdi, Lemar, et al.
Published: (2025)
Generalized Semi-Supervised Learning via Self-Supervised Feature Adaptation
by: Liang, Jiachen, et al.
Published: (2024)
by: Liang, Jiachen, et al.
Published: (2024)
InsEdit: Towards Instruction-based Visual Editing via Data-Efficient Video Diffusion Models Adaptation
by: Rao, Zhefan, et al.
Published: (2026)
by: Rao, Zhefan, et al.
Published: (2026)
Large-scale Self-supervised Video Foundation Model for Intelligent Surgery
by: Yang, Shu, et al.
Published: (2025)
by: Yang, Shu, et al.
Published: (2025)
SelfPrompt: Confidence-Aware Semi-Supervised Tuning for Robust Vision-Language Model Adaptation
by: Roy, Shuvendu, et al.
Published: (2025)
by: Roy, Shuvendu, et al.
Published: (2025)
ORV: 4D Occupancy-centric Robot Video Generation
by: Yang, Xiuyu, et al.
Published: (2025)
by: Yang, Xiuyu, et al.
Published: (2025)
Similar Items
-
Rethinking Image-to-Video Adaptation: An Object-centric Perspective
by: Qian, Rui, et al.
Published: (2024) -
Advancing Video Self-Supervised Learning via Image Foundation Models
by: Wu, Jingwei, et al.
Published: (2025) -
Object-centric Video Question Answering with Visual Grounding and Referring
by: Wang, Haochen, et al.
Published: (2025) -
Test-Time Adaptation for Height Completion via Self-Supervised ViT Features and Monocular Foundation Models
by: Rafaeli, Osher, et al.
Published: (2026) -
EgoExo-Gen: Ego-centric Video Prediction by Watching Exo-centric Videos
by: Xu, Jilan, et al.
Published: (2025)