Bibliographic Details
Main Authors: Moskvoretskii, Viktor; Alvandian, Narek
Format: Preprint
Published: 2025
Online Access: https://arxiv.org/abs/2502.15834
Abstract:
  • Coreset selection methods are effective in accelerating training and reducing memory requirements but remain largely unexplored in applied multimodal settings. We adapt a state-of-the-art (SoTA) coreset selection technique for multimodal data, focusing on the depth prediction task. Our experiments with embedding aggregation and dimensionality reduction approaches reveal the challenges of extending unimodal algorithms to multimodal scenarios, highlighting the need for specialized methods to better capture inter-modal relationships.
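The pipeline the abstract alludes to (aggregate per-modality embeddings, reduce their dimensionality, then select a coreset) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the function names are hypothetical, the inputs are placeholder arrays standing in for image and depth embeddings, and k-center greedy selection is swapped in as a simple, well-known stand-in. It is not the SoTA technique the authors adapt, which the record does not name.

```python
import numpy as np


def aggregate_embeddings(rgb_emb, depth_emb, mode="concat"):
    """Combine per-sample embeddings from two modalities (hypothetical helper)."""
    if mode == "concat":
        # Keep both modalities side by side: (n, d_rgb + d_depth).
        return np.concatenate([rgb_emb, depth_emb], axis=1)
    if mode == "mean":
        # Element-wise average; only meaningful when both share a dimension.
        if rgb_emb.shape != depth_emb.shape:
            raise ValueError("mean aggregation needs equal embedding shapes")
        return (rgb_emb + depth_emb) / 2.0
    raise ValueError(f"unknown aggregation mode: {mode}")


def pca_reduce(x, n_components):
    """Project rows of x onto their top principal components via SVD."""
    centered = x - x.mean(axis=0, keepdims=True)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T


def k_center_greedy(x, budget, seed=0):
    """Pick `budget` row indices by repeatedly taking the point farthest
    from the already selected set (assumed stand-in selection rule)."""
    rng = np.random.default_rng(seed)
    first = int(rng.integers(len(x)))
    selected = [first]
    # Distance from every point to its nearest selected point.
    min_dist = np.linalg.norm(x - x[first], axis=1)
    while len(selected) < budget:
        nxt = int(np.argmax(min_dist))
        selected.append(nxt)
        min_dist = np.minimum(min_dist, np.linalg.norm(x - x[nxt], axis=1))
    return selected


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    rgb = rng.normal(size=(200, 64))    # placeholder image embeddings
    depth = rng.normal(size=(200, 32))  # placeholder depth embeddings
    joint = aggregate_embeddings(rgb, depth, mode="concat")
    reduced = pca_reduce(joint, n_components=8)
    coreset = k_center_greedy(reduced, budget=20)
    print(len(coreset), reduced.shape)
```

Concatenation keeps the modalities separate but lets the larger embedding dominate distances, while averaging forces a shared dimension. Trade-offs of this kind are one plausible reason a unimodal selection rule may transfer poorly, in line with the difficulties the abstract reports.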