Bibliographic Details
Main Authors: Xie, Liuyue; Kuthiala, Avik; Wei, George Z.; Zheng, Ce; Bal, Ananya; Dabhi, Mosam; Wen, Liting; Rustagi, Taru; Lai, Ethan; Khyalia, Sushil; Choudhury, Rohan; Ziyadi, Morteza; Zhang, Xu; Yang, Hao; Jeni, László A.
Format: Preprint
Published: 2025
Subjects:
Multimedia
Artificial Intelligence
Computer Vision and Pattern Recognition
Sound
Audio and Speech Processing
Online Access: https://arxiv.org/abs/2503.21699


Similar Items

  • Unified Spherical Frontend: Learning Rotation-Equivariant Representations of Spherical Images from Any Camera
    by: Yu, Mukai, et al.
    Published: (2025)
  • 3D-LFM: Lifting Foundation Model
    by: Dabhi, Mosam, et al.
    Published: (2023)
  • MusiCRS: Benchmarking Audio-Centric Conversational Recommendation
    by: Surana, Rohan, et al.
    Published: (2025)
  • Teacher-Guided Pseudo Supervision and Cross-Modal Alignment for Audio-Visual Video Parsing
    by: Chen, Yaru, et al.
    Published: (2025)
  • MaskAnyone Toolkit: Offering Strategies for Minimizing Privacy Risks and Maximizing Utility in Audio-Visual Data Archiving
    by: Owoyele, Babajide Alamu, et al.
    Published: (2024)
