Bibliographic Details
Main Authors: Yang, Chao-Han Huck; Ghosh, Sreyan; Wang, Qing; Kim, Jaeyeon; Hong, Hengyi; Kumar, Sonal; Zhong, Guirui; Kong, Zhifeng; Sakshi, S; Lokegaonkar, Vaibhavi; Nieto, Oriol; Duraiswami, Ramani; Manocha, Dinesh; Kim, Gunhee; Du, Jun; Valle, Rafael; Catanzaro, Bryan
Format: Preprint
Published: 2025
Subjects:
Sound
Artificial Intelligence
Computation and Language
Multimedia
Audio and Speech Processing
Online Access: https://arxiv.org/abs/2505.07365

Similar Items

  • SPUR: A Plug-and-Play Framework for Integrating Spatial Audio Understanding and Reasoning into Large Audio-Language Models
    by: Sakshi, S, et al.
    Published: (2025)
  • Audio Flamingo 3: Advancing Audio Intelligence with Fully Open Large Audio Language Models
    by: Goel, Arushi, et al.
    Published: (2025)
  • Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities
    by: Ghosh, Sreyan, et al.
    Published: (2025)
  • MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark
    by: Sakshi, S, et al.
    Published: (2024)
  • ReCLAP: Improving Zero Shot Audio Classification by Describing Sounds
    by: Ghosh, Sreyan, et al.
    Published: (2024)
