:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Morais, Giovana, McFee, Brian, Fuentes, Magdalena
Format:	Preprint
Published:	2025
Subjects:	Sound
Online Access:	https://arxiv.org/abs/2502.12972
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Investigating Modality Contribution in Audio LLMs for Music
by: Morais, Giovana, et al.
Published: (2025)

Evaluating Compositional Structure in Audio Representations
by: Chen, Chuyang, et al.
Published: (2026)

Musical Source Separation of Brazilian Percussion
by: Namballa, Richa, et al.
Published: (2025)

Hybrid Losses for Hierarchical Embedding Learning
by: Tian, Haokun, et al.
Published: (2025)

Spatial Scaper: A Library to Simulate and Augment Soundscapes for Sound Event Localization and Detection in Realistic Rooms
by: Roman, Iran R., et al.
Published: (2024)

BeatFM: Improving Beat Tracking with Pre-trained Music Foundation Model
by: Ru, Ganghui, et al.
Published: (2025)

Sound Scene Synthesis at the DCASE 2024 Challenge
by: Lagrange, Mathieu, et al.
Published: (2025)

Challenge on Sound Scene Synthesis: Evaluating Text-to-Audio Generation
by: Lee, Junwon, et al.
Published: (2024)

Beat Tracking as Object Detection
by: Ahn, Jaehoon, et al.
Published: (2025)

Feature Article: The economics of narrow self‐interest – five key trends
by: Innes McFee
Published: (2025)

Feature Article: Revisiting the global impact from a slowing China
by: Innes McFee
Published: (2024)

Feature Article: How GenAI will change the world economy
by: Innes McFee
Published: (2024)

Osu2MIR: Beat Tracking Dataset Derived From Osu! Data
by: Liu, Ziyun, et al.
Published: (2025)

Common names applied to the Long-finned Pilot Whale, Globicepheda melas
by: McFee, W. E.
Published: (1991)

Volunteer Handbook
by: McFee, W. E.
Published: (2003)

BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer
by: Chang, Chih-Cheng, et al.
Published: (2023)

Post-Training Quantization for Audio Diffusion Transformers
by: Khandelwal, Tanmay, et al.
Published: (2025)

MaskBeat: Loopable Drum Beat Generation
by: Lanzendörfer, Luca A., et al.
Published: (2025)

Revisiting Meter Tracking in Carnatic Music using Deep Learning Approaches
by: Prabhu, Satyajeet
Published: (2025)

HingeNet: A Harmonic-Aware Fine-Tuning Approach for Beat Tracking
by: Ru, Ganghui, et al.
Published: (2025)

Study on the Fairness of Speaker Verification Systems on Underrepresented Accents in English
by: Estevez, Mariel, et al.
Published: (2022)

SONIQUE: Video Background Music Generation Using Unpaired Audio-Visual Data
by: Zhang, Liqian, et al.
Published: (2024)

The SMC Blind Spot: A Failure Mode Analysis of State-of-the-Art Beat Tracking
by: Ahn, Jaehoon, et al.
Published: (2026)

Do Music Source Separation Models Preserve Spatial Information in Binaural Audio?
by: Namballa, Richa, et al.
Published: (2025)

Controlling Contrastive Self-Supervised Learning with Knowledge-Driven Multiple Hypothesis: Application to Beat Tracking
by: Gagnere, Antonin, et al.
Published: (2025)

Domain Adaptation Method and Modality Gap Impact in Audio-Text Models for Prototypical Sound Classification
by: Acevedo, Emiliano, et al.
Published: (2025)

Latent Multi-view Learning for Robust Environmental Sound Representations
by: Ding, Sivan, et al.
Published: (2025)

Beat-It: Beat-Synchronized Multi-Condition 3D Dance Generation
by: Huang, Zikai, et al.
Published: (2024)

Beat and Downbeat Tracking in Performance MIDI Using an End-to-End Transformer Architecture
by: Murgul, Sebastian, et al.
Published: (2025)

A Critical Assessment of Visual Sound Source Localization Models Including Negative Audio
by: Juanola, Xavier, et al.
Published: (2024)

Efficient Adapter Tuning for Joint Singing Voice Beat and Downbeat Tracking with Self-supervised Learning Features
by: Deng, Jiajun, et al.
Published: (2025)

Live Vocal Extraction from K-pop Performances
by: Kim, Yujin, et al.
Published: (2025)

Break-the-Beat! Controllable MIDI-to-Drum Audio Synthesis
by: Cui, Shuyang, et al.
Published: (2026)

Preliminary report on bottlenose dolphin (Tursiops truncatus) uterine samples for parity analysis
by: Meisner, Rene, et al.
Published: (2004)

Balancing Information Preservation and Disentanglement in Self-Supervised Music Representation Learning
by: Wilkins, Julia, et al.
Published: (2025)

Self-Supervised Multi-View Learning for Disentangled Music Audio Representations
by: Wilkins, Julia, et al.
Published: (2024)

Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis
by: Chen, Zehua, et al.
Published: (2023)

Investigating the Sensitivity of Pre-trained Audio Embeddings to Common Effects
by: Deng, Victor, et al.
Published: (2025)

Dance Any Beat: Blending Beats with Visuals in Dance Video Generation
by: Wang, Xuanchen, et al.
Published: (2024)

CTC Blank Triggered Dynamic Layer-Skipping for Efficient CTC-based Speech Recognition
by: Hou, Junfeng, et al.
Published: (2024)