Saved in:
| Main Authors: | Morais, Giovana, McFee, Brian, Fuentes, Magdalena |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.12972 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Investigating Modality Contribution in Audio LLMs for Music
by: Morais, Giovana, et al.
Published: (2025)
by: Morais, Giovana, et al.
Published: (2025)
Evaluating Compositional Structure in Audio Representations
by: Chen, Chuyang, et al.
Published: (2026)
by: Chen, Chuyang, et al.
Published: (2026)
Musical Source Separation of Brazilian Percussion
by: Namballa, Richa, et al.
Published: (2025)
by: Namballa, Richa, et al.
Published: (2025)
Hybrid Losses for Hierarchical Embedding Learning
by: Tian, Haokun, et al.
Published: (2025)
by: Tian, Haokun, et al.
Published: (2025)
Spatial Scaper: A Library to Simulate and Augment Soundscapes for Sound Event Localization and Detection in Realistic Rooms
by: Roman, Iran R., et al.
Published: (2024)
by: Roman, Iran R., et al.
Published: (2024)
BeatFM: Improving Beat Tracking with Pre-trained Music Foundation Model
by: Ru, Ganghui, et al.
Published: (2025)
by: Ru, Ganghui, et al.
Published: (2025)
Sound Scene Synthesis at the DCASE 2024 Challenge
by: Lagrange, Mathieu, et al.
Published: (2025)
by: Lagrange, Mathieu, et al.
Published: (2025)
Challenge on Sound Scene Synthesis: Evaluating Text-to-Audio Generation
by: Lee, Junwon, et al.
Published: (2024)
by: Lee, Junwon, et al.
Published: (2024)
Beat Tracking as Object Detection
by: Ahn, Jaehoon, et al.
Published: (2025)
by: Ahn, Jaehoon, et al.
Published: (2025)
Feature Article: The economics of narrow self‐interest – five key trends
by: Innes McFee
Published: (2025)
by: Innes McFee
Published: (2025)
Feature Article: Revisiting the global impact from a slowing China
by: Innes McFee
Published: (2024)
by: Innes McFee
Published: (2024)
Feature Article: How GenAI will change the world economy
by: Innes McFee
Published: (2024)
by: Innes McFee
Published: (2024)
Osu2MIR: Beat Tracking Dataset Derived From Osu! Data
by: Liu, Ziyun, et al.
Published: (2025)
by: Liu, Ziyun, et al.
Published: (2025)
Common names applied to the Long-finned Pilot Whale, Globicepheda melas
by: McFee, W. E.
Published: (1991)
by: McFee, W. E.
Published: (1991)
Volunteer Handbook
by: McFee, W. E.
Published: (2003)
by: McFee, W. E.
Published: (2003)
BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer
by: Chang, Chih-Cheng, et al.
Published: (2023)
by: Chang, Chih-Cheng, et al.
Published: (2023)
Post-Training Quantization for Audio Diffusion Transformers
by: Khandelwal, Tanmay, et al.
Published: (2025)
by: Khandelwal, Tanmay, et al.
Published: (2025)
MaskBeat: Loopable Drum Beat Generation
by: Lanzendörfer, Luca A., et al.
Published: (2025)
by: Lanzendörfer, Luca A., et al.
Published: (2025)
Revisiting Meter Tracking in Carnatic Music using Deep Learning Approaches
by: Prabhu, Satyajeet
Published: (2025)
by: Prabhu, Satyajeet
Published: (2025)
HingeNet: A Harmonic-Aware Fine-Tuning Approach for Beat Tracking
by: Ru, Ganghui, et al.
Published: (2025)
by: Ru, Ganghui, et al.
Published: (2025)
Study on the Fairness of Speaker Verification Systems on Underrepresented Accents in English
by: Estevez, Mariel, et al.
Published: (2022)
by: Estevez, Mariel, et al.
Published: (2022)
SONIQUE: Video Background Music Generation Using Unpaired Audio-Visual Data
by: Zhang, Liqian, et al.
Published: (2024)
by: Zhang, Liqian, et al.
Published: (2024)
The SMC Blind Spot: A Failure Mode Analysis of State-of-the-Art Beat Tracking
by: Ahn, Jaehoon, et al.
Published: (2026)
by: Ahn, Jaehoon, et al.
Published: (2026)
Do Music Source Separation Models Preserve Spatial Information in Binaural Audio?
by: Namballa, Richa, et al.
Published: (2025)
by: Namballa, Richa, et al.
Published: (2025)
Controlling Contrastive Self-Supervised Learning with Knowledge-Driven Multiple Hypothesis: Application to Beat Tracking
by: Gagnere, Antonin, et al.
Published: (2025)
by: Gagnere, Antonin, et al.
Published: (2025)
Domain Adaptation Method and Modality Gap Impact in Audio-Text Models for Prototypical Sound Classification
by: Acevedo, Emiliano, et al.
Published: (2025)
by: Acevedo, Emiliano, et al.
Published: (2025)
Latent Multi-view Learning for Robust Environmental Sound Representations
by: Ding, Sivan, et al.
Published: (2025)
by: Ding, Sivan, et al.
Published: (2025)
Beat-It: Beat-Synchronized Multi-Condition 3D Dance Generation
by: Huang, Zikai, et al.
Published: (2024)
by: Huang, Zikai, et al.
Published: (2024)
Beat and Downbeat Tracking in Performance MIDI Using an End-to-End Transformer Architecture
by: Murgul, Sebastian, et al.
Published: (2025)
by: Murgul, Sebastian, et al.
Published: (2025)
A Critical Assessment of Visual Sound Source Localization Models Including Negative Audio
by: Juanola, Xavier, et al.
Published: (2024)
by: Juanola, Xavier, et al.
Published: (2024)
Efficient Adapter Tuning for Joint Singing Voice Beat and Downbeat Tracking with Self-supervised Learning Features
by: Deng, Jiajun, et al.
Published: (2025)
by: Deng, Jiajun, et al.
Published: (2025)
Live Vocal Extraction from K-pop Performances
by: Kim, Yujin, et al.
Published: (2025)
by: Kim, Yujin, et al.
Published: (2025)
Break-the-Beat! Controllable MIDI-to-Drum Audio Synthesis
by: Cui, Shuyang, et al.
Published: (2026)
by: Cui, Shuyang, et al.
Published: (2026)
Preliminary report on bottlenose dolphin (Tursiops truncatus) uterine samples for parity analysis
by: Meisner, Rene, et al.
Published: (2004)
by: Meisner, Rene, et al.
Published: (2004)
Balancing Information Preservation and Disentanglement in Self-Supervised Music Representation Learning
by: Wilkins, Julia, et al.
Published: (2025)
by: Wilkins, Julia, et al.
Published: (2025)
Self-Supervised Multi-View Learning for Disentangled Music Audio Representations
by: Wilkins, Julia, et al.
Published: (2024)
by: Wilkins, Julia, et al.
Published: (2024)
Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis
by: Chen, Zehua, et al.
Published: (2023)
by: Chen, Zehua, et al.
Published: (2023)
Investigating the Sensitivity of Pre-trained Audio Embeddings to Common Effects
by: Deng, Victor, et al.
Published: (2025)
by: Deng, Victor, et al.
Published: (2025)
Dance Any Beat: Blending Beats with Visuals in Dance Video Generation
by: Wang, Xuanchen, et al.
Published: (2024)
by: Wang, Xuanchen, et al.
Published: (2024)
CTC Blank Triggered Dynamic Layer-Skipping for Efficient CTC-based Speech Recognition
by: Hou, Junfeng, et al.
Published: (2024)
by: Hou, Junfeng, et al.
Published: (2024)
Similar Items
-
Investigating Modality Contribution in Audio LLMs for Music
by: Morais, Giovana, et al.
Published: (2025) -
Evaluating Compositional Structure in Audio Representations
by: Chen, Chuyang, et al.
Published: (2026) -
Musical Source Separation of Brazilian Percussion
by: Namballa, Richa, et al.
Published: (2025) -
Hybrid Losses for Hierarchical Embedding Learning
by: Tian, Haokun, et al.
Published: (2025) -
Spatial Scaper: A Library to Simulate and Augment Soundscapes for Sound Event Localization and Detection in Realistic Rooms
by: Roman, Iran R., et al.
Published: (2024)