| Main Authors: | Simionato, Riccardo; Fasciani, Stefano |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2405.04124 |
Similar Items
Sines, Transient, Noise Neural Modeling of Piano Notes
by: Simionato, Riccardo, et al.
Published: (2024)
Modeling Time-Variant Responses of Optical Compressors with Selective State Space Models
by: Simionato, Riccardo, et al.
Published: (2024)
Robust Neural Audio Fingerprinting using Music Foundation Models
by: Singh, Shubhr, et al.
Published: (2025)
Exploring How Audio Effects Alter Emotion with Foundation Models
by: Katsis, Stelios, et al.
Published: (2025)
Audio Mamba: Pretrained Audio State Space Model For Audio Tagging
by: Lin, Jiaju, et al.
Published: (2024)
Are Audio-Language Models Listening? Audio-Specialist Heads for Adaptive Audio Steering
by: Glazer, Neta, et al.
Published: (2026)
Latent-Mark: An Audio Watermark Robust to Neural Resynthesis
by: Chen, Yen-Shan, et al.
Published: (2026)
Eureka-Audio: Triggering Audio Intelligence in Compact Language Models
by: Zhang, Dan, et al.
Published: (2026)
Audio Mamba: Bidirectional State Space Model for Audio Representation Learning
by: Erol, Mehmet Hamza, et al.
Published: (2024)
State Space Models for Bioacoustics: A Comparative Evaluation with Transformers
by: Tang, Chengyu, et al.
Published: (2025)
The Equalizer: Introducing Shape-Gain Decomposition in Neural Audio Codecs
by: Sadok, Samir, et al.
Published: (2026)
Self Voice Conversion as an Attack against Neural Audio Watermarking
by: Özer, Yigitcan, et al.
Published: (2026)
The Sonar Moment: Benchmarking Audio-Language Models in Audio Geo-Localization
by: Zhang, Ruixing, et al.
Published: (2026)
Audio-Maestro: Enhancing Large Audio-Language Models with Tool-Augmented Reasoning
by: Lee, Kuan-Yi, et al.
Published: (2025)
AudioSAE: Towards Understanding of Audio-Processing Models with Sparse AutoEncoders
by: Aparin, Georgii, et al.
Published: (2026)
Studying the Effect of Audio Filters in Pre-Trained Models for Environmental Sound Classification
by: Dawn, Aditya, et al.
Published: (2024)
AudioGuard: Toward Comprehensive Audio Safety Protection Across Diverse Threat Models
by: Kang, Mintong, et al.
Published: (2026)
HalluAudio: A Comprehensive Benchmark for Hallucination Detection in Large Audio-Language Models
by: Zhao, Feiyu, et al.
Published: (2026)
Drum Synthesis from Expressive Drum Grids via Neural Audio Codecs
by: Soiledis, Konstantinos, et al.
Published: (2026)
LearnAFE: Circuit-Algorithm Co-design Framework for Learnable Audio Analog Front-End
by: Hu, Jinhai, et al.
Published: (2025)
Evaluating Neural Networks Architectures for Spring Reverb Modelling
by: Papaleo, Francesco, et al.
Published: (2024)
Raw Audio Classification with Cosine Convolutional Neural Network (CosCovNN)
by: Haque, Kazi Nazmul, et al.
Published: (2024)
LHGNN: Local-Higher Order Graph Neural Networks For Audio Classification and Tagging
by: Singh, Shubhr, et al.
Published: (2025)
Evaluation of Audio Language Models for Fairness, Safety, and Security
by: Aloufi, Ranya, et al.
Published: (2026)
Towards Fine-grained Temporal Perception: Post-Training Large Audio-Language Models with Audio-Side Time Prompt
by: Shi, Yanfeng, et al.
Published: (2026)
AudioMoG: Guiding Audio Generation with Mixture-of-Guidance
by: Wang, Junyou, et al.
Published: (2025)
No Free Lunch from Audio Pretraining in Bioacoustics: A Benchmark Study of Embeddings
by: Chen, Chenggang, et al.
Published: (2025)
Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations
by: Yadav, Sarthak, et al.
Published: (2024)
EnCLAP: Combining Neural Audio Codec and Audio-Text Joint Embedding for Automated Audio Captioning
by: Kim, Jaeyeon, et al.
Published: (2024)
AND: Audio Network Dissection for Interpreting Deep Acoustic Models
by: Wu, Tung-Yu, et al.
Published: (2024)
MSMT-FN: Multi-segment Multi-task Fusion Network for Marketing Audio Classification
by: Liu, HongYu, et al.
Published: (2025)
Compressing Quaternion Convolutional Neural Networks for Audio Classification
by: Singh, Arshdeep, et al.
Published: (2025)
MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows
by: Li, Xiquan, et al.
Published: (2025)
AudioMotionBench: Evaluating Auditory Motion Perception in Audio LLMs
by: Sun, Zhe, et al.
Published: (2025)
PitchBench: Measuring Pitch Hearing in Audio-Language Models
by: Dujardin, Milan Liessens, et al.
Published: (2026)
Rebellion: Noise-Robust Reasoning Training for Audio Reasoning Models
by: Huang, Tiansheng, et al.
Published: (2025)
Stable Audio 3
by: Evans, Zach, et al.
Published: (2026)
Does Current Deepfake Audio Detection Model Effectively Detect ALM-based Deepfake Audio?
by: Xie, Yuankun, et al.
Published: (2024)
Audio Source Separation in Reverberant Environments using $β$-divergence based Nonnegative Factorization
by: Fakhry, Mahmoud, et al.
Published: (2026)
The World is Not Mono: Enabling Spatial Understanding in Large Audio-Language Models
by: You, Yuhuan, et al.
Published: (2026)