Saved in:
| Main Authors: | Yu, Haocheng, Ahuja, Krishan K., Sankar, Lakshmi N., Bryngelson, Spencer H. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.16355 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Water Flow Detection Device Based on Sound Data Analysis and Machine Learning to Detect Water Leakage
by: Pourmehrani, Hossein, et al.
Published: (2025)
by: Pourmehrani, Hossein, et al.
Published: (2025)
SonicRAG : High Fidelity Sound Effects Synthesis Based on Retrival Augmented Generation
by: Guo, Yu-Ren, et al.
Published: (2025)
by: Guo, Yu-Ren, et al.
Published: (2025)
Listen through the Sound: Generative Speech Restoration Leveraging Acoustic Context Representation
by: Chung, Soo-Whan, et al.
Published: (2025)
by: Chung, Soo-Whan, et al.
Published: (2025)
Content Leakage in LibriSpeech and Its Impact on the Privacy Evaluation of Speaker Anonymization
by: Franzreb, Carlos, et al.
Published: (2026)
by: Franzreb, Carlos, et al.
Published: (2026)
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction
by: Lu, Ye-Xin, et al.
Published: (2024)
by: Lu, Ye-Xin, et al.
Published: (2024)
A Two-Step Learning Framework for Enhancing Sound Event Localization and Detection
by: Yu, Hogeon
Published: (2025)
by: Yu, Hogeon
Published: (2025)
Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
by: Wu, Haibin, et al.
Published: (2024)
by: Wu, Haibin, et al.
Published: (2024)
Leveraging Sound Source Trajectories for Universal Sound Separation
by: Wu, Donghang, et al.
Published: (2024)
by: Wu, Donghang, et al.
Published: (2024)
Sound Zone Control Robust To Sound Speed Change
by: Bhattacharjee, Sankha Subhra, et al.
Published: (2024)
by: Bhattacharjee, Sankha Subhra, et al.
Published: (2024)
AudSemThinker: Enhancing Audio-Language Models through Reasoning over Semantics of Sound
by: Wijngaard, Gijs, et al.
Published: (2025)
by: Wijngaard, Gijs, et al.
Published: (2025)
Evaluating Sound Similarity Metrics for Differentiable, Iterative Sound-Matching
by: Salimi, Amir, et al.
Published: (2025)
by: Salimi, Amir, et al.
Published: (2025)
Enhance Temporal Relations in Audio Captioning with Sound Event Detection
by: Xie, Zeyu, et al.
Published: (2023)
by: Xie, Zeyu, et al.
Published: (2023)
Hierarchical Pooling Structure for Weakly Labeled Sound Event Detection
by: He, Ke-Xin, et al.
Published: (2019)
by: He, Ke-Xin, et al.
Published: (2019)
APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum Encoding and Decoding
by: Ai, Yang, et al.
Published: (2024)
by: Ai, Yang, et al.
Published: (2024)
HSDreport: Heart Sound Diagnosis with Echocardiography Reports
by: Zhao, Zihan, et al.
Published: (2024)
by: Zhao, Zihan, et al.
Published: (2024)
Semantic MIMO Systems for Speech-to-Text Transmission
by: Weng, Zhenzi, et al.
Published: (2024)
by: Weng, Zhenzi, et al.
Published: (2024)
Improving Anomalous Sound Detection through Pseudo-anomalous Set Selection and Pseudo-label Utilization under Unlabeled Conditions
by: Kuroyanagi, Ibuki, et al.
Published: (2025)
by: Kuroyanagi, Ibuki, et al.
Published: (2025)
STFTCodec: High-Fidelity Audio Compression through Time-Frequency Domain Representation
by: Feng, Tao, et al.
Published: (2025)
by: Feng, Tao, et al.
Published: (2025)
Noise-Robust Sound Event Detection and Counting via Language-Queried Sound Separation
by: Chen, Yuanjian, et al.
Published: (2025)
by: Chen, Yuanjian, et al.
Published: (2025)
DiffSound: Differentiable Modal Sound Rendering and Inverse Rendering for Diverse Inference Tasks
by: Jin, Xutong, et al.
Published: (2024)
by: Jin, Xutong, et al.
Published: (2024)
Fractional Fourier Sound Synthesis
by: Gutiérrez, Esteban, et al.
Published: (2025)
by: Gutiérrez, Esteban, et al.
Published: (2025)
Diffuse Sound Field Synthesis
by: Zotter, Franz, et al.
Published: (2024)
by: Zotter, Franz, et al.
Published: (2024)
Sound Event Bounding Boxes
by: Ebbers, Janek, et al.
Published: (2024)
by: Ebbers, Janek, et al.
Published: (2024)
Learning How to Listen: A Temporal-Frequential Attention Model for Sound Event Detection
by: Shen, Yu-Han, et al.
Published: (2018)
by: Shen, Yu-Han, et al.
Published: (2018)
SoundBeam meets M2D: Target Sound Extraction with Audio Foundation Model
by: Hernandez-Olivan, Carlos, et al.
Published: (2024)
by: Hernandez-Olivan, Carlos, et al.
Published: (2024)
A Generalist Audio Foundation Model for Comprehensive Body Sound Auscultation
by: Wang, Pingjie, et al.
Published: (2024)
by: Wang, Pingjie, et al.
Published: (2024)
SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation
by: Niu, Xinlei, et al.
Published: (2024)
by: Niu, Xinlei, et al.
Published: (2024)
A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds
by: Xu, Xuenan, et al.
Published: (2024)
by: Xu, Xuenan, et al.
Published: (2024)
DiveSound: LLM-Assisted Automatic Taxonomy Construction for Diverse Audio Generation
by: Li, Baihan, et al.
Published: (2024)
by: Li, Baihan, et al.
Published: (2024)
A MATLAB toolbox for Computation of Speech Transmission Index (STI)
by: Rajmic, Pavel, et al.
Published: (2025)
by: Rajmic, Pavel, et al.
Published: (2025)
Fast Algorithm for Moving Sound Source
by: Yang, Dong
Published: (2025)
by: Yang, Dong
Published: (2025)
Boundary-Informed Sound Field Reconstruction
by: Sundström, David, et al.
Published: (2025)
by: Sundström, David, et al.
Published: (2025)
Sound Field Synthesis with Acoustic Waves
by: Mansour, Mohamed F.
Published: (2024)
by: Mansour, Mohamed F.
Published: (2024)
Joint Analysis of Acoustic Scenes and Sound Events Based on Semi-Supervised Training of Sound Events With Partial Labels
by: Imoto, Keisuke
Published: (2025)
by: Imoto, Keisuke
Published: (2025)
Two-sided Acoustic Metascreen for Broadband and Individual Reflection and Transmission Control
by: Chen, Ao, et al.
Published: (2024)
by: Chen, Ao, et al.
Published: (2024)
AudioSpa: Spatializing Sound Events with Text
by: Feng, Linfeng, et al.
Published: (2025)
by: Feng, Linfeng, et al.
Published: (2025)
Adaptive Differential Denoising for Respiratory Sounds Classification
by: Dong, Gaoyang, et al.
Published: (2025)
by: Dong, Gaoyang, et al.
Published: (2025)
Region-Specific Audio Tagging for Spatial Sound
by: Zhao, Jinzheng, et al.
Published: (2025)
by: Zhao, Jinzheng, et al.
Published: (2025)
Frequency Dynamic Convolutions for Sound Event Detection
by: Nam, Hyeonuk
Published: (2025)
by: Nam, Hyeonuk
Published: (2025)
Domain-Invariant Representation Learning of Bird Sounds
by: Moummad, Ilyass, et al.
Published: (2024)
by: Moummad, Ilyass, et al.
Published: (2024)
Similar Items
-
Water Flow Detection Device Based on Sound Data Analysis and Machine Learning to Detect Water Leakage
by: Pourmehrani, Hossein, et al.
Published: (2025) -
SonicRAG : High Fidelity Sound Effects Synthesis Based on Retrival Augmented Generation
by: Guo, Yu-Ren, et al.
Published: (2025) -
Listen through the Sound: Generative Speech Restoration Leveraging Acoustic Context Representation
by: Chung, Soo-Whan, et al.
Published: (2025) -
Content Leakage in LibriSpeech and Its Impact on the Privacy Evaluation of Speaker Anonymization
by: Franzreb, Carlos, et al.
Published: (2026) -
Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction
by: Lu, Ye-Xin, et al.
Published: (2024)