:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yu, Haocheng, Ahuja, Krishan K., Sankar, Lakshmi N., Bryngelson, Spencer H.
Format:	Preprint
Published:	2025
Subjects:	Sound Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2510.16355
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Water Flow Detection Device Based on Sound Data Analysis and Machine Learning to Detect Water Leakage
by: Pourmehrani, Hossein, et al.
Published: (2025)

SonicRAG : High Fidelity Sound Effects Synthesis Based on Retrival Augmented Generation
by: Guo, Yu-Ren, et al.
Published: (2025)

Listen through the Sound: Generative Speech Restoration Leveraging Acoustic Context Representation
by: Chung, Soo-Whan, et al.
Published: (2025)

Content Leakage in LibriSpeech and Its Impact on the Privacy Evaluation of Speaker Anonymization
by: Franzreb, Carlos, et al.
Published: (2026)

Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction
by: Lu, Ye-Xin, et al.
Published: (2024)

A Two-Step Learning Framework for Enhancing Sound Event Localization and Detection
by: Yu, Hogeon
Published: (2025)

Codec-SUPERB: An In-Depth Analysis of Sound Codec Models
by: Wu, Haibin, et al.
Published: (2024)

Leveraging Sound Source Trajectories for Universal Sound Separation
by: Wu, Donghang, et al.
Published: (2024)

Sound Zone Control Robust To Sound Speed Change
by: Bhattacharjee, Sankha Subhra, et al.
Published: (2024)

AudSemThinker: Enhancing Audio-Language Models through Reasoning over Semantics of Sound
by: Wijngaard, Gijs, et al.
Published: (2025)

Evaluating Sound Similarity Metrics for Differentiable, Iterative Sound-Matching
by: Salimi, Amir, et al.
Published: (2025)

Enhance Temporal Relations in Audio Captioning with Sound Event Detection
by: Xie, Zeyu, et al.
Published: (2023)

Hierarchical Pooling Structure for Weakly Labeled Sound Event Detection
by: He, Ke-Xin, et al.
Published: (2019)

APCodec: A Neural Audio Codec with Parallel Amplitude and Phase Spectrum Encoding and Decoding
by: Ai, Yang, et al.
Published: (2024)

HSDreport: Heart Sound Diagnosis with Echocardiography Reports
by: Zhao, Zihan, et al.
Published: (2024)

Semantic MIMO Systems for Speech-to-Text Transmission
by: Weng, Zhenzi, et al.
Published: (2024)

Improving Anomalous Sound Detection through Pseudo-anomalous Set Selection and Pseudo-label Utilization under Unlabeled Conditions
by: Kuroyanagi, Ibuki, et al.
Published: (2025)

STFTCodec: High-Fidelity Audio Compression through Time-Frequency Domain Representation
by: Feng, Tao, et al.
Published: (2025)

Noise-Robust Sound Event Detection and Counting via Language-Queried Sound Separation
by: Chen, Yuanjian, et al.
Published: (2025)

DiffSound: Differentiable Modal Sound Rendering and Inverse Rendering for Diverse Inference Tasks
by: Jin, Xutong, et al.
Published: (2024)

Fractional Fourier Sound Synthesis
by: Gutiérrez, Esteban, et al.
Published: (2025)

Diffuse Sound Field Synthesis
by: Zotter, Franz, et al.
Published: (2024)

Sound Event Bounding Boxes
by: Ebbers, Janek, et al.
Published: (2024)

Learning How to Listen: A Temporal-Frequential Attention Model for Sound Event Detection
by: Shen, Yu-Han, et al.
Published: (2018)

SoundBeam meets M2D: Target Sound Extraction with Audio Foundation Model
by: Hernandez-Olivan, Carlos, et al.
Published: (2024)

A Generalist Audio Foundation Model for Comprehensive Body Sound Auscultation
by: Wang, Pingjie, et al.
Published: (2024)

SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation
by: Niu, Xinlei, et al.
Published: (2024)

A Detailed Audio-Text Data Simulation Pipeline using Single-Event Sounds
by: Xu, Xuenan, et al.
Published: (2024)

DiveSound: LLM-Assisted Automatic Taxonomy Construction for Diverse Audio Generation
by: Li, Baihan, et al.
Published: (2024)

A MATLAB toolbox for Computation of Speech Transmission Index (STI)
by: Rajmic, Pavel, et al.
Published: (2025)

Fast Algorithm for Moving Sound Source
by: Yang, Dong
Published: (2025)

Boundary-Informed Sound Field Reconstruction
by: Sundström, David, et al.
Published: (2025)

Sound Field Synthesis with Acoustic Waves
by: Mansour, Mohamed F.
Published: (2024)

Joint Analysis of Acoustic Scenes and Sound Events Based on Semi-Supervised Training of Sound Events With Partial Labels
by: Imoto, Keisuke
Published: (2025)

Two-sided Acoustic Metascreen for Broadband and Individual Reflection and Transmission Control
by: Chen, Ao, et al.
Published: (2024)

AudioSpa: Spatializing Sound Events with Text
by: Feng, Linfeng, et al.
Published: (2025)

Adaptive Differential Denoising for Respiratory Sounds Classification
by: Dong, Gaoyang, et al.
Published: (2025)

Region-Specific Audio Tagging for Spatial Sound
by: Zhao, Jinzheng, et al.
Published: (2025)

Frequency Dynamic Convolutions for Sound Event Detection
by: Nam, Hyeonuk
Published: (2025)

Domain-Invariant Representation Learning of Bird Sounds
by: Moummad, Ilyass, et al.
Published: (2024)