:: Library Catalog

Bibliographic Details
Main Authors:	Zong, Yisu; Reiss, Joshua
Format:	Preprint
Published:	2025
Subjects:	Sound Audio and Speech Processing
Online Access:	https://arxiv.org/abs/2503.08806

Similar Items

Learning Vocal-Tract Area and Radiation with a Physics-Informed Webster Model
by: Lu, Minhui, et al.
Published: (2026)

Improving Neural Pitch Estimation with SWIPE Kernels
by: Marttila, David, et al.
Published: (2025)

Differentiable Black-box and Gray-box Modeling of Nonlinear Audio Effects
by: Comunità, Marco, et al.
Published: (2025)

NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks
by: Barahona-Ríos, Adrián, et al.
Published: (2023)

ST-ITO: Controlling Audio Effects for Style Transfer with Inference-Time Optimization
by: Steinmetz, Christian J., et al.
Published: (2024)

Cross-attention Inspired Selective State Space Models for Target Sound Extraction
by: Wu, Donghang, et al.
Published: (2024)

Exploring trends in audio mixes and masters: Insights from a dataset analysis
by: Mourgela, Angeliki, et al.
Published: (2024)

An automatic mixing speech enhancement system for multi-track audio
by: Liu, Xiaojing, et al.
Published: (2024)

Physics-Informed Neural Engine Sound Modeling with Differentiable Pulse-Train Synthesis
by: Doerfler, Robin, et al.
Published: (2026)

Fractional Fourier Sound Synthesis
by: Gutiérrez, Esteban, et al.
Published: (2025)

Diffuse Sound Field Synthesis
by: Zotter, Franz, et al.
Published: (2024)

Visual-based spatial audio generation system for multi-speaker environments
by: Liu, Xiaojing, et al.
Published: (2025)

Diff-MST: Differentiable Mixing Style Transfer
by: Vanka, Soumya Sai, et al.
Published: (2024)

SonicRAG : High Fidelity Sound Effects Synthesis Based on Retrival Augmented Generation
by: Guo, Yu-Ren, et al.
Published: (2025)

Sound Zone Control Robust To Sound Speed Change
by: Bhattacharjee, Sankha Subhra, et al.
Published: (2024)

Sound Field Synthesis with Acoustic Waves
by: Mansour, Mohamed F.
Published: (2024)

Physics-Informed Machine Learning For Sound Field Estimation
by: Koyama, Shoichi, et al.
Published: (2024)

Decomposing the Influence of Physical Acoustic Modeling on Neural Personal Sound Zone Rendering: An Ablation Study
by: Jiang, Hao, et al.
Published: (2026)

Singing Voice Synthesis Using Differentiable LPC and Glottal-Flow-Inspired Wavetables
by: Yu, Chin-Yun, et al.
Published: (2023)

Efficient Sound Field Reconstruction with Conditional Invertible Neural Networks
by: Karakonstantis, Xenofon, et al.
Published: (2024)

A Statistics-Driven Differentiable Approach for Sound Texture Synthesis and Analysis
by: Gutiérrez, Esteban, et al.
Published: (2025)

Modulation Discovery with Differentiable Digital Signal Processing
by: Mitcheltree, Christopher, et al.
Published: (2025)

Heterogeneous bimodal attention fusion for speech emotion recognition
by: Luo, Jiachen, et al.
Published: (2025)

6KSFx Synth Dataset
by: Garcia, Nelly, et al.
Published: (2025)

Physics-Informed Transfer Learning for Data-Driven Sound Source Reconstruction in Near-Field Acoustic Holography
by: Luan, Xinmeng, et al.
Published: (2025)

Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms
by: Zhang, Chu Yuan, et al.
Published: (2023)

Domain-Invariant Representation Learning of Bird Sounds
by: Moummad, Ilyass, et al.
Published: (2024)

PyNeuralFx: A Python Package for Neural Audio Effect Modeling
by: Yeh, Yen-Tung, et al.
Published: (2024)

Localizing Acoustic Energy in Sound Field Synthesis by Directionally Weighted Exterior Radiation Suppression
by: Tomita, Yoshihide, et al.
Published: (2024)

Sound Field Reconstruction Using a Compact Acoustics-informed Neural Network
by: Ma, Fei, et al.
Published: (2024)

Plugin Speech Enhancement: A Universal Speech Enhancement Framework Inspired by Dynamic Neural Network
by: Chen, Yanan, et al.
Published: (2024)

Learning How to Listen: A Temporal-Frequential Attention Model for Sound Event Detection
by: Shen, Yu-Han, et al.
Published: (2018)

SoundBeam meets M2D: Target Sound Extraction with Audio Foundation Model
by: Hernandez-Olivan, Carlos, et al.
Published: (2024)

AudioBERTScore: Objective Evaluation of Environmental Sound Synthesis Based on Similarity of Audio embedding Sequences
by: Kishi, Minoru, et al.
Published: (2025)

Automatic Sound Event Detection and Classification of Great Ape Calls Using Neural Networks
by: Jiang, Zifan, et al.
Published: (2023)

VCNAC: A Variable-Channel Neural Audio Codec for Mono, Stereo, and Surround Sound
by: Grötschla, Florian, et al.
Published: (2026)

Learning to Solve Inverse Problems for Perceptual Sound Matching
by: Han, Han, et al.
Published: (2023)

SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation
by: Niu, Xinlei, et al.
Published: (2024)

Leveraging Sound Source Trajectories for Universal Sound Separation
by: Wu, Donghang, et al.
Published: (2024)

Learning Magnitude Distribution of Sound Fields via Conditioned Autoencoder
by: Koyama, Shoichi, et al.
Published: (2025)