| Main Authors: | Zong, Yisu, Reiss, Joshua |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2503.08806 |
Similar Items
Learning Vocal-Tract Area and Radiation with a Physics-Informed Webster Model
by: Lu, Minhui, et al.
Published: (2026)
Improving Neural Pitch Estimation with SWIPE Kernels
by: Marttila, David, et al.
Published: (2025)
Differentiable Black-box and Gray-box Modeling of Nonlinear Audio Effects
by: Comunità, Marco, et al.
Published: (2025)
NoiseBandNet: Controllable Time-Varying Neural Synthesis of Sound Effects Using Filterbanks
by: Barahona-Ríos, Adrián, et al.
Published: (2023)
ST-ITO: Controlling Audio Effects for Style Transfer with Inference-Time Optimization
by: Steinmetz, Christian J., et al.
Published: (2024)
Cross-attention Inspired Selective State Space Models for Target Sound Extraction
by: Wu, Donghang, et al.
Published: (2024)
Exploring trends in audio mixes and masters: Insights from a dataset analysis
by: Mourgela, Angeliki, et al.
Published: (2024)
An automatic mixing speech enhancement system for multi-track audio
by: Liu, Xiaojing, et al.
Published: (2024)
Physics-Informed Neural Engine Sound Modeling with Differentiable Pulse-Train Synthesis
by: Doerfler, Robin, et al.
Published: (2026)
Fractional Fourier Sound Synthesis
by: Gutiérrez, Esteban, et al.
Published: (2025)
Diffuse Sound Field Synthesis
by: Zotter, Franz, et al.
Published: (2024)
Visual-based spatial audio generation system for multi-speaker environments
by: Liu, Xiaojing, et al.
Published: (2025)
Diff-MST: Differentiable Mixing Style Transfer
by: Vanka, Soumya Sai, et al.
Published: (2024)
SonicRAG : High Fidelity Sound Effects Synthesis Based on Retrival Augmented Generation
by: Guo, Yu-Ren, et al.
Published: (2025)
Sound Zone Control Robust To Sound Speed Change
by: Bhattacharjee, Sankha Subhra, et al.
Published: (2024)
Sound Field Synthesis with Acoustic Waves
by: Mansour, Mohamed F.
Published: (2024)
Physics-Informed Machine Learning For Sound Field Estimation
by: Koyama, Shoichi, et al.
Published: (2024)
Decomposing the Influence of Physical Acoustic Modeling on Neural Personal Sound Zone Rendering: An Ablation Study
by: Jiang, Hao, et al.
Published: (2026)
Singing Voice Synthesis Using Differentiable LPC and Glottal-Flow-Inspired Wavetables
by: Yu, Chin-Yun, et al.
Published: (2023)
Efficient Sound Field Reconstruction with Conditional Invertible Neural Networks
by: Karakonstantis, Xenofon, et al.
Published: (2024)
A Statistics-Driven Differentiable Approach for Sound Texture Synthesis and Analysis
by: Gutiérrez, Esteban, et al.
Published: (2025)
Modulation Discovery with Differentiable Digital Signal Processing
by: Mitcheltree, Christopher, et al.
Published: (2025)
Heterogeneous bimodal attention fusion for speech emotion recognition
by: Luo, Jiachen, et al.
Published: (2025)
6KSFx Synth Dataset
by: Garcia, Nelly, et al.
Published: (2025)
Physics-Informed Transfer Learning for Data-Driven Sound Source Reconstruction in Near-Field Acoustic Holography
by: Luan, Xinmeng, et al.
Published: (2025)
Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms
by: Zhang, Chu Yuan, et al.
Published: (2023)
Domain-Invariant Representation Learning of Bird Sounds
by: Moummad, Ilyass, et al.
Published: (2024)
PyNeuralFx: A Python Package for Neural Audio Effect Modeling
by: Yeh, Yen-Tung, et al.
Published: (2024)
Localizing Acoustic Energy in Sound Field Synthesis by Directionally Weighted Exterior Radiation Suppression
by: Tomita, Yoshihide, et al.
Published: (2024)
Sound Field Reconstruction Using a Compact Acoustics-informed Neural Network
by: Ma, Fei, et al.
Published: (2024)
Plugin Speech Enhancement: A Universal Speech Enhancement Framework Inspired by Dynamic Neural Network
by: Chen, Yanan, et al.
Published: (2024)
Learning How to Listen: A Temporal-Frequential Attention Model for Sound Event Detection
by: Shen, Yu-Han, et al.
Published: (2018)
SoundBeam meets M2D: Target Sound Extraction with Audio Foundation Model
by: Hernandez-Olivan, Carlos, et al.
Published: (2024)
AudioBERTScore: Objective Evaluation of Environmental Sound Synthesis Based on Similarity of Audio embedding Sequences
by: Kishi, Minoru, et al.
Published: (2025)
Automatic Sound Event Detection and Classification of Great Ape Calls Using Neural Networks
by: Jiang, Zifan, et al.
Published: (2023)
VCNAC: A Variable-Channel Neural Audio Codec for Mono, Stereo, and Surround Sound
by: Grötschla, Florian, et al.
Published: (2026)
Learning to Solve Inverse Problems for Perceptual Sound Matching
by: Han, Han, et al.
Published: (2023)
SoundLoCD: An Efficient Conditional Discrete Contrastive Latent Diffusion Model for Text-to-Sound Generation
by: Niu, Xinlei, et al.
Published: (2024)
Leveraging Sound Source Trajectories for Universal Sound Separation
by: Wu, Donghang, et al.
Published: (2024)
Learning Magnitude Distribution of Sound Fields via Conditioned Autoencoder
by: Koyama, Shoichi, et al.
Published: (2025)