| Main Authors: | Müller, Kristóf, Hatvani, Janka, Goda, Márton Áron, Koller, Miklós |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2507.10783 |
Similar Items
pyPCG: A Python Toolbox Specialized for Phonocardiography Analysis
by: Müller, Kristóf, et al.
Published: (2024)
A Steered Response Power Method for Sound Source Localization With Generic Acoustic Models
by: Müller, Kaspar, et al.
Published: (2025)
SALT: Standardized Audio event Label Taxonomy
by: Stamatiadis, Paraskevas, et al.
Published: (2024)
IQRA 2026: Interspeech Challenge on Automatic Pronunciation Assessment for Modern Standard Arabic (MSA)
by: Kheir, Yassine El, et al.
Published: (2026)
Spatial Analysis and Synthesis Methods: Subjective and Objective Evaluations Using Various Microphone Arrays in the Auralization of a Critical Listening Room
by: Pawlak, Alan, et al.
Published: (2024)
SPO-CLAPScore: Enhancing CLAP-based alignment prediction system with Standardize Preference Optimization, for the first XACLE Challenge
by: Takano, Taisei, et al.
Published: (2026)
MPDR Beamforming for Almost-Cyclostationary Processes
by: Bologni, Giovanni, et al.
Published: (2025)
Towards Machine Unlearning for Paralinguistic Speech Processing
by: Phukan, Orchid Chetia, et al.
Published: (2025)
MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling
by: Cheng, Yifan, et al.
Published: (2025)
Data Standards in Audiology: A Mixed-Methods Exploration of Community Perspectives and Implementation Considerations
by: Vercammen, Charlotte, et al.
Published: (2025)
Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach
by: Abeßer, Jakob, et al.
Published: (2025)
The Interspeech 2024 Challenge on Speech Processing Using Discrete Units
by: Chang, Xuankai, et al.
Published: (2024)
A Practical Guide to Spectrogram Analysis for Audio Signal Processing
by: Khodzhaev, Zulfidin
Published: (2024)
Conformal Prediction for Manifold-based Source Localization with Gaussian Processes
by: Rozenfeld, Vadim, et al.
Published: (2024)
Moving Speaker Separation via Parallel Spectral-Spatial Processing
by: Wang, Yuzhu, et al.
Published: (2026)
Joint Minimum Processing Beamforming and Near-end Listening Enhancement
by: Fuglsig, Andreas J., et al.
Published: (2023)
An Attribute Interpolation Method in Speech Synthesis by Model Merging
by: Murata, Masato, et al.
Published: (2024)
Comparative Analysis of ASR Methods for Speech Deepfake Detection
by: Salvi, Davide, et al.
Published: (2024)
Decoupled Spatial and Temporal Processing for Resource Efficient Multichannel Speech Enhancement
by: Pandey, Ashutosh, et al.
Published: (2024)
Audiosockets: A Python socket package for Real-Time Audio Processing
by: Shu, Nicolas, et al.
Published: (2024)
Synthetic Speech Classification: IEEE Signal Processing Cup 2022 challenge
by: Rahmun, Mahieyin, et al.
Published: (2024)
GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch
by: Lee, Sungho, et al.
Published: (2024)
SpeechBERTScore: Reference-Aware Automatic Evaluation of Speech Generation Leveraging NLP Evaluation Metrics
by: Saeki, Takaaki, et al.
Published: (2024)
MLAAD: The Multi-Language Audio Anti-Spoofing Dataset
by: Müller, Nicolas M., et al.
Published: (2024)
Mel-Spectrogram Inversion via Alternating Direction Method of Multipliers
by: Masuyama, Yoshiki, et al.
Published: (2025)
Blind Source Separation in Biomedical Signals Using Variational Methods
by: Torabi, Yasaman, et al.
Published: (2025)
EDSep: An Effective Diffusion-Based Method for Speech Source Separation
by: Dong, Jinwei, et al.
Published: (2025)
Leveraging Multimodal Methods and Spontaneous Speech for Alzheimer's Disease Identification
by: Gao, Yifan, et al.
Published: (2024)
CEC: A Noisy Label Detection Method for Speaker Recognition
by: Shen, Yao, et al.
Published: (2024)
Improving Speech Enhancement by Cross- and Sub-band Processing with State Space Model
by: Li, Jizhen, et al.
Published: (2025)
ClearerVoice-Studio: Bridging Advanced Speech Processing Research and Practical Deployment
by: Zhao, Shengkui, et al.
Published: (2025)
A Survey on 30+ Years of Automatic Singing Assessment and Singing Information Processing
by: Santos, Arthur N. dos, et al.
Published: (2026)
Examining the Interplay Between Privacy and Fairness for Speech Processing: A Review and Perspective
by: Leschanowsky, Anna, et al.
Published: (2024)
MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing
by: Wu, Shangda, et al.
Published: (2024)
Reduction of Nonlinear Distortion in Condenser Microphones Using a Simple Post-Processing Technique
by: Honzík, Petr, et al.
Published: (2024)
NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks
by: Huang, He, et al.
Published: (2024)
Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance
by: Ochiai, Tsubasa, et al.
Published: (2024)
Distortion Recovery: A Two-Stage Method for Guitar Effect Removal
by: Lee, Ying-Shuo, et al.
Published: (2024)
RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis
by: Shi, Haoxiang, et al.
Published: (2024)
Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch
by: Poncelet, Jakob, et al.
Published: (2021)