:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Müller, Kristóf, Hatvani, Janka, Goda, Márton Áron, Koller, Miklós
Format:	Preprint
Published:	2025
Subjects:	Audio and Speech Processing Sound
Online Access:	https://arxiv.org/abs/2507.10783
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

pyPCG: A Python Toolbox Specialized for Phonocardiography Analysis
by: Müller, Kristóf, et al.
Published: (2024)

A Steered Response Power Method for Sound Source Localization With Generic Acoustic Models
by: Müller, Kaspar, et al.
Published: (2025)

SALT: Standardized Audio event Label Taxonomy
by: Stamatiadis, Paraskevas, et al.
Published: (2024)

IQRA 2026: Interspeech Challenge on Automatic Pronunciation Assessment for Modern Standard Arabic (MSA)
by: Kheir, Yassine El, et al.
Published: (2026)

Spatial Analysis and Synthesis Methods: Subjective and Objective Evaluations Using Various Microphone Arrays in the Auralization of a Critical Listening Room
by: Pawlak, Alan, et al.
Published: (2024)

SPO-CLAPScore: Enhancing CLAP-based alignment prediction system with Standardize Preference Optimization, for the first XACLE Challenge
by: Takano, Taisei, et al.
Published: (2026)

MPDR Beamforming for Almost-Cyclostationary Processes
by: Bologni, Giovanni, et al.
Published: (2025)

Towards Machine Unlearning for Paralinguistic Speech Processing
by: Phukan, Orchid Chetia, et al.
Published: (2025)

MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling
by: Cheng, Yifan, et al.
Published: (2025)

Data Standards in Audiology: A Mixed-Methods Exploration of Community Perspectives and Implementation Considerations
by: Vercammen, Charlotte, et al.
Published: (2025)

Pitch Contour Exploration Across Audio Domains: A Vision-Based Transfer Learning Approach
by: Abeßer, Jakob, et al.
Published: (2025)

The Interspeech 2024 Challenge on Speech Processing Using Discrete Units
by: Chang, Xuankai, et al.
Published: (2024)

A Practical Guide to Spectrogram Analysis for Audio Signal Processing
by: Khodzhaev, Zulfidin
Published: (2024)

Conformal Prediction for Manifold-based Source Localization with Gaussian Processes
by: Rozenfeld, Vadim, et al.
Published: (2024)

Moving Speaker Separation via Parallel Spectral-Spatial Processing
by: Wang, Yuzhu, et al.
Published: (2026)

Joint Minimum Processing Beamforming and Near-end Listening Enhancement
by: Fuglsig, Andreas J., et al.
Published: (2023)

An Attribute Interpolation Method in Speech Synthesis by Model Merging
by: Murata, Masato, et al.
Published: (2024)

Comparative Analysis of ASR Methods for Speech Deepfake Detection
by: Salvi, Davide, et al.
Published: (2024)

Decoupled Spatial and Temporal Processing for Resource Efficient Multichannel Speech Enhancement
by: Pandey, Ashutosh, et al.
Published: (2024)

Audiosockets: A Python socket package for Real-Time Audio Processing
by: Shu, Nicolas, et al.
Published: (2024)

Synthetic Speech Classification: IEEE Signal Processing Cup 2022 challenge
by: Rahmun, Mahieyin, et al.
Published: (2024)

GRAFX: An Open-Source Library for Audio Processing Graphs in PyTorch
by: Lee, Sungho, et al.
Published: (2024)

SpeechBERTScore: Reference-Aware Automatic Evaluation of Speech Generation Leveraging NLP Evaluation Metrics
by: Saeki, Takaaki, et al.
Published: (2024)

MLAAD: The Multi-Language Audio Anti-Spoofing Dataset
by: Müller, Nicolas M., et al.
Published: (2024)

Mel-Spectrogram Inversion via Alternating Direction Method of Multipliers
by: Masuyama, Yoshiki, et al.
Published: (2025)

Blind Source Separation in Biomedical Signals Using Variational Methods
by: Torabi, Yasaman, et al.
Published: (2025)

EDSep: An Effective Diffusion-Based Method for Speech Source Separation
by: Dong, Jinwei, et al.
Published: (2025)

Leveraging Multimodal Methods and Spontaneous Speech for Alzheimer's Disease Identification
by: Gao, Yifan, et al.
Published: (2024)

CEC: A Noisy Label Detection Method for Speaker Recognition
by: Shen, Yao, et al.
Published: (2024)

Improving Speech Enhancement by Cross- and Sub-band Processing with State Space Model
by: Li, Jizhen, et al.
Published: (2025)

ClearerVoice-Studio: Bridging Advanced Speech Processing Research and Practical Deployment
by: Zhao, Shengkui, et al.
Published: (2025)

A Survey on 30+ Years of Automatic Singing Assessment and Singing Information Processing
by: Santos, Arthur N. dos, et al.
Published: (2026)

Examining the Interplay Between Privacy and Fairness for Speech Processing: A Review and Perspective
by: Leschanowsky, Anna, et al.
Published: (2024)

MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing
by: Wu, Shangda, et al.
Published: (2024)

Reduction of Nonlinear Distortion in Condenser Microphones Using a Simple Post-Processing Technique
by: Honzík, Petr, et al.
Published: (2024)

NEST: Self-supervised Fast Conformer as All-purpose Seasoning to Speech Processing Tasks
by: Huang, He, et al.
Published: (2024)

Rethinking Processing Distortions: Disentangling the Impact of Speech Enhancement Errors on Speech Recognition Performance
by: Ochiai, Tsubasa, et al.
Published: (2024)

Distortion Recovery: A Two-Stage Method for Guitar Effect Removal
by: Lee, Ying-Shuo, et al.
Published: (2024)

RSET: Remapping-based Sorting Method for Emotion Transfer Speech Synthesis
by: Shi, Haoxiang, et al.
Published: (2024)

Comparison of Self-Supervised Speech Pre-Training Methods on Flemish Dutch
by: Poncelet, Jakob, et al.
Published: (2021)