:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Gogoi, Deepshikha, Gogoi, Parismita, Saring, Yang
Format:	Preprint
Published:	2026
Subjects:	Audio and Speech Processing Signal Processing
Online Access:	https://arxiv.org/abs/2604.25309
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Exploring rhythm formant analysis for Indic language classification
by: Gogoi, Parismita, et al.
Published: (2024)

Analyzing long-term rhythm variations in Mising and Assamese using frequency domain correlates
by: Gogoi, Parismita, et al.
Published: (2024)

Tone recognition in low-resource languages of North-East India: peeling the layers of SSL-based speech models
by: Gogoi, Parismita, et al.
Published: (2025)

On the Invariance of Cross-Correlation Peak Positions Under Monotonic Signal Transformations, with Application to Fast Time Difference Estimation
by: Ueno, Natsuki, et al.
Published: (2025)

Gridless Chirp Parameter Retrieval via Constrained Two-Dimensional Atomic Norm Minimization
by: Yang, Dehui, et al.
Published: (2025)

Acoustical Features as Knee Health Biomarkers: A Critical Analysis
by: Kechris, Christodoulos, et al.
Published: (2024)

Leveraging AM and FM Rhythm Spectrograms for Dementia Classification and Assessment
by: Gogoi, Parismita, et al.
Published: (2025)

GAN-Based Speech Enhancement for Low SNR Using Latent Feature Conditioning
by: Shetu, Shrishti Saha, et al.
Published: (2024)

Wavelet-Based Time-Frequency Fingerprinting for Feature Extraction of Traditional Irish Music
by: Shore, Noah
Published: (2025)

Phase-Only Positioning in Distributed MIMO Under Phase Impairments: AP Selection Using Deep Learning
by: Ayten, Fatih, et al.
Published: (2026)

Exploring Audio-Visual Information Fusion for Sound Event Localization and Detection In Low-Resource Realistic Scenarios
by: Jiang, Ya, et al.
Published: (2024)

Independent Feature Enhanced Crossmodal Fusion for Match-Mismatch Classification of Speech Stimulus and EEG Response
by: Fan, Shitong, et al.
Published: (2024)

Graph-Enhanced Dual-Stream Feature Fusion with Pre-Trained Model for Acoustic Traffic Monitoring
by: Fan, Shitong, et al.
Published: (2024)

Modeling and Link Budget Feasibility Analysis of Secure LoRa-Based Peer-to-Peer Communication for Short-Range Tactical Networks
by: Agrawal, Ayush Kumar, et al.
Published: (2026)

ParaS2S: Benchmarking and Aligning Spoken Language Models for Paralinguistic-aware Speech-to-Speech Interaction
by: Yang, Shu-wen, et al.
Published: (2025)

A Novel Numerical Method for Relaxing the Minimal Configurations of TOA-Based Joint Sensors and Sources Localization
by: Cao, Faxian, et al.
Published: (2024)

Auditory Attention Decoding from Ear-EEG Signals: A Dataset with Dynamic Attention Switching and Rigorous Cross-Validation
by: Zhang, Yuanming, et al.
Published: (2025)

Cross-Talk Reduction
by: Wang, Zhong-Qiu, et al.
Published: (2024)

A Machine Hearing System for Robust Cough Detection Based on a High-Level Representation of Band-Specific Audio Features
by: Monge-Alvarez, Jesús, et al.
Published: (2024)

Noise Suppression for Time Difference of Arrival: Performance Evaluation of a Generalized Cross-Correlation Method Using Mean Signal and Inverse Filter
by: Obo, Hirotaka, et al.
Published: (2025)

Speech-Based Prioritization for Schizophrenia Intervention
by: Premananth, Gowtham, et al.
Published: (2025)

CrossSpeech++: Cross-lingual Speech Synthesis with Decoupled Language and Speaker Generation
by: Kim, Ji-Hoon, et al.
Published: (2024)

SELM: Speech Enhancement Using Discrete Tokens and Language Models
by: Wang, Ziqian, et al.
Published: (2023)

Advanced Signal Analysis in Detecting Replay Attacks for Automatic Speaker Verification Systems
by: Kuang, Lee Shih
Published: (2024)

Significance of Chirp MFCC as a Feature in Speech and Audio Applications
by: Joysingh, S. Johanan, et al.
Published: (2024)

Speech-Declipping Transformer with Complex Spectrogram and Learnerble Temporal Features
by: Kwon, Younghoo, et al.
Published: (2024)

Reverberation-based Features for Sound Event Localization and Detection with Distance Estimation
by: Berghi, Davide, et al.
Published: (2025)

Generative AI in Signal Processing Education: An Audio Foundation Model Based Approach
by: Khan, Muhammad Salman, et al.
Published: (2026)

Incremental Averaging Method to Improve Graph-Based Time-Difference-of-Arrival Estimation
by: Brümann, Klaus, et al.
Published: (2025)

Robust Fixed-Filter Sound Zone Control with Audio-Based Position Tracking
by: Bhattacharjee, Sankha Subhra, et al.
Published: (2024)

Blind Capon Beamformer Based on Independent Component Extraction: Single-Parameter Algorithm,
by: Koldovský, Zbyněk, et al.
Published: (2025)

Multi-Source Position and Direction-of-Arrival Estimation Based on Euclidean Distance Matrices
by: Brümann, Klaus, et al.
Published: (2025)

Optimizing Domain-Adaptive Self-Supervised Learning for Clinical Voice-Based Disease Classification
by: Liu, Weixin, et al.
Published: (2026)

Transferable Selective Virtual Sensing Active Noise Control Technique Based on Metric Learning
by: Wang, Boxiang, et al.
Published: (2024)

Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder
by: Xie, Yuying, et al.
Published: (2024)

Collection: UAV-Based RSS Measurements from the AFAR Challenge in Digital Twin and Real-World Environments
by: Masrur, Saad, et al.
Published: (2025)

Hybrid SMI Realization via Matrix Completion and Riemannian Manifold Optimization on Narrowband Sub-Array Based Architectures
by: Cousik, Tarun Suman, et al.
Published: (2026)

PromptEVC: Controllable Emotional Voice Conversion with Natural Language Prompts
by: Qi, Tianhua, et al.
Published: (2025)

Neural Spectral Band Generation for Audio Coding
by: Choi, Woongjib, et al.
Published: (2025)

Robust Detection of Underwater Target Against Non-Uniform Noise With Optical Fiber DAS Array
by: Cang, Siyuan, et al.
Published: (2025)