| Main Authors: | Gogoi, Deepshikha, Gogoi, Parismita, Saring, Yang |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.25309 |
Similar Items
Exploring rhythm formant analysis for Indic language classification
by: Gogoi, Parismita, et al.
Published: (2024)
Analyzing long-term rhythm variations in Mising and Assamese using frequency domain correlates
by: Gogoi, Parismita, et al.
Published: (2024)
Tone recognition in low-resource languages of North-East India: peeling the layers of SSL-based speech models
by: Gogoi, Parismita, et al.
Published: (2025)
On the Invariance of Cross-Correlation Peak Positions Under Monotonic Signal Transformations, with Application to Fast Time Difference Estimation
by: Ueno, Natsuki, et al.
Published: (2025)
Gridless Chirp Parameter Retrieval via Constrained Two-Dimensional Atomic Norm Minimization
by: Yang, Dehui, et al.
Published: (2025)
Acoustical Features as Knee Health Biomarkers: A Critical Analysis
by: Kechris, Christodoulos, et al.
Published: (2024)
Leveraging AM and FM Rhythm Spectrograms for Dementia Classification and Assessment
by: Gogoi, Parismita, et al.
Published: (2025)
GAN-Based Speech Enhancement for Low SNR Using Latent Feature Conditioning
by: Shetu, Shrishti Saha, et al.
Published: (2024)
Wavelet-Based Time-Frequency Fingerprinting for Feature Extraction of Traditional Irish Music
by: Shore, Noah
Published: (2025)
Phase-Only Positioning in Distributed MIMO Under Phase Impairments: AP Selection Using Deep Learning
by: Ayten, Fatih, et al.
Published: (2026)
Exploring Audio-Visual Information Fusion for Sound Event Localization and Detection In Low-Resource Realistic Scenarios
by: Jiang, Ya, et al.
Published: (2024)
Independent Feature Enhanced Crossmodal Fusion for Match-Mismatch Classification of Speech Stimulus and EEG Response
by: Fan, Shitong, et al.
Published: (2024)
Graph-Enhanced Dual-Stream Feature Fusion with Pre-Trained Model for Acoustic Traffic Monitoring
by: Fan, Shitong, et al.
Published: (2024)
Modeling and Link Budget Feasibility Analysis of Secure LoRa-Based Peer-to-Peer Communication for Short-Range Tactical Networks
by: Agrawal, Ayush Kumar, et al.
Published: (2026)
ParaS2S: Benchmarking and Aligning Spoken Language Models for Paralinguistic-aware Speech-to-Speech Interaction
by: Yang, Shu-wen, et al.
Published: (2025)
A Novel Numerical Method for Relaxing the Minimal Configurations of TOA-Based Joint Sensors and Sources Localization
by: Cao, Faxian, et al.
Published: (2024)
Auditory Attention Decoding from Ear-EEG Signals: A Dataset with Dynamic Attention Switching and Rigorous Cross-Validation
by: Zhang, Yuanming, et al.
Published: (2025)
Cross-Talk Reduction
by: Wang, Zhong-Qiu, et al.
Published: (2024)
A Machine Hearing System for Robust Cough Detection Based on a High-Level Representation of Band-Specific Audio Features
by: Monge-Alvarez, Jesús, et al.
Published: (2024)
Noise Suppression for Time Difference of Arrival: Performance Evaluation of a Generalized Cross-Correlation Method Using Mean Signal and Inverse Filter
by: Obo, Hirotaka, et al.
Published: (2025)
Speech-Based Prioritization for Schizophrenia Intervention
by: Premananth, Gowtham, et al.
Published: (2025)
CrossSpeech++: Cross-lingual Speech Synthesis with Decoupled Language and Speaker Generation
by: Kim, Ji-Hoon, et al.
Published: (2024)
SELM: Speech Enhancement Using Discrete Tokens and Language Models
by: Wang, Ziqian, et al.
Published: (2023)
Advanced Signal Analysis in Detecting Replay Attacks for Automatic Speaker Verification Systems
by: Kuang, Lee Shih
Published: (2024)
Significance of Chirp MFCC as a Feature in Speech and Audio Applications
by: Joysingh, S. Johanan, et al.
Published: (2024)
Speech-Declipping Transformer with Complex Spectrogram and Learnerble Temporal Features
by: Kwon, Younghoo, et al.
Published: (2024)
Reverberation-based Features for Sound Event Localization and Detection with Distance Estimation
by: Berghi, Davide, et al.
Published: (2025)
Generative AI in Signal Processing Education: An Audio Foundation Model Based Approach
by: Khan, Muhammad Salman, et al.
Published: (2026)
Incremental Averaging Method to Improve Graph-Based Time-Difference-of-Arrival Estimation
by: Brümann, Klaus, et al.
Published: (2025)
Robust Fixed-Filter Sound Zone Control with Audio-Based Position Tracking
by: Bhattacharjee, Sankha Subhra, et al.
Published: (2024)
Blind Capon Beamformer Based on Independent Component Extraction: Single-Parameter Algorithm
by: Koldovský, Zbyněk, et al.
Published: (2025)
Multi-Source Position and Direction-of-Arrival Estimation Based on Euclidean Distance Matrices
by: Brümann, Klaus, et al.
Published: (2025)
Optimizing Domain-Adaptive Self-Supervised Learning for Clinical Voice-Based Disease Classification
by: Liu, Weixin, et al.
Published: (2026)
Transferable Selective Virtual Sensing Active Noise Control Technique Based on Metric Learning
by: Wang, Boxiang, et al.
Published: (2024)
Speaker and Style Disentanglement of Speech Based on Contrastive Predictive Coding Supported Factorized Variational Autoencoder
by: Xie, Yuying, et al.
Published: (2024)
Collection: UAV-Based RSS Measurements from the AFAR Challenge in Digital Twin and Real-World Environments
by: Masrur, Saad, et al.
Published: (2025)
Hybrid SMI Realization via Matrix Completion and Riemannian Manifold Optimization on Narrowband Sub-Array Based Architectures
by: Cousik, Tarun Suman, et al.
Published: (2026)
PromptEVC: Controllable Emotional Voice Conversion with Natural Language Prompts
by: Qi, Tianhua, et al.
Published: (2025)
Neural Spectral Band Generation for Audio Coding
by: Choi, Woongjib, et al.
Published: (2025)
Robust Detection of Underwater Target Against Non-Uniform Noise With Optical Fiber DAS Array
by: Cang, Siyuan, et al.
Published: (2025)