Saved in:
| Main Authors: | Sarkar, Eklavya, -Doss, Mathew Magimai. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.10190 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing
by: Sarkar, Eklavya, et al.
Published: (2025)
by: Sarkar, Eklavya, et al.
Published: (2025)
On the Utility of Speech and Audio Foundation Models for Marmoset Call Analysis
by: Sarkar, Eklavya, et al.
Published: (2024)
by: Sarkar, Eklavya, et al.
Published: (2024)
Feature Representations for Automatic Meerkat Vocalization Classification
by: Mahmoud, Imen Ben, et al.
Published: (2024)
by: Mahmoud, Imen Ben, et al.
Published: (2024)
On feature representations for marmoset vocal communication analysis
by: Sarkar, Eklavya, et al.
Published: (2025)
by: Sarkar, Eklavya, et al.
Published: (2025)
Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric Speech
by: Hajal, Karl El, et al.
Published: (2025)
by: Hajal, Karl El, et al.
Published: (2025)
Unsupervised Rhythm and Voice Conversion of Dysarthric to Healthy Speech for ASR
by: Hajal, Karl El, et al.
Published: (2025)
by: Hajal, Karl El, et al.
Published: (2025)
kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech
by: Hajal, Karl El, et al.
Published: (2024)
by: Hajal, Karl El, et al.
Published: (2024)
Supplementary Information for: "Speech power spectra: a window into neural oscillations in Parkinson's disease"
by: Hovsepyan, Sevada, et al.
Published: (2025)
by: Hovsepyan, Sevada, et al.
Published: (2025)
Multilingual Hidden Prompt Injection Attacks on LLM-Based Academic Reviewing
by: Theocharopoulos, Panagiotis, et al.
Published: (2025)
by: Theocharopoulos, Panagiotis, et al.
Published: (2025)
Predicting Heart Activity from Speech using Data-driven and Knowledge-based features
by: Elbanna, Gasser, et al.
Published: (2024)
by: Elbanna, Gasser, et al.
Published: (2024)
AVEX: What Matters for Animal Vocalization Encoding
by: Miron, Marius, et al.
Published: (2025)
by: Miron, Marius, et al.
Published: (2025)
Towards interfacing large language models with ASR systems using confidence measures and prompting
by: Naderi, Maryam, et al.
Published: (2024)
by: Naderi, Maryam, et al.
Published: (2024)
Unveiling Audio Deepfake Origins: A Deep Metric learning And Conformer Network Approach With Ensemble Fusion
by: Kulkarni, Ajinkya, et al.
Published: (2025)
by: Kulkarni, Ajinkya, et al.
Published: (2025)
Assessment of Personality Dimensions Across Situations Using Conversational Speech
by: Zhang, Alice, et al.
Published: (2025)
by: Zhang, Alice, et al.
Published: (2025)
Toward using Speech to Sense Student Emotion in Remote Learning Environments
by: Vyas, Sargam, et al.
Published: (2026)
by: Vyas, Sargam, et al.
Published: (2026)
BioSEN: A Bio-acoustic Signal Enhancement Network for Animal Vocalizations
by: Song, Tianyu, et al.
Published: (2026)
by: Song, Tianyu, et al.
Published: (2026)
Do Compact SSL Backbones Matter for Audio Deepfake Detection? A Controlled Study with RAPTOR
by: Kulkarni, Ajinkya, et al.
Published: (2026)
by: Kulkarni, Ajinkya, et al.
Published: (2026)
Towards the Synthesis of Non-speech Vocalizations
by: Hoq, Enjamamul, et al.
Published: (2024)
by: Hoq, Enjamamul, et al.
Published: (2024)
Vocal Tract Length Warped Features for Spoken Keyword Spotting
by: Sarkar, Achintya kr., et al.
Published: (2025)
by: Sarkar, Achintya kr., et al.
Published: (2025)
WhAM: Towards A Translative Model of Sperm Whale Vocalization
by: Paradise, Orr, et al.
Published: (2025)
by: Paradise, Orr, et al.
Published: (2025)
Beyond the Baseband: Adaptive Multi-Band Encoding for Full-Spectrum Bioacoustics Classification
by: Sarkar, Eklavya, et al.
Published: (2026)
by: Sarkar, Eklavya, et al.
Published: (2026)
Children's Voice Privacy: First Steps And Emerging Challenges
by: Kulkarni, Ajinkya, et al.
Published: (2025)
by: Kulkarni, Ajinkya, et al.
Published: (2025)
Estimation of heterogeneous principal effects under principal ignorability
by: Zhang, Rui, et al.
Published: (2026)
by: Zhang, Rui, et al.
Published: (2026)
A Novel Hierarchical Integration Method for Efficient Model Merging in Medical LLMs
by: Timilsina, Prakrit, et al.
Published: (2025)
by: Timilsina, Prakrit, et al.
Published: (2025)
Leveraging Negative Signals with Self-Attention for Sequential Music Recommendation
by: Seshadri, Pavan, et al.
Published: (2023)
by: Seshadri, Pavan, et al.
Published: (2023)
Leveraging Nested MLMC for Sequential Neural Posterior Estimation with Intractable Likelihoods
by: Yang, Xiliang, et al.
Published: (2024)
by: Yang, Xiliang, et al.
Published: (2024)
Sequential learning based PINNs to overcome temporal domain complexities in unsteady flow past flapping wings
by: Sundar, Rahul, et al.
Published: (2025)
by: Sundar, Rahul, et al.
Published: (2025)
A Dataset for Automatic Vocal Mode Classification
by: Hinrichs, Reemt, et al.
Published: (2026)
by: Hinrichs, Reemt, et al.
Published: (2026)
Temporal Sparse Autoencoders: Leveraging the Sequential Nature of Language for Interpretability
by: Bhalla, Usha, et al.
Published: (2025)
by: Bhalla, Usha, et al.
Published: (2025)
Structured Learning of Compositional Sequential Interventions
by: Yu, Jialin, et al.
Published: (2024)
by: Yu, Jialin, et al.
Published: (2024)
Score Shocks: The Burgers Equation Structure of Diffusion Generative Models
by: Sarkar, Krisanu
Published: (2026)
by: Sarkar, Krisanu
Published: (2026)
Balancing Classification and Calibration Performance in Decision-Making LLMs via Calibration Aware Reinforcement Learning
by: Yaldiz, Duygu Nur, et al.
Published: (2026)
by: Yaldiz, Duygu Nur, et al.
Published: (2026)
Downscaling Extreme Precipitation with Wasserstein Regularized Diffusion
by: Liu, Yuhao, et al.
Published: (2024)
by: Liu, Yuhao, et al.
Published: (2024)
Incremental Structure Discovery of Classification via Sequential Monte Carlo
by: Huang, Changze, et al.
Published: (2024)
by: Huang, Changze, et al.
Published: (2024)
Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making
by: Wang, Zhiyong
Published: (2025)
by: Wang, Zhiyong
Published: (2025)
Towards Futuristic Autonomous Experimentation--A Surprise-Reacting Sequential Experiment Policy
by: Ahmed, Imtiaz, et al.
Published: (2021)
by: Ahmed, Imtiaz, et al.
Published: (2021)
Optimal Sequential Recommendations: Exploiting User and Item Structure
by: Karzand, Mina, et al.
Published: (2025)
by: Karzand, Mina, et al.
Published: (2025)
Leveraging Data to Say No: Memory Augmented Plug-and-Play Selective Prediction
by: Sarkar, Aditya, et al.
Published: (2026)
by: Sarkar, Aditya, et al.
Published: (2026)
SALSA: Sequential Approximate Leverage-Score Algorithm with Application in Analyzing Big Time Series Data
by: Eshragh, Ali, et al.
Published: (2023)
by: Eshragh, Ali, et al.
Published: (2023)
Structure Is Not Enough: Leveraging Behavior for Neural Network Weight Reconstruction
by: Meynent, Léo, et al.
Published: (2025)
by: Meynent, Léo, et al.
Published: (2025)
Similar Items
-
Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing
by: Sarkar, Eklavya, et al.
Published: (2025) -
On the Utility of Speech and Audio Foundation Models for Marmoset Call Analysis
by: Sarkar, Eklavya, et al.
Published: (2024) -
Feature Representations for Automatic Meerkat Vocalization Classification
by: Mahmoud, Imen Ben, et al.
Published: (2024) -
On feature representations for marmoset vocal communication analysis
by: Sarkar, Eklavya, et al.
Published: (2025) -
Unsupervised Rhythm and Voice Conversion to Improve ASR on Dysarthric Speech
by: Hajal, Karl El, et al.
Published: (2025)