Saved in:
| Main Authors: | Strycharczuk, Patrycja, Lo, Justin J. H., Kirkham, Sam |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.23416 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Articulatory strategy in vowel production as a basis for speaker discrimination
by: Lo, Justin J. H., et al.
Published: (2025)
by: Lo, Justin J. H., et al.
Published: (2025)
Towards a dynamical model of English vowels. Evidence from diphthongisation
by: Strycharczuk, Patrycja, et al.
Published: (2024)
by: Strycharczuk, Patrycja, et al.
Published: (2024)
Nosey: Open-source hardware for acoustic nasalance
by: Dewhurst, Maya, et al.
Published: (2025)
by: Dewhurst, Maya, et al.
Published: (2025)
AURORA Model of Formant-to-Tongue Inversion for Didactic and Clinical Applications
by: Strycharczuk, Patrycja, et al.
Published: (2026)
by: Strycharczuk, Patrycja, et al.
Published: (2026)
Dynamical model parameters from ultrasound tongue kinematics
by: Kirkham, Sam, et al.
Published: (2025)
by: Kirkham, Sam, et al.
Published: (2025)
Phonetic accommodation and inhibition in a dynamic neural field model
by: Kirkham, Sam, et al.
Published: (2025)
by: Kirkham, Sam, et al.
Published: (2025)
Comparison of sEMG Encoding Accuracy Across Speech Modes Using Articulatory and Phoneme Features
by: Le, Chenqian, et al.
Published: (2026)
by: Le, Chenqian, et al.
Published: (2026)
On the Relationship between Accent Strength and Articulatory Features
by: Huang, Kevin, et al.
Published: (2025)
by: Huang, Kevin, et al.
Published: (2025)
Tracking Articulatory Dynamics in Speech with a Fixed-Weight BiLSTM-CNN Architecture
by: Pillai, Leena G, et al.
Published: (2025)
by: Pillai, Leena G, et al.
Published: (2025)
Articulatory Configurations across Genders and Periods in French Radio and TV archives
by: Elie, Benjamin, et al.
Published: (2024)
by: Elie, Benjamin, et al.
Published: (2024)
Comparison of parameters of vowel sounds of russian and english languages
by: Fedoseev, V. I., et al.
Published: (2024)
by: Fedoseev, V. I., et al.
Published: (2024)
Acoustic to Articulatory Inversion of Speech; Data Driven Approaches, Challenges, Applications, and Future Scope
by: Pillai, Leena G, et al.
Published: (2025)
by: Pillai, Leena G, et al.
Published: (2025)
PyPhonPlan: Simulating phonetic planning with dynamic neural fields and task dynamics
by: Kirkham, Sam
Published: (2026)
by: Kirkham, Sam
Published: (2026)
Scaling laws for nonlinear dynamical models of articulatory control
by: Kirkham, Sam
Published: (2024)
by: Kirkham, Sam
Published: (2024)
GTR-Voice: Articulatory Phonetics Informed Controllable Expressive Speech Synthesis
by: Li, Zehua Kcriss, et al.
Published: (2024)
by: Li, Zehua Kcriss, et al.
Published: (2024)
Discovering dynamical laws for speech gestures
by: Kirkham, Sam
Published: (2025)
by: Kirkham, Sam
Published: (2025)
Speaker- and Text-Independent Estimation of Articulatory Movements and Phoneme Alignments from Speech
by: Weise, Tobias, et al.
Published: (2024)
by: Weise, Tobias, et al.
Published: (2024)
Articulation-Informed ASR: Integrating Articulatory Features into ASR via Auxiliary Speech Inversion and Cross-Attention Fusion
by: Attia, Ahmed Adel, et al.
Published: (2025)
by: Attia, Ahmed Adel, et al.
Published: (2025)
Locality enhanced dynamic biasing and sampling strategies for contextual ASR
by: Jalal, Md Asif, et al.
Published: (2024)
by: Jalal, Md Asif, et al.
Published: (2024)
Multilingual acoustic word embeddings for zero-resource languages
by: Jacobs, Christiaan
Published: (2024)
by: Jacobs, Christiaan
Published: (2024)
CLiFT-ASR: A Cross-Lingual Fine-Tuning Framework for Low-Resource Taiwanese Hokkien Speech Recognition
by: Sung, Hung-Yang, et al.
Published: (2025)
by: Sung, Hung-Yang, et al.
Published: (2025)
Beyond Modality Limitations: A Unified MLLM Approach to Automated Speaking Assessment with Effective Curriculum Learning
by: Fang, Yu-Hsuan, et al.
Published: (2025)
by: Fang, Yu-Hsuan, et al.
Published: (2025)
Visual Cues Support Robust Turn-taking Prediction in Noise
by: Russell, Sam O'Connor, et al.
Published: (2025)
by: Russell, Sam O'Connor, et al.
Published: (2025)
Voice Conversion for Lombard Speaking Style with Implicit and Explicit Acoustic Feature Conditioning
by: Woszczyk, Dominika, et al.
Published: (2025)
by: Woszczyk, Dominika, et al.
Published: (2025)
Covertly improving intelligibility with data-driven adaptations of speech timing
by: Tuttösí, Paige, et al.
Published: (2026)
by: Tuttösí, Paige, et al.
Published: (2026)
Can large audio language models understand child stuttering speech? speech summarization, and source separation
by: Okocha, Chibuzor, et al.
Published: (2025)
by: Okocha, Chibuzor, et al.
Published: (2025)
Automated speech audiometry: Can it work using open-source pre-trained Kaldi-NL automatic speech recognition?
by: Araiza-Illan, Gloria, et al.
Published: (2023)
by: Araiza-Illan, Gloria, et al.
Published: (2023)
Augment, Drop & Swap: Improving Diversity in LLM Captions for Efficient Music-Text Representation Learning
by: Manco, Ilaria, et al.
Published: (2024)
by: Manco, Ilaria, et al.
Published: (2024)
Robust Audio-Text Retrieval via Cross-Modal Attention and Hybrid Loss
by: Liu, Meizhu, et al.
Published: (2026)
by: Liu, Meizhu, et al.
Published: (2026)
The NTNU System at the S&I Challenge 2025 SLA Open Track
by: Lin, Hong-Yun, et al.
Published: (2025)
by: Lin, Hong-Yun, et al.
Published: (2025)
A multilingual training strategy for low resource Text to Speech
by: Amalas, Asma, et al.
Published: (2024)
by: Amalas, Asma, et al.
Published: (2024)
Exploring Dynamic Parameters for Vietnamese Gender-Independent ASR
by: Leang, Sotheara, et al.
Published: (2025)
by: Leang, Sotheara, et al.
Published: (2025)
Training dynamic models using early exits for automatic speech recognition on resource-constrained devices
by: Wright, George August, et al.
Published: (2023)
by: Wright, George August, et al.
Published: (2023)
SongSong: A Time Phonograph for Chinese SongCi Music from Thousand of Years Away
by: Li, Jiajia, et al.
Published: (2026)
by: Li, Jiajia, et al.
Published: (2026)
DIFFA-2: A Practical Diffusion Large Language Model for General Audio Understanding
by: Zhou, Jiaming, et al.
Published: (2026)
by: Zhou, Jiaming, et al.
Published: (2026)
PRiSM: Benchmarking Phone Realization in Speech Models
by: Bharadwaj, Shikhar, et al.
Published: (2026)
by: Bharadwaj, Shikhar, et al.
Published: (2026)
What and When to Learn: CURriculum Ranking Loss for Large-Scale Speaker Verification
by: Baali, Massa, et al.
Published: (2026)
by: Baali, Massa, et al.
Published: (2026)
Deep Supervised Contrastive Learning of Pitch Contours for Robust Pitch Accent Classification in Seoul Korean
by: Joo, Hyunjung, et al.
Published: (2026)
by: Joo, Hyunjung, et al.
Published: (2026)
HeadRouter: Dynamic Head-Weight Routing for Task-Adaptive Audio Token Pruning in Large Audio Language Models
by: He, Peize, et al.
Published: (2026)
by: He, Peize, et al.
Published: (2026)
AfriVox-v2: A Domain-Verticalized Benchmark for In-the-Wild African Speech Recognition
by: Awobade, Busayo, et al.
Published: (2026)
by: Awobade, Busayo, et al.
Published: (2026)
Similar Items
-
Articulatory strategy in vowel production as a basis for speaker discrimination
by: Lo, Justin J. H., et al.
Published: (2025) -
Towards a dynamical model of English vowels. Evidence from diphthongisation
by: Strycharczuk, Patrycja, et al.
Published: (2024) -
Nosey: Open-source hardware for acoustic nasalance
by: Dewhurst, Maya, et al.
Published: (2025) -
AURORA Model of Formant-to-Tongue Inversion for Didactic and Clinical Applications
by: Strycharczuk, Patrycja, et al.
Published: (2026) -
Dynamical model parameters from ultrasound tongue kinematics
by: Kirkham, Sam, et al.
Published: (2025)