:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Strycharczuk, Patrycja, Lo, Justin J. H., Kirkham, Sam
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Sound
Online Access:	https://arxiv.org/abs/2605.23416
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Articulatory strategy in vowel production as a basis for speaker discrimination
by: Lo, Justin J. H., et al.
Published: (2025)

Towards a dynamical model of English vowels. Evidence from diphthongisation
by: Strycharczuk, Patrycja, et al.
Published: (2024)

Nosey: Open-source hardware for acoustic nasalance
by: Dewhurst, Maya, et al.
Published: (2025)

AURORA Model of Formant-to-Tongue Inversion for Didactic and Clinical Applications
by: Strycharczuk, Patrycja, et al.
Published: (2026)

Dynamical model parameters from ultrasound tongue kinematics
by: Kirkham, Sam, et al.
Published: (2025)

Phonetic accommodation and inhibition in a dynamic neural field model
by: Kirkham, Sam, et al.
Published: (2025)

Comparison of sEMG Encoding Accuracy Across Speech Modes Using Articulatory and Phoneme Features
by: Le, Chenqian, et al.
Published: (2026)

On the Relationship between Accent Strength and Articulatory Features
by: Huang, Kevin, et al.
Published: (2025)

Tracking Articulatory Dynamics in Speech with a Fixed-Weight BiLSTM-CNN Architecture
by: Pillai, Leena G, et al.
Published: (2025)

Articulatory Configurations across Genders and Periods in French Radio and TV archives
by: Elie, Benjamin, et al.
Published: (2024)

Comparison of parameters of vowel sounds of russian and english languages
by: Fedoseev, V. I., et al.
Published: (2024)

Acoustic to Articulatory Inversion of Speech; Data Driven Approaches, Challenges, Applications, and Future Scope
by: Pillai, Leena G, et al.
Published: (2025)

PyPhonPlan: Simulating phonetic planning with dynamic neural fields and task dynamics
by: Kirkham, Sam
Published: (2026)

Scaling laws for nonlinear dynamical models of articulatory control
by: Kirkham, Sam
Published: (2024)

GTR-Voice: Articulatory Phonetics Informed Controllable Expressive Speech Synthesis
by: Li, Zehua Kcriss, et al.
Published: (2024)

Discovering dynamical laws for speech gestures
by: Kirkham, Sam
Published: (2025)

Speaker- and Text-Independent Estimation of Articulatory Movements and Phoneme Alignments from Speech
by: Weise, Tobias, et al.
Published: (2024)

Articulation-Informed ASR: Integrating Articulatory Features into ASR via Auxiliary Speech Inversion and Cross-Attention Fusion
by: Attia, Ahmed Adel, et al.
Published: (2025)

Locality enhanced dynamic biasing and sampling strategies for contextual ASR
by: Jalal, Md Asif, et al.
Published: (2024)

Multilingual acoustic word embeddings for zero-resource languages
by: Jacobs, Christiaan
Published: (2024)

CLiFT-ASR: A Cross-Lingual Fine-Tuning Framework for Low-Resource Taiwanese Hokkien Speech Recognition
by: Sung, Hung-Yang, et al.
Published: (2025)

Beyond Modality Limitations: A Unified MLLM Approach to Automated Speaking Assessment with Effective Curriculum Learning
by: Fang, Yu-Hsuan, et al.
Published: (2025)

Visual Cues Support Robust Turn-taking Prediction in Noise
by: Russell, Sam O'Connor, et al.
Published: (2025)

Voice Conversion for Lombard Speaking Style with Implicit and Explicit Acoustic Feature Conditioning
by: Woszczyk, Dominika, et al.
Published: (2025)

Covertly improving intelligibility with data-driven adaptations of speech timing
by: Tuttösí, Paige, et al.
Published: (2026)

Can large audio language models understand child stuttering speech? speech summarization, and source separation
by: Okocha, Chibuzor, et al.
Published: (2025)

Automated speech audiometry: Can it work using open-source pre-trained Kaldi-NL automatic speech recognition?
by: Araiza-Illan, Gloria, et al.
Published: (2023)

Augment, Drop & Swap: Improving Diversity in LLM Captions for Efficient Music-Text Representation Learning
by: Manco, Ilaria, et al.
Published: (2024)

Robust Audio-Text Retrieval via Cross-Modal Attention and Hybrid Loss
by: Liu, Meizhu, et al.
Published: (2026)

The NTNU System at the S&I Challenge 2025 SLA Open Track
by: Lin, Hong-Yun, et al.
Published: (2025)

A multilingual training strategy for low resource Text to Speech
by: Amalas, Asma, et al.
Published: (2024)

Exploring Dynamic Parameters for Vietnamese Gender-Independent ASR
by: Leang, Sotheara, et al.
Published: (2025)

Training dynamic models using early exits for automatic speech recognition on resource-constrained devices
by: Wright, George August, et al.
Published: (2023)

SongSong: A Time Phonograph for Chinese SongCi Music from Thousand of Years Away
by: Li, Jiajia, et al.
Published: (2026)

DIFFA-2: A Practical Diffusion Large Language Model for General Audio Understanding
by: Zhou, Jiaming, et al.
Published: (2026)

PRiSM: Benchmarking Phone Realization in Speech Models
by: Bharadwaj, Shikhar, et al.
Published: (2026)

What and When to Learn: CURriculum Ranking Loss for Large-Scale Speaker Verification
by: Baali, Massa, et al.
Published: (2026)

Deep Supervised Contrastive Learning of Pitch Contours for Robust Pitch Accent Classification in Seoul Korean
by: Joo, Hyunjung, et al.
Published: (2026)

HeadRouter: Dynamic Head-Weight Routing for Task-Adaptive Audio Token Pruning in Large Audio Language Models
by: He, Peize, et al.
Published: (2026)

AfriVox-v2: A Domain-Verticalized Benchmark for In-the-Wild African Speech Recognition
by: Awobade, Busayo, et al.
Published: (2026)