| Main Authors: | Liang, Siyu; Mawkanuli, Talant; Levow, Gina-Anne |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.00923 |
Similar Items
Breaking the Transcription Bottleneck: Fine-tuning ASR Models for Extremely Low-Resource Fieldwork Languages
by: Liang, Siyu, et al.
Published: (2025)
Beyond WER: Probing Whisper's Sub-token Decoder Across Diverse Language Resource Levels
by: Liang, Siyu, et al.
Published: (2025)
A Sociophonetic Analysis of Racial Bias in Commercial ASR Systems Using the Pacific Northwest English Corpus
by: Scott, Michael, et al.
Published: (2025)
The Limits of Data Scaling: Sub-token Utilization and Acoustic Saturation in Multilingual ASR
by: Liang, Siyu, et al.
Published: (2025)
Robust Generalization Strategies for Morpheme Glossing in an Endangered Language Documentation Context
by: Ginn, Michael, et al.
Published: (2023)
TEII: Think, Explain, Interact and Iterate with Large Language Models to Solve Cross-lingual Emotion Detection
by: Cheng, Long, et al.
Published: (2024)
Automatic Speech Recognition for Documenting Endangered Languages: Case Study of Ikema Miyakoan
by: Taguchi, Chihiro, et al.
Published: (2026)
Wav2Gloss: Generating Interlinear Glossed Text from Speech
by: He, Taiqi, et al.
Published: (2024)
GlossLM: A Massively Multilingual Corpus and Pretrained Model for Interlinear Glossed Text
by: Ginn, Michael, et al.
Published: (2024)
Interdisciplinary Research in Conversation: A Case Study in Computational Morphology for Language Documentation
by: Rice, Enora, et al.
Published: (2025)
Improving Gloss-free Sign Language Translation by Reducing Representation Density
by: Ye, Jinhui, et al.
Published: (2024)
Gloss2Text: Sign Language Gloss translation using LLMs and Semantically Aware Label Smoothing
by: Fayyazsanavi, Pooya, et al.
Published: (2024)
Selective Contrastive Learning For Gloss Free Sign Language Translation
by: Lai, Changhao, et al.
Published: (2026)
NoLoR: An ASR-Based Framework for Expedited Endangered Language Documentation with Neo-Aramaic as a Case Study
by: Nazari, Matthew
Published: (2024)
Massively Multilingual Joint Segmentation and Glossing
by: Ginn, Michael, et al.
Published: (2026)
Leveraging the Power of MLLMs for Gloss-Free Sign Language Translation
by: Kim, Jungeun, et al.
Published: (2024)
Factorized Learning Assisted with Large Language Model for Gloss-free Sign Language Translation
by: Chen, Zhigang, et al.
Published: (2024)
Embedded Translations for Low-resource Automated Glossing
by: Yang, Changbing, et al.
Published: (2024)
Gloss-Free Sign Language Translation: An Unbiased Evaluation of Progress in the Field
by: Sincan, Ozge Mercanoglu, et al.
Published: (2026)
GLOS: Sign Language Generation with Temporally Aligned Gloss-Level Conditioning
by: Lee, Taeryung, et al.
Published: (2025)
Advancing Uto-Aztecan Language Technologies: A Case Study on the Endangered Comanche Language
by: C, Jesus Alvarez, et al.
Published: (2025)
Integrating Linguistics and AI: Morphological Analysis and Corpus development of Endangered Toto Language of West Bengal
by: Guha, Ambalika, et al.
Published: (2025)
C${^2}$RL: Content and Context Representation Learning for Gloss-free Sign Language Translation and Retrieval
by: Chen, Zhigang, et al.
Published: (2024)
CWoMP: Morpheme Representation Learning for Interlinear Glossing
by: Alper, Morris, et al.
Published: (2026)
Multilingual Gloss-free Sign Language Translation: Towards Building a Sign Language Foundation Model
by: Tan, Sihan, et al.
Published: (2025)
Keyboards for the Endangered Idu Mishmi Language
by: Ramarao, Akhilesh Kakolu
Published: (2026)
Text2Sign Diffusion: A Generative Approach for Gloss-Free Sign Language Production
by: Feng, Liqian, et al.
Published: (2025)
A Spatio-Temporal Representation Learning as an Alternative to Traditional Glosses in Sign Language Translation and Production
by: Hwang, Eui Jun, et al.
Published: (2024)
Is It Navajo? Accurate Language Detection in Endangered Athabaskan Languages
by: Yang, Ivory, et al.
Published: (2025)
Introducing A Bangla Sentence - Gloss Pair Dataset for Bangla Sign Language Translation and Research
by: Saha, Neelavro, et al.
Published: (2025)
Can a Neural Model Guide Fieldwork? A Case Study on Morphological Data Collection
by: Mahmudi, Aso, et al.
Published: (2024)
Continuous Bangla Sign Language Translation: Mitigating the Expense of Gloss Annotation with the Assistance of Graph
by: Arib, Safaeid Hossain, et al.
Published: (2025)
Neural Morphological Tagging for Nguni Languages
by: Marquard, Cael, et al.
Published: (2025)
WARDEN: Endangered Indigenous Language Transcription and Translation with 6 Hours of Training Data
by: Zhang, Ziheng, et al.
Published: (2026)
Breaking the Silence: A Dataset and Benchmark for Bangla Text-to-Gloss Translation
by: Abdullah, Sharif Mohammad, et al.
Published: (2025)
FEA-SLT: A Gloss-Free End-to-End Framework for Facial-Expression-Aware Sign Language Translation
by: Tu, Guobin, et al.
Published: (2026)
In-context Language Learning for Endangered Languages in Speech Recognition
by: Li, Zhaolin, et al.
Published: (2025)
A Case Study on the Impact of Anonymization Along the RAG Pipeline
by: Bodea, Andreea-Elena, et al.
Published: (2026)
A Hybrid Supervised-LLM Pipeline for Actionable Suggestion Mining in Unstructured Customer Reviews
by: Trivedi, Aakash, et al.
Published: (2026)
SignDPO: Multi-level Direct Preference Optimisation for Skeleton-based Gloss-free Sign Language Translation
by: Pu, Muxin, et al.
Published: (2026)