Saved in:
| Main Authors: | Baiju, Bajiyo, Manohar, Kavya, Pillai, Leena G, Sherly, Elizabeth |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.09957 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations
by: Manohar, Kavya, et al.
Published: (2024)
by: Manohar, Kavya, et al.
Published: (2024)
Multistage Fine-tuning Strategies for Automatic Speech Recognition in Low-resource Languages
by: Pillai, Leena G, et al.
Published: (2024)
by: Pillai, Leena G, et al.
Published: (2024)
Tracking Articulatory Dynamics in Speech with a Fixed-Weight BiLSTM-CNN Architecture
by: Pillai, Leena G, et al.
Published: (2025)
by: Pillai, Leena G, et al.
Published: (2025)
One Script Instead of Hundreds? On Pretraining Romanized Encoder Language Models
by: Ebing, Benedikt, et al.
Published: (2026)
by: Ebing, Benedikt, et al.
Published: (2026)
Low-Resource Transliteration for Roman-Urdu and Urdu Using Transformer-Based Models
by: Butt, Umer, et al.
Published: (2025)
by: Butt, Umer, et al.
Published: (2025)
AyutthayaAlpha: A Thai-Latin Script Transliteration Transformer
by: Lauc, Davor, et al.
Published: (2024)
by: Lauc, Davor, et al.
Published: (2024)
A Tale of Two Scripts: Transliteration and Post-Correction for Judeo-Arabic
by: Gonzalez, Juan Moreno, et al.
Published: (2025)
by: Gonzalez, Juan Moreno, et al.
Published: (2025)
Scripts Through Time: A Survey of the Evolving Role of Transliteration in NLP
by: Jayakumar, Thanmay, et al.
Published: (2026)
by: Jayakumar, Thanmay, et al.
Published: (2026)
Linear Script Representations in Speech Foundation Models Enable Zero-Shot Transliteration
by: Shim, Ryan Soh-Eun, et al.
Published: (2026)
by: Shim, Ryan Soh-Eun, et al.
Published: (2026)
Script Gap: Evaluating LLM Triage on Indian Languages in Native vs Romanized Scripts in a Real World Setting
by: Khullar, Manurag, et al.
Published: (2025)
by: Khullar, Manurag, et al.
Published: (2025)
Swa-bhasha Resource Hub: Romanized Sinhala to Sinhala Transliteration Systems and Data Resources
by: Sumanathilaka, Deshan, et al.
Published: (2025)
by: Sumanathilaka, Deshan, et al.
Published: (2025)
Breaking the Script Barrier in Multilingual Pre-Trained Language Models with Transliteration-Based Post-Training Alignment
by: Xhelili, Orgest, et al.
Published: (2024)
by: Xhelili, Orgest, et al.
Published: (2024)
Exploring the Role of Transliteration in In-Context Learning for Low-resource Languages Written in Non-Latin Scripts
by: Ma, Chunlan, et al.
Published: (2024)
by: Ma, Chunlan, et al.
Published: (2024)
IndoNLP 2025: Shared Task on Real-Time Reverse Transliteration for Romanized Indo-Aryan languages
by: Sumanathilaka, Deshan, et al.
Published: (2025)
by: Sumanathilaka, Deshan, et al.
Published: (2025)
Acoustic to Articulatory Inversion of Speech; Data Driven Approaches, Challenges, Applications, and Future Scope
by: Pillai, Leena G, et al.
Published: (2025)
by: Pillai, Leena G, et al.
Published: (2025)
Script Sensitivity: Benchmarking Language Models on Unicode, Romanized and Mixed-Script Sinhala
by: Rajapakse, Minuri, et al.
Published: (2026)
by: Rajapakse, Minuri, et al.
Published: (2026)
Evaluating Cultural Awareness of LLMs for Yoruba, Malayalam, and English
by: Dawson, Fiifi, et al.
Published: (2024)
by: Dawson, Fiifi, et al.
Published: (2024)
Scalable Offline ASR for Command-Style Dictation in Courtrooms
by: Nethil, Kumarmanas, et al.
Published: (2025)
by: Nethil, Kumarmanas, et al.
Published: (2025)
Encoder-Decoder or Decoder-Only? Revisiting Encoder-Decoder Large Language Model
by: Zhang, Biao, et al.
Published: (2025)
by: Zhang, Biao, et al.
Published: (2025)
ILID: Native Script Language Identification for Indian Languages
by: Ingle, Yash, et al.
Published: (2025)
by: Ingle, Yash, et al.
Published: (2025)
Neural Machine Translation for Malayalam Paraphrase Generation
by: Varghese, Christeena, et al.
Published: (2024)
by: Varghese, Christeena, et al.
Published: (2024)
How Transliterations Improve Crosslingual Alignment
by: Liu, Yihong, et al.
Published: (2024)
by: Liu, Yihong, et al.
Published: (2024)
Vividh-ASR: A Complexity-Tiered Benchmark and Optimization Dynamics for Robust Indic Speech Recognition
by: Juvekar, Kush, et al.
Published: (2026)
by: Juvekar, Kush, et al.
Published: (2026)
Malayalam Sign Language Identification using Finetuned YOLOv8 and Computer Vision Techniques
by: K., Abhinand, et al.
Published: (2024)
by: K., Abhinand, et al.
Published: (2024)
Connecting the Persian-speaking World through Transliteration
by: Merchant, Rayyan, et al.
Published: (2025)
by: Merchant, Rayyan, et al.
Published: (2025)
Jailbreaking LLMs with Arabic Transliteration and Arabizi
by: Ghanim, Mansour Al, et al.
Published: (2024)
by: Ghanim, Mansour Al, et al.
Published: (2024)
Language Detection for Transliterated Content
by: S, Selva Kumar, et al.
Published: (2024)
by: S, Selva Kumar, et al.
Published: (2024)
Efficient Encoder-Decoder Transformer Decoding for Decomposable Tasks
by: Lu, Bo-Ru, et al.
Published: (2024)
by: Lu, Bo-Ru, et al.
Published: (2024)
DecoderLens: Layerwise Interpretation of Encoder-Decoder Transformers
by: Langedijk, Anna, et al.
Published: (2023)
by: Langedijk, Anna, et al.
Published: (2023)
Swa Bhasha: Message-Based Singlish to Sinhala Transliteration
by: Athukorala, Maneesha U., et al.
Published: (2024)
by: Athukorala, Maneesha U., et al.
Published: (2024)
SumTablets: A Transliteration Dataset of Sumerian Tablets
by: Simmons, Cole, et al.
Published: (2026)
by: Simmons, Cole, et al.
Published: (2026)
ParsTranslit: Truly Versatile Tajik-Farsi Transliteration
by: Merchant, Rayyan, et al.
Published: (2025)
by: Merchant, Rayyan, et al.
Published: (2025)
Decoders Laugh as Loud as Encoders
by: Borodach, Eli, et al.
Published: (2025)
by: Borodach, Eli, et al.
Published: (2025)
EDDA: A Encoder-Decoder Data Augmentation Framework for Zero-Shot Stance Detection
by: Ding, Daijun, et al.
Published: (2024)
by: Ding, Daijun, et al.
Published: (2024)
Extract-and-Abstract: Unifying Extractive and Abstractive Summarization within Single Encoder-Decoder Framework
by: Wu, Yuping, et al.
Published: (2024)
by: Wu, Yuping, et al.
Published: (2024)
Beyond Specialization: Benchmarking LLMs for Transliteration of Indian Languages
by: Azam, Gulfarogh, et al.
Published: (2025)
by: Azam, Gulfarogh, et al.
Published: (2025)
TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data
by: Liu, Yihong, et al.
Published: (2024)
by: Liu, Yihong, et al.
Published: (2024)
Transcribe, Translate, or Transliterate: An Investigation of Intermediate Representations in Spoken Language Models
by: Ògúnrèmí, Tolúlopé, et al.
Published: (2025)
by: Ògúnrèmí, Tolúlopé, et al.
Published: (2025)
Historic Scripts to Modern Vision: A Novel Dataset and A VLM Framework for Transliteration of Modi Script to Devanagari
by: Kausadikar, Harshal, et al.
Published: (2025)
by: Kausadikar, Harshal, et al.
Published: (2025)
Happiness is Sharing a Vocabulary: A Study of Transliteration Methods
by: Jung, Haeji, et al.
Published: (2025)
by: Jung, Haeji, et al.
Published: (2025)
Similar Items
-
What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations
by: Manohar, Kavya, et al.
Published: (2024) -
Multistage Fine-tuning Strategies for Automatic Speech Recognition in Low-resource Languages
by: Pillai, Leena G, et al.
Published: (2024) -
Tracking Articulatory Dynamics in Speech with a Fixed-Weight BiLSTM-CNN Architecture
by: Pillai, Leena G, et al.
Published: (2025) -
One Script Instead of Hundreds? On Pretraining Romanized Encoder Language Models
by: Ebing, Benedikt, et al.
Published: (2026) -
Low-Resource Transliteration for Roman-Urdu and Urdu Using Transformer-Based Models
by: Butt, Umer, et al.
Published: (2025)