| Main Authors: | Antall, Abdul Rehman, Akhtar, Naveed |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.09865 |
Similar Items
UrduLM: A Resource-Efficient Monolingual Urdu Language Model
by: Ali, Syed Muhammad, et al.
Published: (2026)
Low-Resource Transliteration for Roman-Urdu and Urdu Using Transformer-Based Models
by: Butt, Umer, et al.
Published: (2025)
Modeling Authorial Style in Urdu Novels Using Character Interaction Graphs and Graph Neural Networks
by: Mujtaba, Hassan, et al.
Published: (2025)
Generalists vs. Specialists: Evaluating Large Language Models for Urdu
by: Arif, Samee, et al.
Published: (2024)
Fake News Classification in Urdu: A Domain Adaptation Approach for a Low-Resource Language
by: Ali, Muhammad Zain, et al.
Published: (2025)
Enhanced Urdu Intent Detection with Large Language Models and Prototype-Informed Predictive Pipelines
by: Hassan, Faiza, et al.
Published: (2025)
MALT: Mechanistic Ablation of Lossy Translation in LLMs for a Low-Resource Language: Urdu
by: Bajwa, Taaha Saleem
Published: (2025)
Whispering Context: Distilling Syntax and Semantics for Long Speech Transcripts
by: Altinok, Duygu
Published: (2025)
AI-Generated Text Detection in Low-Resource Languages: A Case Study on Urdu
by: Ammar, Muhammad, et al.
Published: (2025)
uDistil-Whisper: Label-Free Data Filtering for Knowledge Distillation in Low-Data Regimes
by: Waheed, Abdul, et al.
Published: (2024)
LEGAL-UQA: A Low-Resource Urdu-English Dataset for Legal Question Answering
by: Faisal, Faizan, et al.
Published: (2024)
Whispering in Amharic: Fine-tuning Whisper for Low-resource Language
by: Gete, Dawit Ketema, et al.
Published: (2025)
Fine-tuning Whisper on Low-Resource Languages for Real-World Applications
by: Timmel, Vincenzo, et al.
Published: (2024)
Urdu News Article Recommendation Model using Natural Language Processing Techniques
by: Abbas, Syed Zain, et al.
Published: (2022)
Enabling Low-Resource Language Retrieval: Establishing Baselines for Urdu MS MARCO
by: Butt, Umer, et al.
Published: (2024)
UrduLLaMA 1.0: Dataset Curation, Preprocessing, and Evaluation in Low-Resource Settings
by: Fiaz, Layba, et al.
Published: (2025)
Ax-to-Grind Urdu: Benchmark Dataset for Urdu Fake News Detection
by: Harris, Sheetal, et al.
Published: (2024)
A Paradigm Gap in Urdu
by: Adeeba, Farah, et al.
Published: (2025)
Transcript-Prompted Whisper with Dictionary-Enhanced Decoding for Japanese Speech Annotation
by: Hu, Rui, et al.
Published: (2025)
Analyzing and Fine-Tuning Whisper Models for Multilingual Pilot Speech Transcription in the Cockpit
by: Nareddy, Kartheek Kumar Reddy, et al.
Published: (2025)
Evaluating Large Language Models on Urdu Idiom Translation
by: Khan, Muhammad Farmal, et al.
Published: (2025)
A Comprehensive Overview of Large Language Models
by: Naveed, Humza, et al.
Published: (2023)
Left Behind: Cross-Lingual Transfer as a Bridge for Low-Resource Languages in Large Language Models
by: Beibitkhan, Abdul-Salem
Published: (2026)
UrduFactCheck: An Agentic Fact-Checking Framework for Urdu with Evidence Boosting and Benchmarking
by: Ahmad, Sarfraz, et al.
Published: (2025)
Pashto Common Voice: Building the First Open Speech Corpus for a 60-Million-Speaker Low-Resource Language
by: Rahman, Hanif, et al.
Published: (2026)
LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
by: Zhuo, Le, et al.
Published: (2023)
ERUPD -- English to Roman Urdu Parallel Dataset
by: Furqan, Mohammed, et al.
Published: (2024)
Whisper-LM: Improving ASR Models with Language Models for Low-Resource Languages
by: de Zuazo, Xabier, et al.
Published: (2025)
Large Language Models for Computer-Aided Design: A Survey
by: Zhang, Licheng, et al.
Published: (2025)
Enhancing Aviation Communication Transcription: Fine-Tuning Distil-Whisper with LoRA
by: Mirzaei, Shokoufeh, et al.
Published: (2025)
WER We Stand: Benchmarking Urdu ASR Models
by: Arif, Samee, et al.
Published: (2024)
Lost in Transcription, Found in Distribution Shift: Demystifying Hallucination in Speech Foundation Models
by: Atwany, Hanin, et al.
Published: (2025)
Breaking the Transcription Bottleneck: Fine-tuning ASR Models for Extremely Low-Resource Fieldwork Languages
by: Liang, Siyu, et al.
Published: (2025)
A Comparative Study of LLM-based ASR and Whisper in Low Resource and Code Switching Scenario
by: Song, Zheshu, et al.
Published: (2024)
From Statistical Methods to Pre-Trained Models; A Survey on Automatic Speech Recognition for Resource Scarce Urdu Language
by: Sharif, Muhammad, et al.
Published: (2024)
COCO-Urdu: A Large-Scale Urdu Image-Caption Dataset with Multimodal Quality Estimation
by: Hassan, Umair
Published: (2025)
Adaptability of ASR Models on Low-Resource Language: A Comparative Study of Whisper and Wav2Vec-BERT on Bangla
by: Ridoy, Md Sazzadul Islam, et al.
Published: (2025)
AC-Lite : A Lightweight Image Captioning Model for Low-Resource Assamese Language
by: Choudhury, Pankaj, et al.
Published: (2025)
Whisper Turns Stronger: Augmenting Wav2Vec 2.0 for Superior ASR in Low-Resource Languages
by: Anidjar, Or Haim, et al.
Published: (2024)
UrduBench: An Urdu Reasoning Benchmark using Contextually Ensembled Translations with Human-in-the-Loop
by: Shafique, Muhammad Ali, et al.
Published: (2026)