| Main Author: | Gupta, Pranav |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2507.18827 |
Similar Items
BanglaSTEM: A Parallel Corpus for Technical Domain Bangla-English Translation
by: Hasan, Kazi Reyazul, et al.
Published: (2025)
BgGPT 1.0: Extending English-centric LLMs to other languages
by: Alexandrov, Anton, et al.
Published: (2024)
Do "English" Named Entity Recognizers Work Well on Global Englishes?
by: Shan, Alexander, et al.
Published: (2024)
A Breadth-First Catalog of Text Processing, Speech Processing and Multimodal Research in South Asian Languages
by: Gupta, Pranav
Published: (2024)
Better To Ask in English? Evaluating Factual Accuracy of Multilingual LLMs in English and Low-Resource Languages
by: Rohera, Pritika, et al.
Published: (2025)
Traditional Readability Formulas Compared for English
by: Lee, Bruce W., et al.
Published: (2023)
Non-native speakers of English or ChatGPT: Who thinks better?
by: Shormani, Mohammed Q.
Published: (2024)
HinTel-AlignBench: A Framework and Benchmark for Hindi-Telugu with English-Aligned Samples
by: Chigrupaatii, Rishikant, et al.
Published: (2025)
Do Multilingual LLMs Think In English?
by: Schut, Lisa, et al.
Published: (2025)
TextAge: A Curated and Diverse Text Dataset for Age Classification
by: Cheekati, Shravan, et al.
Published: (2024)
Harnessing Test-time Adaptation for NLU tasks Involving Dialects of English
by: Nguyen, Duke, et al.
Published: (2025)
CroissantLLM: A Truly Bilingual French-English Language Model
by: Faysse, Manuel, et al.
Published: (2024)
Domain-Aware Speaker Diarization On African-Accented English
by: Okocha, Chibuzor, et al.
Published: (2025)
Edeflip: Supervised Word Translation between English and Yoruba
by: Abioye, Ikeoluwa, et al.
Published: (2025)
Translating Hanja Historical Documents to Contemporary Korean and English
by: Son, Juhee, et al.
Published: (2022)
On the effective transfer of knowledge from English to Hindi Wikipedia
by: Das, Paramita, et al.
Published: (2024)
Which English Do LLMs Prefer? Triangulating Structural Bias Towards American English in Foundation Models
by: Nayeem, Mir Tafseer, et al.
Published: (2026)
Evaluating Machine Translation Models for English-Hindi Language Pairs: A Comparative Analysis
by: Shetty, Ahan Prasannakumar
Published: (2025)
Many-to-English Machine Translation Tools, Data, and Pretrained Models
by: Gowda, Thamme, et al.
Published: (2021)
The Translation Tax Is Not a Scalar: A Counterfactual Audit of English-Source Cue Inheritance in Chinese Multilingual Benchmarks
by: Lin, Zezheng, et al.
Published: (2026)
A comparison of data filtering techniques for English-Polish LLM-based machine translation in the biomedical domain
by: Lérida, Jorge del Pozo, et al.
Published: (2025)
Assessing the validity of new paradigmatic complexity measures as criterial features for proficiency in L2 writings in English
by: Mallart, Cyriel, et al.
Published: (2025)
What makes a word hard to learn? Modeling L1 influence on English vocabulary difficulty
by: Martins, Jonas Mayer, et al.
Published: (2026)
Keyword Extraction, and Aspect Classification in Sinhala, English, and Code-Mixed Content
by: Rizvi, F. A., et al.
Published: (2025)
Exploring Large Language Models for Translating Romanian Computational Problems into English
by: Dumitran, Adrian Marius, et al.
Published: (2025)
On Creating an English-Thai Code-switched Machine Translation in Medical Domain
by: Pengpun, Parinthapat, et al.
Published: (2024)
Enhanced Labeling Technique for Reddit Text and Fine-Tuned Longformer Models for Classifying Depression Severity in English and Luganda
by: Kimera, Richard, et al.
Published: (2024)
Consistency Is the Key: Detecting Hallucinations in LLM Generated Text By Checking Inconsistencies About Key Facts
by: Gupta, Raavi, et al.
Published: (2025)
Building Large-Scale English-Romanian Literary Translation Resources with Open Models
by: Nadas, Mihai, et al.
Published: (2025)
Enhancing Multilingual Sentiment Analysis with Explainability for Sinhala, English, and Code-Mixed Content
by: Rizvi, Azmarah, et al.
Published: (2025)
Chapter Bringing down the wall of native-speakerism in English language teaching
by: Llurda, Enric
Published: (2026)
The Saturation Point of Backtranslation in High Quality Low Resource English Gujarati Machine Translation
by: Arif, Arwa
Published: (2025)
Can General-Purpose Large Language Models Generalize to English-Thai Machine Translation ?
by: Chiaranaipanich, Jirat, et al.
Published: (2024)
Automated Multi-Language to English Machine Translation Using Generative Pre-Trained Transformers
by: Pelofske, Elijah, et al.
Published: (2024)
Gujarati-English Code-Switching Speech Recognition using ensemble prediction of spoken language
by: Sharma, Yash, et al.
Published: (2024)
Towards Santali Linguistic Inclusion: Building the First Santali-to-English Translation Model using mT5 Transformer and Data Augmentation
by: Billah, Syed Mohammed Mostaque, et al.
Published: (2024)
Does Continued Pretraining on a Learner Corpus Improve Automated Essay Scoring on English Proficiency Tests? Evidence from EFCAMDAT
by: Nguyen, Duy Anh
Published: (2026)
Sociodemographic Bias in Language Models: A Survey and Forward Path
by: Gupta, Vipul, et al.
Published: (2023)
Creating and Evaluating Code-Mixed Nepali-English and Telugu-English Datasets for Abusive Language Detection Using Traditional and Deep Learning Models
by: Pandey, Manish, et al.
Published: (2025)
A comparison of oral evaluation ratings by native English speaker teachers and non-native English speaker teachers
by: Baitman, Brittany
Published: (2013)