:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Gupta, Pranav
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2507.18827
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

BanglaSTEM: A Parallel Corpus for Technical Domain Bangla-English Translation
by: Hasan, Kazi Reyazul, et al.
Published: (2025)

BgGPT 1.0: Extending English-centric LLMs to other languages
by: Alexandrov, Anton, et al.
Published: (2024)

Do "English" Named Entity Recognizers Work Well on Global Englishes?
by: Shan, Alexander, et al.
Published: (2024)

A Breadth-First Catalog of Text Processing, Speech Processing and Multimodal Research in South Asian Languages
by: Gupta, Pranav
Published: (2024)

Better To Ask in English? Evaluating Factual Accuracy of Multilingual LLMs in English and Low-Resource Languages
by: Rohera, Pritika, et al.
Published: (2025)

Traditional Readability Formulas Compared for English
by: Lee, Bruce W., et al.
Published: (2023)

Non-native speakers of English or ChatGPT: Who thinks better?
by: Shormani, Mohammed Q.
Published: (2024)

HinTel-AlignBench: A Framework and Benchmark for Hindi-Telugu with English-Aligned Samples
by: Chigrupaatii, Rishikant, et al.
Published: (2025)

Do Multilingual LLMs Think In English?
by: Schut, Lisa, et al.
Published: (2025)

TextAge: A Curated and Diverse Text Dataset for Age Classification
by: Cheekati, Shravan, et al.
Published: (2024)

Harnessing Test-time Adaptation for NLU tasks Involving Dialects of English
by: Nguyen, Duke, et al.
Published: (2025)

CroissantLLM: A Truly Bilingual French-English Language Model
by: Faysse, Manuel, et al.
Published: (2024)

Domain-Aware Speaker Diarization On African-Accented English
by: Okocha, Chibuzor, et al.
Published: (2025)

Edeflip: Supervised Word Translation between English and Yoruba
by: Abioye, Ikeoluwa, et al.
Published: (2025)

Translating Hanja Historical Documents to Contemporary Korean and English
by: Son, Juhee, et al.
Published: (2022)

On the effective transfer of knowledge from English to Hindi Wikipedia
by: Das, Paramita, et al.
Published: (2024)

Which English Do LLMs Prefer? Triangulating Structural Bias Towards American English in Foundation Models
by: Nayeem, Mir Tafseer, et al.
Published: (2026)

Evaluating Machine Translation Models for English-Hindi Language Pairs: A Comparative Analysis
by: Shetty, Ahan Prasannakumar
Published: (2025)

Many-to-English Machine Translation Tools, Data, and Pretrained Models
by: Gowda, Thamme, et al.
Published: (2021)

The Translation Tax Is Not a Scalar: A Counterfactual Audit of English-Source Cue Inheritance in Chinese Multilingual Benchmarks
by: Lin, Zezheng, et al.
Published: (2026)

A comparison of data filtering techniques for English-Polish LLM-based machine translation in the biomedical domain
by: Lérida, Jorge del Pozo, et al.
Published: (2025)

Assessing the validity of new paradigmatic complexity measures as criterial features for proficiency in L2 writings in English
by: Mallart, Cyriel, et al.
Published: (2025)

What makes a word hard to learn? Modeling L1 influence on English vocabulary difficulty
by: Martins, Jonas Mayer, et al.
Published: (2026)

Keyword Extraction, and Aspect Classification in Sinhala, English, and Code-Mixed Content
by: Rizvi, F. A., et al.
Published: (2025)

Exploring Large Language Models for Translating Romanian Computational Problems into English
by: Dumitran, Adrian Marius, et al.
Published: (2025)

On Creating an English-Thai Code-switched Machine Translation in Medical Domain
by: Pengpun, Parinthapat, et al.
Published: (2024)

Enhanced Labeling Technique for Reddit Text and Fine-Tuned Longformer Models for Classifying Depression Severity in English and Luganda
by: Kimera, Richard, et al.
Published: (2024)

Consistency Is the Key: Detecting Hallucinations in LLM Generated Text By Checking Inconsistencies About Key Facts
by: Gupta, Raavi, et al.
Published: (2025)

Building Large-Scale English-Romanian Literary Translation Resources with Open Models
by: Nadas, Mihai, et al.
Published: (2025)

Enhancing Multilingual Sentiment Analysis with Explainability for Sinhala, English, and Code-Mixed Content
by: Rizvi, Azmarah, et al.
Published: (2025)

Chapter Bringing down the wall of native-speakerism in English language teaching
by: Llurda, Enric
Published: (2026)

The Saturation Point of Backtranslation in High Quality Low Resource English Gujarati Machine Translation
by: Arif, Arwa
Published: (2025)

Can General-Purpose Large Language Models Generalize to English-Thai Machine Translation ?
by: Chiaranaipanich, Jirat, et al.
Published: (2024)

Automated Multi-Language to English Machine Translation Using Generative Pre-Trained Transformers
by: Pelofske, Elijah, et al.
Published: (2024)

Gujarati-English Code-Switching Speech Recognition using ensemble prediction of spoken language
by: Sharma, Yash, et al.
Published: (2024)

Towards Santali Linguistic Inclusion: Building the First Santali-to-English Translation Model using mT5 Transformer and Data Augmentation
by: Billah, Syed Mohammed Mostaque, et al.
Published: (2024)

Does Continued Pretraining on a Learner Corpus Improve Automated Essay Scoring on English Proficiency Tests? Evidence from EFCAMDAT
by: Nguyen, Duy Anh
Published: (2026)

Sociodemographic Bias in Language Models: A Survey and Forward Path
by: Gupta, Vipul, et al.
Published: (2023)

Creating and Evaluating Code-Mixed Nepali-English and Telugu-English Datasets for Abusive Language Detection Using Traditional and Deep Learning Models
by: Pandey, Manish, et al.
Published: (2025)

A comparison of oral evaluation ratings by native English speaker teachers and non-native English speaker teachers
by: Brittany Baitman
Published: (2013)