Saved in:
| Main Authors: | Maiti, Agniva, Pandey, Manya, Mandal, Murari |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.12537 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
TharuChat: Bootstrapping Large Language Models for a Low-Resource Language via Synthetic Data and Human Validation
by: Panth, Prajwal, et al.
Published: (2026)
by: Panth, Prajwal, et al.
Published: (2026)
Connecting Ideas in 'Lower-Resource' Scenarios: NLP for National Varieties, Creoles and Other Low-resource Scenarios
by: Joshi, Aditya, et al.
Published: (2024)
by: Joshi, Aditya, et al.
Published: (2024)
Confidence is Not Competence
by: Sanyal, Debdeep, et al.
Published: (2025)
by: Sanyal, Debdeep, et al.
Published: (2025)
Dealing with the Hard Facts of Low-Resource African NLP
by: Diarra, Yacouba, et al.
Published: (2025)
by: Diarra, Yacouba, et al.
Published: (2025)
Exploring NLP Benchmarks in an Extremely Low-Resource Setting
by: Nuha, Ulin, et al.
Published: (2025)
by: Nuha, Ulin, et al.
Published: (2025)
CoDa: Constrained Generation based Data Augmentation for Low-Resource NLP
by: Evuru, Chandra Kiran Reddy, et al.
Published: (2024)
by: Evuru, Chandra Kiran Reddy, et al.
Published: (2024)
From Automation to Collaboration: Human-in-the-Loop Methods for Safe and Trustworthy NLP
by: Samu, Most. Sharmin Sultana, et al.
Published: (2026)
by: Samu, Most. Sharmin Sultana, et al.
Published: (2026)
Towards Open-Ended Discovery for Low-Resource NLP
by: Dossou, Bonaventure F. P., et al.
Published: (2025)
by: Dossou, Bonaventure F. P., et al.
Published: (2025)
On Importance of Pruning and Distillation for Efficient Low Resource NLP
by: Mirashi, Aishwarya, et al.
Published: (2024)
by: Mirashi, Aishwarya, et al.
Published: (2024)
NaijaNLP: A Survey of Nigerian Low-Resource Languages
by: Inuwa-Dutse, Isa
Published: (2025)
by: Inuwa-Dutse, Isa
Published: (2025)
TurkicNLP: An NLP Toolkit for Turkic Languages
by: Hakimov, Sherzod
Published: (2026)
by: Hakimov, Sherzod
Published: (2026)
The Nature of NLP: Analyzing Contributions in NLP Papers
by: Pramanick, Aniket, et al.
Published: (2024)
by: Pramanick, Aniket, et al.
Published: (2024)
Textless NLP -- Zero Resource Challenge with Low Resource Compute
by: Ramadass, Krithiga, et al.
Published: (2024)
by: Ramadass, Krithiga, et al.
Published: (2024)
LLM as a Meta-Judge: Synthetic Data for NLP Evaluation Metric Validation
by: Eigler, Lukáš, et al.
Published: (2026)
by: Eigler, Lukáš, et al.
Published: (2026)
GhanaNLP Parallel Corpora: Comprehensive Multilingual Resources for Low-Resource Ghanaian Languages
by: Gyamfi, Lawrence Adu, et al.
Published: (2026)
by: Gyamfi, Lawrence Adu, et al.
Published: (2026)
NLP-ADBench: NLP Anomaly Detection Benchmark
by: Li, Yuangang, et al.
Published: (2024)
by: Li, Yuangang, et al.
Published: (2024)
A Low-Resource Speech-Driven NLP Pipeline for Sinhala Dyslexia Assistance
by: Perera, Peshala, et al.
Published: (2025)
by: Perera, Peshala, et al.
Published: (2025)
Improving Clinical NLP Performance through Language Model-Generated Synthetic Clinical Data
by: Chen, Shan, et al.
Published: (2024)
by: Chen, Shan, et al.
Published: (2024)
Building Community-Centred NLP Resources for Puno Quechua
by: Huaman, Elwin, et al.
Published: (2026)
by: Huaman, Elwin, et al.
Published: (2026)
The Annotation Scarcity Paradox in Low-Resource NLP Evaluation: A Decade of Acceleration and Emerging Constraints
by: Marivate, Vukosi
Published: (2026)
by: Marivate, Vukosi
Published: (2026)
SYNTHEVAL: Hybrid Behavioral Testing of NLP Models with Synthetic CheckLists
by: Zhao, Raoyuan, et al.
Published: (2024)
by: Zhao, Raoyuan, et al.
Published: (2024)
Beyond Catalogue Counts: the Dataset Visibility Asymmetry in Low-Resource Multilingual NLP
by: Tan, Zhiyin, et al.
Published: (2026)
by: Tan, Zhiyin, et al.
Published: (2026)
Foundations and Evaluations in NLP
by: Park, Jungyeul
Published: (2025)
by: Park, Jungyeul
Published: (2025)
The African Languages Lab: A Collaborative Approach to Advancing Low-Resource African NLP
by: Issaka, Sheriff, et al.
Published: (2025)
by: Issaka, Sheriff, et al.
Published: (2025)
Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects
by: Ahia, Orevaoghene, et al.
Published: (2024)
by: Ahia, Orevaoghene, et al.
Published: (2024)
AraFinNLP 2024: The First Arabic Financial NLP Shared Task
by: Malaysha, Sanad, et al.
Published: (2024)
by: Malaysha, Sanad, et al.
Published: (2024)
Part-of-speech tagging for Nagamese Language using CRF
by: Shohe, Alovi N, et al.
Published: (2025)
by: Shohe, Alovi N, et al.
Published: (2025)
TajPersLexon: A Tajik-Persian Lexical Resource and Hybrid Model for Cross-Script Low-Resource NLP
by: Arabov, Mullosharaf K.
Published: (2026)
by: Arabov, Mullosharaf K.
Published: (2026)
Agents Are All You Need for LLM Unlearning
by: Sanyal, Debdeep, et al.
Published: (2025)
by: Sanyal, Debdeep, et al.
Published: (2025)
How Good is Your Wikipedia? Auditing Data Quality for Low-resource and Multilingual NLP
by: Tatariya, Kushal, et al.
Published: (2024)
by: Tatariya, Kushal, et al.
Published: (2024)
Pretraining Language Models with Subword Regularization: An Empirical Study of BPE Dropout in Low-Resource NLP
by: Visser, Ruan, et al.
Published: (2026)
by: Visser, Ruan, et al.
Published: (2026)
Trans-Tokenization and Cross-lingual Vocabulary Transfers: Language Adaptation of LLMs for Low-Resource NLP
by: Remy, François, et al.
Published: (2024)
by: Remy, François, et al.
Published: (2024)
Indigenous Languages Spoken in Argentina: A Survey of NLP and Speech Resources
by: Ticona, Belu, et al.
Published: (2025)
by: Ticona, Belu, et al.
Published: (2025)
EduNLP: Towards a Unified and Modularized Library for Educational Resources
by: Huang, Zhenya, et al.
Published: (2024)
by: Huang, Zhenya, et al.
Published: (2024)
Evaluation Metrics for Text Data Augmentation in NLP
by: Amadeus, Marcellus, et al.
Published: (2024)
by: Amadeus, Marcellus, et al.
Published: (2024)
What is "Typological Diversity" in NLP?
by: Ploeger, Esther, et al.
Published: (2024)
by: Ploeger, Esther, et al.
Published: (2024)
GR-NLP-TOOLKIT: An Open-Source NLP Toolkit for Modern Greek
by: Loukas, Lefteris, et al.
Published: (2024)
by: Loukas, Lefteris, et al.
Published: (2024)
NLP-AKG: Few-Shot Construction of NLP Academic Knowledge Graph Based on LLM
by: Lan, Jiayin, et al.
Published: (2025)
by: Lan, Jiayin, et al.
Published: (2025)
Few-Shot Optimized Framework for Hallucination Detection in Resource-Limited NLP Systems
by: Hikal, Baraa, et al.
Published: (2025)
by: Hikal, Baraa, et al.
Published: (2025)
NLP Privacy Risk Identification in Social Media (NLP-PRISM): A Survey
by: Goswami, Dhiman, et al.
Published: (2026)
by: Goswami, Dhiman, et al.
Published: (2026)
Similar Items
-
TharuChat: Bootstrapping Large Language Models for a Low-Resource Language via Synthetic Data and Human Validation
by: Panth, Prajwal, et al.
Published: (2026) -
Connecting Ideas in 'Lower-Resource' Scenarios: NLP for National Varieties, Creoles and Other Low-resource Scenarios
by: Joshi, Aditya, et al.
Published: (2024) -
Confidence is Not Competence
by: Sanyal, Debdeep, et al.
Published: (2025) -
Dealing with the Hard Facts of Low-Resource African NLP
by: Diarra, Yacouba, et al.
Published: (2025) -
Exploring NLP Benchmarks in an Extremely Low-Resource Setting
by: Nuha, Ulin, et al.
Published: (2025)