| Main Authors: | Vaidya, Aatman, Arora, Arnav, Joshi, Aditya, Prabhakar, Tarunima |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.03677 |
Similar Items
Analysis of Indic Language Capabilities in LLMs
by: Vaidya, Aatman, et al.
Published: (2025)
IndicSQuAD: A Comprehensive Multilingual Question Answering Dataset for Indic Languages
by: Endait, Sharvi, et al.
Published: (2025)
L3Cube-IndicNews: News-based Short Text and Long Document Classification Datasets in Indic Languages
by: Mirashi, Aishwarya, et al.
Published: (2024)
Why Should This Article Be Deleted? Transparent Stance Detection in Multilingual Wikipedia Editor Discussions
by: Kaffee, Lucie-Aimée, et al.
Published: (2023)
L3Cube-IndicQuest: A Benchmark Question Answering Dataset for Evaluating Knowledge of LLMs in Indic Context
by: Rohera, Pritika, et al.
Published: (2024)
Multi-Stage Training for Abusive Comment Detection in Indic Languages
by: Rastogi, Pranshu, et al.
Published: (2026)
MORPHOGEN: A Multilingual Benchmark for Evaluating Gender-Aware Morphological Generation
by: Agarwal, Mehul, et al.
Published: (2026)
Few-Shot Contrastive Adaptation for Audio Abuse Detection in Low-Resource Indic Languages
by: Sankaran, Aditya Narayan, et al.
Published: (2026)
L3Cube-IndicHeadline-ID: A Dataset for Headline Identification and Semantic Evaluation in Low-Resource Indian Languages
by: Tanksale, Nishant, et al.
Published: (2025)
Overview of the VLSP 2023 -- ComOM Shared Task: A Data Challenge for Comparative Opinion Mining from Vietnamese Product Reviews
by: Le, Hoang-Quynh, et al.
Published: (2024)
Revealing Fine-Grained Values and Opinions in Large Language Models
by: Wright, Dustin, et al.
Published: (2024)
Deep Prompt Multi-task Network for Abuse Language Detection
by: Zhu, Jian, et al.
Published: (2024)
IndicSentEval: How Effectively do Multilingual Transformer Models encode Linguistic Properties for Indic Languages?
by: Aravapalli, Akhilesh, et al.
Published: (2024)
DetoxBench: Benchmarking Large Language Models for Multitask Fraud & Abuse Detection
by: Chakraborty, Joymallya, et al.
Published: (2024)
Multi-Modal Framing Analysis of News
by: Arora, Arnav, et al.
Published: (2025)
Aligning Large Language Models to Low-Resource Languages through LLM-Based Selective Translation: A Systematic Study
by: Paul, Rakesh, et al.
Published: (2025)
Parallel Corpora for Machine Translation in Low-resource Indic Languages: A Comprehensive Review
by: Raja, Rahul, et al.
Published: (2025)
Queryable LoRA: Instruction-Regularized Routing Over Shared Low-Rank Update Atoms
by: Vaidya, Omatharv Bharat, et al.
Published: (2026)
IndicMedDialog: A Parallel Multi-Turn Medical Dialogue Dataset for Accessible Healthcare in Indic Languages
by: Nigam, Shubham Kumar, et al.
Published: (2026)
Harnessing Test-time Adaptation for NLU tasks Involving Dialects of English
by: Nguyen, Duke, et al.
Published: (2025)
RLSF: Fine-tuning LLMs via Symbolic Feedback
by: Jha, Piyush, et al.
Published: (2024)
SMS Spam Detection and Classification to Combat Abuse in Telephone Networks Using Natural Language Processing
by: Oyeyemi, Dare Azeez, et al.
Published: (2024)
Adapting Multilingual LLMs to Low-Resource Languages using Continued Pre-training and Synthetic Corpus
by: Joshi, Raviraj, et al.
Published: (2024)
TextGram: Towards a better domain-adaptive pretraining
by: Hiwarkhedkar, Sharayu, et al.
Published: (2024)
In Vino Veritas and Vulnerabilities: Examining LLM Safety via Drunk Language Inducement
by: Shetty, Anudeex, et al.
Published: (2026)
LoRA Soups: Merging LoRAs for Practical Skill Composition Tasks
by: Prabhakar, Akshara, et al.
Published: (2024)
Training Language Models to Reason Efficiently
by: Arora, Daman, et al.
Published: (2025)
Multilingual Open QA on the MIA Shared Task
by: Yarrabelly, Navya, et al.
Published: (2025)
Decoding the Diversity: A Review of the Indic AI Research Landscape
by: KJ, Sankalp, et al.
Published: (2024)
BiasGym: A Simple and Generalizable Framework for Analyzing and Removing Biases through Elicitation
by: Islam, Sekh Mainul, et al.
Published: (2025)
Benchmarking Hindi LLMs: A New Suite of Datasets and a Comparative Analysis
by: Kamath, Anusha, et al.
Published: (2025)
IndicParam: Benchmark to evaluate LLMs on low-resource Indic Languages
by: Maheshwari, Ayush, et al.
Published: (2025)
The Uli Dataset: An Exercise in Experience Led Annotation of oGBV
by: Arora, Arnav, et al.
Published: (2023)
Selective Neuron Amplification in Transformer Language Models
by: Akhtar, Ryyan, et al.
Published: (2026)
Data Contamination Report from the 2024 CONDA Shared Task
by: Sainz, Oscar, et al.
Published: (2024)
Detecting Gender Bias in Course Evaluations
by: Lindau, Sarah, et al.
Published: (2024)
MahaSQuAD: Bridging Linguistic Divides in Marathi Question-Answering
by: Ghatage, Ruturaj, et al.
Published: (2024)
Overview of AuTexTification at IberLEF 2023: Detection and Attribution of Machine-Generated Text in Multiple Domains
by: Sarvazyan, Areg Mikael, et al.
Published: (2023)
PolyKV: A Shared Asymmetrically-Compressed KV Cache Pool for Multi-Agent LLM Inference
by: Patel, Ishan, et al.
Published: (2026)
Basis Sharing: Cross-Layer Parameter Sharing for Large Language Model Compression
by: Wang, Jingcun, et al.
Published: (2024)