| Main authors: | Suryanarayanan, Sanjay; Song, Haiyue; Khan, Mohammed Safi Ur Rahman; Kunchukuttan, Anoop; Dabre, Raj |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online access: | https://arxiv.org/abs/2411.19096 |
Similar items
IndicIFEval: A Benchmark for Verifiable Instruction-Following Evaluation in 14 Indic Languages
by: Jayakumar, Thanmay, et al.
Published: (2026)
Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs
by: Doddapaneni, Sumanth, et al.
Published: (2024)
IndicRAGSuite: Large-Scale Datasets and a Benchmark for Indian Language RAG Systems
by: Prasanjith, Pasunuti, et al.
Published: (2025)
IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages
by: Khan, Mohammed Safi Ur Rahman, et al.
Published: (2024)
Towards Building Large Scale Datasets and State-of-the-Art Automatic Speech Translation Systems for 14 Indian Languages
by: Sankar, Ashwin, et al.
Published: (2024)
Seeing Isn't Believing: Uncovering Blind Spots in Evaluator Vision-Language Models
by: Khan, Mohammed Safi Ur Rahman, et al.
Published: (2026)
The Reasoning Lingua Franca: A Double-Edged Sword for Multilingual AI
by: Saji, Alan, et al.
Published: (2025)
MILU: A Multi-task Indic Language Understanding Benchmark
by: Verma, Sshubam, et al.
Published: (2024)
Airavata: Introducing Hindi Instruction-tuned LLM
by: Gala, Jay, et al.
Published: (2024)
RiddleBench: A New Generative Reasoning Benchmark for LLMs
by: Halder, Deepon, et al.
Published: (2025)
How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages?
by: Singh, Anushka, et al.
Published: (2024)
Are Language Models Agnostic to Linguistically Grounded Perturbations? A Case Study of Indic Languages
by: Ghosh, Poulami, et al.
Published: (2024)
An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models
by: Mundra, Nandini, et al.
Published: (2024)
RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via Romanization
by: Husain, Jaavid Aktar, et al.
Published: (2024)
RomanLens: The Role Of Latent Romanization In Multilinguality In LLMs
by: Saji, Alan, et al.
Published: (2025)
When Alignment Hurts: Decoupling Representational Spaces in Multilingual Models
by: Elshabrawy, Ahmed, et al.
Published: (2025)
PrahokBART: A Pre-trained Sequence-to-Sequence Model for Khmer Natural Language Generation
by: Kaing, Hour, et al.
Published: (2025)
Can Vision-Language Models Evaluate Handwritten Math?
by: Nath, Oikantik, et al.
Published: (2025)
Finding Blind Spots in Evaluator LLMs with Interpretable Checklists
by: Doddapaneni, Sumanth, et al.
Published: (2024)
Top-b: Entropic Regulation of Relative Probability Bands in Autoregressive Language Processes
by: Halder, Deepon, et al.
Published: (2026)
IteRABRe: Iterative Recovery-Aided Block Reduction
by: Wibowo, Haryo Akbarianto, et al.
Published: (2025)
CharSpan: Utilizing Lexical Similarity to Enable Zero-Shot Machine Translation for Extremely Low-resource Languages
by: Maurya, Kaushal Kumar, et al.
Published: (2023)
Pretraining Language Models Using Translationese
by: Doshi, Meet, et al.
Published: (2024)
Dialect Matters: Cross-Lingual ASR Transfer for Low-Resource Indic Language Varieties
by: Dhasmana, Akriti, et al.
Published: (2026)
FairI Tales: Evaluation of Fairness in Indian Contexts with a Focus on Bias and Stereotypes
by: Nawale, Janki Atul, et al.
Published: (2025)
How effective is Multi-source pivoting for Translation of Low Resource Indian Languages?
by: Gaikwad, Pranav, et al.
Published: (2024)
Scripts Through Time: A Survey of the Evolving Role of Transliteration in NLP
by: Jayakumar, Thanmay, et al.
Published: (2026)
Cross-Lingual Word Alignment for ASEAN Languages with Contrastive Learning
by: Zhang, Jingshen, et al.
Published: (2024)
CycleDistill: Bootstrapping Machine Translation using LLMs with Cyclical Distillation
by: Halder, Deepon, et al.
Published: (2025)
Synthetic Data Generation and Joint Learning for Robust Code-Mixed Translation
by: Kartik, Kartik, et al.
Published: (2024)
An Empirical Study of In-context Learning in LLMs for Machine Translation
by: Chitale, Pranjal A., et al.
Published: (2024)
Why Better Cross-Lingual Alignment Fails for Better Cross-Lingual Transfer: Case of Encoders
by: Veitsman, Yana, et al.
Published: (2026)
Understanding Cross-Lingual Alignment -- A Survey
by: Hämmerl, Katharina, et al.
Published: (2024)
BanglaEmbed: Efficient Sentence Embedding Models for a Low-Resource Language Using Cross-Lingual Distillation Techniques
by: Kabir, Muhammad Rafsan, et al.
Published: (2024)
Unicode Normalization and Grapheme Parsing of Indic Languages
by: Ansary, Nazmuddoha, et al.
Published: (2023)
QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs
by: Khan, Mohammad Aflah, et al.
Published: (2024)
Modular Sentence Encoders: Separating Language Specialization from Cross-Lingual Alignment
by: Huang, Yongxin, et al.
Published: (2024)
A Morphology-Based Investigation of Positional Encodings
by: Ghosh, Poulami, et al.
Published: (2024)
Anveshana: A New Benchmark Dataset for Cross-Lingual Information Retrieval On English Queries and Sanskrit Documents
by: Jagadeeshan, Manoj Balaji, et al.
Published: (2025)
Mark My Words: A Robust Multilingual Model for Punctuation in Text and Speech Transcripts
by: Pulipaka, Sidharth, et al.
Published: (2025)