:: Library Catalog

Bibliographic Details
Main Authors:	Suryanarayanan, Sanjay; Song, Haiyue; Khan, Mohammed Safi Ur Rahman; Kunchukuttan, Anoop; Dabre, Raj
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2411.19096

Similar Items

IndicIFEval: A Benchmark for Verifiable Instruction-Following Evaluation in 14 Indic Languages
by: Jayakumar, Thanmay, et al.
Published: (2026)

Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs
by: Doddapaneni, Sumanth, et al.
Published: (2024)

IndicRAGSuite: Large-Scale Datasets and a Benchmark for Indian Language RAG Systems
by: Prasanjith, Pasunuti, et al.
Published: (2025)

IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages
by: Khan, Mohammed Safi Ur Rahman, et al.
Published: (2024)

Towards Building Large Scale Datasets and State-of-the-Art Automatic Speech Translation Systems for 14 Indian Languages
by: Sankar, Ashwin, et al.
Published: (2024)

Seeing Isn't Believing: Uncovering Blind Spots in Evaluator Vision-Language Models
by: Khan, Mohammed Safi Ur Rahman, et al.
Published: (2026)

The Reasoning Lingua Franca: A Double-Edged Sword for Multilingual AI
by: Saji, Alan, et al.
Published: (2025)

MILU: A Multi-task Indic Language Understanding Benchmark
by: Verma, Sshubam, et al.
Published: (2024)

Airavata: Introducing Hindi Instruction-tuned LLM
by: Gala, Jay, et al.
Published: (2024)

RiddleBench: A New Generative Reasoning Benchmark for LLMs
by: Halder, Deepon, et al.
Published: (2025)

How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages?
by: Singh, Anushka, et al.
Published: (2024)

Are Language Models Agnostic to Linguistically Grounded Perturbations? A Case Study of Indic Languages
by: Ghosh, Poulami, et al.
Published: (2024)

An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models
by: Mundra, Nandini, et al.
Published: (2024)

RomanSetu: Efficiently unlocking multilingual capabilities of Large Language Models via Romanization
by: Husain, Jaavid Aktar, et al.
Published: (2024)

RomanLens: The Role Of Latent Romanization In Multilinguality In LLMs
by: Saji, Alan, et al.
Published: (2025)

When Alignment Hurts: Decoupling Representational Spaces in Multilingual Models
by: Elshabrawy, Ahmed, et al.
Published: (2025)

PrahokBART: A Pre-trained Sequence-to-Sequence Model for Khmer Natural Language Generation
by: Kaing, Hour, et al.
Published: (2025)

Can Vision-Language Models Evaluate Handwritten Math?
by: Nath, Oikantik, et al.
Published: (2025)

Finding Blind Spots in Evaluator LLMs with Interpretable Checklists
by: Doddapaneni, Sumanth, et al.
Published: (2024)

Top-b: Entropic Regulation of Relative Probability Bands in Autoregressive Language Processes
by: Halder, Deepon, et al.
Published: (2026)

IteRABRe: Iterative Recovery-Aided Block Reduction
by: Wibowo, Haryo Akbarianto, et al.
Published: (2025)

CharSpan: Utilizing Lexical Similarity to Enable Zero-Shot Machine Translation for Extremely Low-resource Languages
by: Maurya, Kaushal Kumar, et al.
Published: (2023)

Pretraining Language Models Using Translationese
by: Doshi, Meet, et al.
Published: (2024)

Dialect Matters: Cross-Lingual ASR Transfer for Low-Resource Indic Language Varieties
by: Dhasmana, Akriti, et al.
Published: (2026)

FairI Tales: Evaluation of Fairness in Indian Contexts with a Focus on Bias and Stereotypes
by: Nawale, Janki Atul, et al.
Published: (2025)

How effective is Multi-source pivoting for Translation of Low Resource Indian Languages?
by: Gaikwad, Pranav, et al.
Published: (2024)

Scripts Through Time: A Survey of the Evolving Role of Transliteration in NLP
by: Jayakumar, Thanmay, et al.
Published: (2026)

Cross-Lingual Word Alignment for ASEAN Languages with Contrastive Learning
by: Zhang, Jingshen, et al.
Published: (2024)

CycleDistill: Bootstrapping Machine Translation using LLMs with Cyclical Distillation
by: Halder, Deepon, et al.
Published: (2025)

Synthetic Data Generation and Joint Learning for Robust Code-Mixed Translation
by: Kartik, Kartik, et al.
Published: (2024)

An Empirical Study of In-context Learning in LLMs for Machine Translation
by: Chitale, Pranjal A., et al.
Published: (2024)

Why Better Cross-Lingual Alignment Fails for Better Cross-Lingual Transfer: Case of Encoders
by: Veitsman, Yana, et al.
Published: (2026)

Understanding Cross-Lingual Alignment -- A Survey
by: Hämmerl, Katharina, et al.
Published: (2024)

BanglaEmbed: Efficient Sentence Embedding Models for a Low-Resource Language Using Cross-Lingual Distillation Techniques
by: Kabir, Muhammad Rafsan, et al.
Published: (2024)

Unicode Normalization and Grapheme Parsing of Indic Languages
by: Ansary, Nazmuddoha, et al.
Published: (2023)

QUENCH: Measuring the gap between Indic and Non-Indic Contextual General Reasoning in LLMs
by: Khan, Mohammad Aflah, et al.
Published: (2024)

Modular Sentence Encoders: Separating Language Specialization from Cross-Lingual Alignment
by: Huang, Yongxin, et al.
Published: (2024)

A Morphology-Based Investigation of Positional Encodings
by: Ghosh, Poulami, et al.
Published: (2024)

Anveshana: A New Benchmark Dataset for Cross-Lingual Information Retrieval On English Queries and Sanskrit Documents
by: Jagadeeshan, Manoj Balaji, et al.
Published: (2025)

Mark My Words: A Robust Multilingual Model for Punctuation in Text and Speech Transcripts
by: Pulipaka, Sidharth, et al.
Published: (2025)