:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Furtado, Anna Beatriz Dimas, Ranasinghe, Tharindu, Blain, Frédéric, Mitkov, Ruslan
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2403.18018
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

LLM-based Embedders for Prior Case Retrieval
by: Premasiri, Damith, et al.
Published: (2025)

AHaSIS: Shared Task on Sentiment Analysis for Arabic Dialects
by: Alharbi, Maram, et al.
Published: (2025)

A Federated Learning Approach to Privacy Preserving Offensive Language Identification
by: Zampieri, Marcos, et al.
Published: (2024)

Guided Distant Supervision for Multilingual Relation Extraction Data: Adapting to a New Language
by: Plum, Alistair, et al.
Published: (2024)

From Text to Graph: Leveraging Graph Neural Networks for Enhanced Explainability in NLP
by: Yáñez-Romero, Fabio, et al.
Published: (2025)

NSINA: A News Corpus for Sinhala
by: Hettiarachchi, Hansi, et al.
Published: (2024)

Vicarious Offense and Noise Audit of Offensive Speech Classifiers: Unifying Human and Machine Disagreement on What is Offensive
by: Weerasooriya, Tharindu Cyril, et al.
Published: (2023)

ALEXSIS-PT: A New Resource for Portuguese Lexical Simplification
by: North, Kai, et al.
Published: (2022)

Overview of the First Workshop on Language Models for Low-Resource Languages (LoResLM 2025)
by: Hettiarachchi, Hansi, et al.
Published: (2024)

SOLD: Sinhala Offensive Language Dataset
by: Ranasinghe, Tharindu, et al.
Published: (2022)

What do Large Language Models Need for Machine Translation Evaluation?
by: Qian, Shenbin, et al.
Published: (2024)

Text Generation Models for Luxembourgish with Limited Data: A Balanced Multilingual Strategy
by: Plum, Alistair, et al.
Published: (2024)

Do LLMs Judge Distantly Supervised Named Entity Labels Well? Constructing the JudgeWEL Dataset
by: Plum, Alistair, et al.
Published: (2026)

ALBA: A European Portuguese Benchmark for Evaluating Language and Linguistic Dimensions in Generative LLMs
by: Vieira, Inês, et al.
Published: (2026)

Can Knowledge Graphs Reduce Hallucinations in LLMs? : A Survey
by: Agrawal, Garima, et al.
Published: (2023)

Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation
by: Li, Hao, et al.
Published: (2024)

EUROPA: A Legal Multilingual Keyphrase Generation Dataset
by: Salaün, Olivier, et al.
Published: (2024)

Brazilian Portuguese Image Captioning with Transformers: A Study on Cross-Native-Translated Dataset
by: Bromonschenkel, Gabriel, et al.
Published: (2026)

Tucano: Advancing Neural Text Generation for Portuguese
by: Corrêa, Nicholas Kluge, et al.
Published: (2024)

Cross-Platform Hate Speech Detection with Weakly Supervised Causal Disentanglement
by: Sheth, Paras, et al.
Published: (2024)

DefAn: Definitive Answer Dataset for LLMs Hallucination Evaluation
by: Rahman, A B M Ashikur, et al.
Published: (2024)

Towards Generalized Offensive Language Identification
by: Dmonte, Alphaeus, et al.
Published: (2024)

AMALIA Technical Report: A Fully Open Source Large Language Model for European Portuguese
by: Simplício, Afonso, et al.
Published: (2026)

SALSA: Single-pass Autoregressive LLM Structured Classification
by: Berdichevsky, Ruslan, et al.
Published: (2025)

Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions
by: Mohammadi, Seyedali, et al.
Published: (2025)

ARTICLE: Annotator Reliability Through In-Context Learning
by: Dutta, Sujan, et al.
Published: (2024)

Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions
by: Valentino, Marco, et al.
Published: (2023)

Odysseys: Benchmarking Web Agents on Realistic Long Horizon Tasks
by: Jang, Lawrence Keunho, et al.
Published: (2026)

FalAR: A Large-scale Speaker-Annotated European Portuguese Speech Corpus of Parliamentary Sessions
by: Teixeira, Francisco, et al.
Published: (2026)

A Unified Definition of Hallucination: It's The World Model, Stupid!
by: Liu, Emmy, et al.
Published: (2025)

Fairness Definitions in Language Models Explained
by: Yin, Zhipeng, et al.
Published: (2024)

TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese
by: Corrêa, Nicholas Kluge, et al.
Published: (2024)

MultiLS: A Multi-task Lexical Simplification Framework
by: North, Kai, et al.
Published: (2024)

Multi-Agent Computer Use
by: Koh, Jing Yu, et al.
Published: (2026)

A Survey on Multilingual Mental Disorders Detection from Social Media Data
by: Bucur, Ana-Maria, et al.
Published: (2025)

LPI-RIT at LeWiDi-2025: Improving Distributional Predictions via Metadata and Loss Reweighting with DisCo
by: Sawkar, Mandira, et al.
Published: (2025)

Predictive Authoring for Brazilian Portuguese Augmentative and Alternative Communication
by: Pereira, Jayr, et al.
Published: (2023)

DFA-RAG: Conversational Semantic Router for Large Language Model with Definite Finite Automaton
by: Sun, Yiyou, et al.
Published: (2024)

WolBanking77: Wolof Banking Speech Intent Classification Dataset
by: Kandji, Abdou Karim, et al.
Published: (2025)

Training a Generally Curious Agent
by: Tajwar, Fahim, et al.
Published: (2025)