| Main Authors: | Furtado, Anna Beatriz Dimas; Ranasinghe, Tharindu; Blain, Frédéric; Mitkov, Ruslan |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.18018 |
Similar Items
LLM-based Embedders for Prior Case Retrieval
by: Premasiri, Damith, et al.
Published: (2025)
AHaSIS: Shared Task on Sentiment Analysis for Arabic Dialects
by: Alharbi, Maram, et al.
Published: (2025)
A Federated Learning Approach to Privacy Preserving Offensive Language Identification
by: Zampieri, Marcos, et al.
Published: (2024)
Guided Distant Supervision for Multilingual Relation Extraction Data: Adapting to a New Language
by: Plum, Alistair, et al.
Published: (2024)
From Text to Graph: Leveraging Graph Neural Networks for Enhanced Explainability in NLP
by: Yáñez-Romero, Fabio, et al.
Published: (2025)
NSINA: A News Corpus for Sinhala
by: Hettiarachchi, Hansi, et al.
Published: (2024)
Vicarious Offense and Noise Audit of Offensive Speech Classifiers: Unifying Human and Machine Disagreement on What is Offensive
by: Weerasooriya, Tharindu Cyril, et al.
Published: (2023)
ALEXSIS-PT: A New Resource for Portuguese Lexical Simplification
by: North, Kai, et al.
Published: (2022)
Overview of the First Workshop on Language Models for Low-Resource Languages (LoResLM 2025)
by: Hettiarachchi, Hansi, et al.
Published: (2024)
SOLD: Sinhala Offensive Language Dataset
by: Ranasinghe, Tharindu, et al.
Published: (2022)
What do Large Language Models Need for Machine Translation Evaluation?
by: Qian, Shenbin, et al.
Published: (2024)
Text Generation Models for Luxembourgish with Limited Data: A Balanced Multilingual Strategy
by: Plum, Alistair, et al.
Published: (2024)
Do LLMs Judge Distantly Supervised Named Entity Labels Well? Constructing the JudgeWEL Dataset
by: Plum, Alistair, et al.
Published: (2026)
ALBA: A European Portuguese Benchmark for Evaluating Language and Linguistic Dimensions in Generative LLMs
by: Vieira, Inês, et al.
Published: (2026)
Can Knowledge Graphs Reduce Hallucinations in LLMs? : A Survey
by: Agrawal, Garima, et al.
Published: (2023)
Which Side Are You On? A Multi-task Dataset for End-to-End Argument Summarisation and Evaluation
by: Li, Hao, et al.
Published: (2024)
EUROPA: A Legal Multilingual Keyphrase Generation Dataset
by: Salaün, Olivier, et al.
Published: (2024)
Brazilian Portuguese Image Captioning with Transformers: A Study on Cross-Native-Translated Dataset
by: Bromonschenkel, Gabriel, et al.
Published: (2026)
Tucano: Advancing Neural Text Generation for Portuguese
by: Corrêa, Nicholas Kluge, et al.
Published: (2024)
Cross-Platform Hate Speech Detection with Weakly Supervised Causal Disentanglement
by: Sheth, Paras, et al.
Published: (2024)
DefAn: Definitive Answer Dataset for LLMs Hallucination Evaluation
by: Rahman, A B M Ashikur, et al.
Published: (2024)
Towards Generalized Offensive Language Identification
by: Dmonte, Alphaeus, et al.
Published: (2024)
AMALIA Technical Report: A Fully Open Source Large Language Model for European Portuguese
by: Simplício, Afonso, et al.
Published: (2026)
SALSA: Single-pass Autoregressive LLM Structured Classification
by: Berdichevsky, Ruslan, et al.
Published: (2025)
Do LLMs Adhere to Label Definitions? Examining Their Receptivity to External Label Definitions
by: Mohammadi, Seyedali, et al.
Published: (2025)
ARTICLE: Annotator Reliability Through In-Context Learning
by: Dutta, Sujan, et al.
Published: (2024)
Multi-Relational Hyperbolic Word Embeddings from Natural Language Definitions
by: Valentino, Marco, et al.
Published: (2023)
Odysseys: Benchmarking Web Agents on Realistic Long Horizon Tasks
by: Jang, Lawrence Keunho, et al.
Published: (2026)
FalAR: A Large-scale Speaker-Annotated European Portuguese Speech Corpus of Parliamentary Sessions
by: Teixeira, Francisco, et al.
Published: (2026)
A Unified Definition of Hallucination: It's The World Model, Stupid!
by: Liu, Emmy, et al.
Published: (2025)
Fairness Definitions in Language Models Explained
by: Yin, Zhipeng, et al.
Published: (2024)
TeenyTinyLlama: open-source tiny language models trained in Brazilian Portuguese
by: Corrêa, Nicholas Kluge, et al.
Published: (2024)
MultiLS: A Multi-task Lexical Simplification Framework
by: North, Kai, et al.
Published: (2024)
Multi-Agent Computer Use
by: Koh, Jing Yu, et al.
Published: (2026)
A Survey on Multilingual Mental Disorders Detection from Social Media Data
by: Bucur, Ana-Maria, et al.
Published: (2025)
LPI-RIT at LeWiDi-2025: Improving Distributional Predictions via Metadata and Loss Reweighting with DisCo
by: Sawkar, Mandira, et al.
Published: (2025)
Predictive Authoring for Brazilian Portuguese Augmentative and Alternative Communication
by: Pereira, Jayr, et al.
Published: (2023)
DFA-RAG: Conversational Semantic Router for Large Language Model with Definite Finite Automaton
by: Sun, Yiyou, et al.
Published: (2024)
WolBanking77: Wolof Banking Speech Intent Classification Dataset
by: Kandji, Abdou Karim, et al.
Published: (2025)
Training a Generally Curious Agent
by: Tajwar, Fahim, et al.
Published: (2025)