Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Jiang, Aiqi, Zubiaga, Arkaitz
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2401.09244
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911558866567168
author	Jiang, Aiqi Zubiaga, Arkaitz
author_facet	Jiang, Aiqi Zubiaga, Arkaitz
contents	The growing prevalence and rapid evolution of offensive language in social media amplify the complexities of detection, particularly highlighting the challenges in identifying such content across diverse languages. This survey presents a systematic and comprehensive exploration of Cross-Lingual Transfer Learning (CLTL) techniques in offensive language detection in social media. Our study stands as the first holistic overview to focus exclusively on the cross-lingual scenario in this domain. We analyse 67 relevant papers and categorise these studies across various dimensions, including the characteristics of multilingual datasets used, the cross-lingual resources employed, and the specific CLTL strategies implemented. According to "what to transfer", we also summarise three main CLTL transfer approaches: instance, feature, and parameter transfer. Additionally, we shed light on the current challenges and future research opportunities in this field. Furthermore, we have made our survey resources available online, including two comprehensive tables that provide accessible references to the multilingual datasets and CLTL methods used in the reviewed literature.
format	Preprint
id	arxiv_https___arxiv_org_abs_2401_09244
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Cross-lingual Offensive Language Detection: A Systematic Review of Datasets, Transfer Approaches and Challenges Jiang, Aiqi Zubiaga, Arkaitz Computation and Language The growing prevalence and rapid evolution of offensive language in social media amplify the complexities of detection, particularly highlighting the challenges in identifying such content across diverse languages. This survey presents a systematic and comprehensive exploration of Cross-Lingual Transfer Learning (CLTL) techniques in offensive language detection in social media. Our study stands as the first holistic overview to focus exclusively on the cross-lingual scenario in this domain. We analyse 67 relevant papers and categorise these studies across various dimensions, including the characteristics of multilingual datasets used, the cross-lingual resources employed, and the specific CLTL strategies implemented. According to "what to transfer", we also summarise three main CLTL transfer approaches: instance, feature, and parameter transfer. Additionally, we shed light on the current challenges and future research opportunities in this field. Furthermore, we have made our survey resources available online, including two comprehensive tables that provide accessible references to the multilingual datasets and CLTL methods used in the reviewed literature.
title	Cross-lingual Offensive Language Detection: A Systematic Review of Datasets, Transfer Approaches and Challenges
topic	Computation and Language
url	https://arxiv.org/abs/2401.09244

Similar Items