Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Chakraborty, Joymallya, Xia, Wei, Majumder, Anirban, Ma, Dan, Chaabene, Walid, Janvekar, Naveed
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2409.06072
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866929492951302144
author	Chakraborty, Joymallya Xia, Wei Majumder, Anirban Ma, Dan Chaabene, Walid Janvekar, Naveed
author_facet	Chakraborty, Joymallya Xia, Wei Majumder, Anirban Ma, Dan Chaabene, Walid Janvekar, Naveed
contents	Large language models (LLMs) have demonstrated remarkable capabilities in natural language processing tasks. However, their practical application in high-stake domains, such as fraud and abuse detection, remains an area that requires further exploration. The existing applications often narrowly focus on specific tasks like toxicity or hate speech detection. In this paper, we present a comprehensive benchmark suite designed to assess the performance of LLMs in identifying and mitigating fraudulent and abusive language across various real-world scenarios. Our benchmark encompasses a diverse set of tasks, including detecting spam emails, hate speech, misogynistic language, and more. We evaluated several state-of-the-art LLMs, including models from Anthropic, Mistral AI, and the AI21 family, to provide a comprehensive assessment of their capabilities in this critical domain. The results indicate that while LLMs exhibit proficient baseline performance in individual fraud and abuse detection tasks, their performance varies considerably across tasks, particularly struggling with tasks that demand nuanced pragmatic reasoning, such as identifying diverse forms of misogynistic language. These findings have important implications for the responsible development and deployment of LLMs in high-risk applications. Our benchmark suite can serve as a tool for researchers and practitioners to systematically evaluate LLMs for multi-task fraud detection and drive the creation of more robust, trustworthy, and ethically-aligned systems for fraud and abuse detection.
format	Preprint
id	arxiv_https___arxiv_org_abs_2409_06072
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	DetoxBench: Benchmarking Large Language Models for Multitask Fraud & Abuse Detection Chakraborty, Joymallya Xia, Wei Majumder, Anirban Ma, Dan Chaabene, Walid Janvekar, Naveed Computation and Language Machine Learning Large language models (LLMs) have demonstrated remarkable capabilities in natural language processing tasks. However, their practical application in high-stake domains, such as fraud and abuse detection, remains an area that requires further exploration. The existing applications often narrowly focus on specific tasks like toxicity or hate speech detection. In this paper, we present a comprehensive benchmark suite designed to assess the performance of LLMs in identifying and mitigating fraudulent and abusive language across various real-world scenarios. Our benchmark encompasses a diverse set of tasks, including detecting spam emails, hate speech, misogynistic language, and more. We evaluated several state-of-the-art LLMs, including models from Anthropic, Mistral AI, and the AI21 family, to provide a comprehensive assessment of their capabilities in this critical domain. The results indicate that while LLMs exhibit proficient baseline performance in individual fraud and abuse detection tasks, their performance varies considerably across tasks, particularly struggling with tasks that demand nuanced pragmatic reasoning, such as identifying diverse forms of misogynistic language. These findings have important implications for the responsible development and deployment of LLMs in high-risk applications. Our benchmark suite can serve as a tool for researchers and practitioners to systematically evaluate LLMs for multi-task fraud detection and drive the creation of more robust, trustworthy, and ethically-aligned systems for fraud and abuse detection.
title	DetoxBench: Benchmarking Large Language Models for Multitask Fraud & Abuse Detection
topic	Computation and Language Machine Learning
url	https://arxiv.org/abs/2409.06072

Similar Items