Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Kumar, Pankaj, Mishra, Subhankar
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence Machine Learning 68T07, 68T05, 62C10, 90C31 I.2.6; I.2.7; B.8.1; H.3.3
Online Access:	https://arxiv.org/abs/2505.18658
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866915601062035456
author	Kumar, Pankaj Mishra, Subhankar
author_facet	Kumar, Pankaj Mishra, Subhankar
contents	Large Language Models (LLMs) have emerged as a promising cornerstone for the development of natural language processing (NLP) and artificial intelligence (AI). However, ensuring the robustness of LLMs remains a critical challenge. To address these challenges and advance the field, this survey provides a comprehensive overview of current studies in this area. First, we systematically examine the nature of robustness in LLMs, including its conceptual foundations, the importance of consistent performance across diverse inputs, and the implications of failure modes in real-world applications. Next, we analyze the sources of non-robustness, categorizing intrinsic model limitations, data-driven vulnerabilities, and external adversarial factors that compromise reliability. Following this, we review state-of-the-art mitigation strategies, and then we discuss widely adopted benchmarks, emerging metrics, and persistent gaps in assessing real-world reliability. Finally, we synthesize findings from existing surveys and interdisciplinary studies to highlight trends, unresolved issues, and pathways for future research.
format	Preprint
id	arxiv_https___arxiv_org_abs_2505_18658
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Robustness in Large Language Models: A Survey of Mitigation Strategies and Evaluation Metrics Kumar, Pankaj Mishra, Subhankar Computation and Language Artificial Intelligence Machine Learning 68T07, 68T05, 62C10, 90C31 I.2.6; I.2.7; B.8.1; H.3.3 Large Language Models (LLMs) have emerged as a promising cornerstone for the development of natural language processing (NLP) and artificial intelligence (AI). However, ensuring the robustness of LLMs remains a critical challenge. To address these challenges and advance the field, this survey provides a comprehensive overview of current studies in this area. First, we systematically examine the nature of robustness in LLMs, including its conceptual foundations, the importance of consistent performance across diverse inputs, and the implications of failure modes in real-world applications. Next, we analyze the sources of non-robustness, categorizing intrinsic model limitations, data-driven vulnerabilities, and external adversarial factors that compromise reliability. Following this, we review state-of-the-art mitigation strategies, and then we discuss widely adopted benchmarks, emerging metrics, and persistent gaps in assessing real-world reliability. Finally, we synthesize findings from existing surveys and interdisciplinary studies to highlight trends, unresolved issues, and pathways for future research.
title	Robustness in Large Language Models: A Survey of Mitigation Strategies and Evaluation Metrics
topic	Computation and Language Artificial Intelligence Machine Learning 68T07, 68T05, 62C10, 90C31 I.2.6; I.2.7; B.8.1; H.3.3
url	https://arxiv.org/abs/2505.18658

Similar Items