Saved in:
Bibliographic Details
Main Authors: Kumar, Pankaj, Mishra, Subhankar
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2505.18658
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915601062035456
author Kumar, Pankaj
Mishra, Subhankar
author_facet Kumar, Pankaj
Mishra, Subhankar
contents Large Language Models (LLMs) have emerged as a promising cornerstone for the development of natural language processing (NLP) and artificial intelligence (AI). However, ensuring the robustness of LLMs remains a critical challenge. To address these challenges and advance the field, this survey provides a comprehensive overview of current studies in this area. First, we systematically examine the nature of robustness in LLMs, including its conceptual foundations, the importance of consistent performance across diverse inputs, and the implications of failure modes in real-world applications. Next, we analyze the sources of non-robustness, categorizing intrinsic model limitations, data-driven vulnerabilities, and external adversarial factors that compromise reliability. Following this, we review state-of-the-art mitigation strategies, and then we discuss widely adopted benchmarks, emerging metrics, and persistent gaps in assessing real-world reliability. Finally, we synthesize findings from existing surveys and interdisciplinary studies to highlight trends, unresolved issues, and pathways for future research.
format Preprint
id arxiv_https___arxiv_org_abs_2505_18658
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Robustness in Large Language Models: A Survey of Mitigation Strategies and Evaluation Metrics
Kumar, Pankaj
Mishra, Subhankar
Computation and Language
Artificial Intelligence
Machine Learning
68T07, 68T05, 62C10, 90C31
I.2.6; I.2.7; B.8.1; H.3.3
Large Language Models (LLMs) have emerged as a promising cornerstone for the development of natural language processing (NLP) and artificial intelligence (AI). However, ensuring the robustness of LLMs remains a critical challenge. To address these challenges and advance the field, this survey provides a comprehensive overview of current studies in this area. First, we systematically examine the nature of robustness in LLMs, including its conceptual foundations, the importance of consistent performance across diverse inputs, and the implications of failure modes in real-world applications. Next, we analyze the sources of non-robustness, categorizing intrinsic model limitations, data-driven vulnerabilities, and external adversarial factors that compromise reliability. Following this, we review state-of-the-art mitigation strategies, and then we discuss widely adopted benchmarks, emerging metrics, and persistent gaps in assessing real-world reliability. Finally, we synthesize findings from existing surveys and interdisciplinary studies to highlight trends, unresolved issues, and pathways for future research.
title Robustness in Large Language Models: A Survey of Mitigation Strategies and Evaluation Metrics
topic Computation and Language
Artificial Intelligence
Machine Learning
68T07, 68T05, 62C10, 90C31
I.2.6; I.2.7; B.8.1; H.3.3
url https://arxiv.org/abs/2505.18658