Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Ye, Meryl, Ibrahim, Lujain, Bo, Jessica Y., Cheng, Myra, Mattsson, Ida, Vennemeyer, Daniel, Kraut, Robert, Rathje, Steve
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2605.21778
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866914585140789248
author	Ye, Meryl Ibrahim, Lujain Bo, Jessica Y. Cheng, Myra Mattsson, Ida Vennemeyer, Daniel Kraut, Robert Rathje, Steve
author_facet	Ye, Meryl Ibrahim, Lujain Bo, Jessica Y. Cheng, Myra Mattsson, Ida Vennemeyer, Daniel Kraut, Robert Rathje, Steve
contents	AI sycophancy has become a prominent concern in large language model (LLM) research. Yet the term lacks a consistent definition and has been applied to behaviors ranging from agreeing with a user's false claim to excessively praising the user to withholding corrective feedback. When researchers, companies, and policymakers use the same term to describe different behaviors, evaluation results become difficult to compare, mitigation strategies fail to transfer, and systems that are resistant to one form of sycophancy continue exhibiting other forms. To address this, we make two contributions. First, we reviewed 70 papers on AI sycophancy to develop a taxonomy of how the behavior has been defined and measured. The taxonomy distinguishes (1) whether a model is sycophantic toward a user's positions and beliefs, or toward the user's broader personal traits and emotions, and (2) whether this occurs through explicit, direct language or more implicit, subtle behaviors such as framing, omission, or tone. Mapping existing literature to our taxonomy reveals that current research has focused on overt forms of sycophancy toward users' beliefs, leaving more subtle and person-directed behaviors relatively understudied. Second, we surveyed 106 experts in AI sycophancy and related fields to examine whether researchers agree on which model behaviors are sycophantic. While experts are nearly unanimous in believing that sycophancy is a significant problem in current AI systems (94.3% agree), they disagree substantially on which specific behaviors qualify. Together, these findings demonstrate that AI sycophancy is a broad family of behaviors with different measurement challenges, intervention requirements, and governance implications. Our taxonomy provides a shared vocabulary for understanding and addressing these behaviors.
format	Preprint
id	arxiv_https___arxiv_org_abs_2605_21778
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	What Counts as AI Sycophancy? A Taxonomy and Expert Survey of a Fragmented Construct Ye, Meryl Ibrahim, Lujain Bo, Jessica Y. Cheng, Myra Mattsson, Ida Vennemeyer, Daniel Kraut, Robert Rathje, Steve Artificial Intelligence AI sycophancy has become a prominent concern in large language model (LLM) research. Yet the term lacks a consistent definition and has been applied to behaviors ranging from agreeing with a user's false claim to excessively praising the user to withholding corrective feedback. When researchers, companies, and policymakers use the same term to describe different behaviors, evaluation results become difficult to compare, mitigation strategies fail to transfer, and systems that are resistant to one form of sycophancy continue exhibiting other forms. To address this, we make two contributions. First, we reviewed 70 papers on AI sycophancy to develop a taxonomy of how the behavior has been defined and measured. The taxonomy distinguishes (1) whether a model is sycophantic toward a user's positions and beliefs, or toward the user's broader personal traits and emotions, and (2) whether this occurs through explicit, direct language or more implicit, subtle behaviors such as framing, omission, or tone. Mapping existing literature to our taxonomy reveals that current research has focused on overt forms of sycophancy toward users' beliefs, leaving more subtle and person-directed behaviors relatively understudied. Second, we surveyed 106 experts in AI sycophancy and related fields to examine whether researchers agree on which model behaviors are sycophantic. While experts are nearly unanimous in believing that sycophancy is a significant problem in current AI systems (94.3% agree), they disagree substantially on which specific behaviors qualify. Together, these findings demonstrate that AI sycophancy is a broad family of behaviors with different measurement challenges, intervention requirements, and governance implications. Our taxonomy provides a shared vocabulary for understanding and addressing these behaviors.
title	What Counts as AI Sycophancy? A Taxonomy and Expert Survey of a Fragmented Construct
topic	Artificial Intelligence
url	https://arxiv.org/abs/2605.21778

Similar Items