:: Library Catalog

Bibliographic Details
Main Author:	Carro, María Victoria
Format:	Preprint
Published:	2024
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2412.02802

Similar Items

Large Language Models can Strategically Deceive their Users when Put Under Pressure
by: Scheurer, Jérémy, et al.
Published: (2023)

When Large Language Models contradict humans? Large Language Models' Sycophantic Behaviour
by: Ranaldi, Leonardo, et al.
Published: (2023)

Sycophantic Anchors: Localizing and Quantifying User Agreement in Reasoning Models
by: Duszenko, Jacek
Published: (2026)

Deceive, Detect, and Disclose: Large Language Models Play Mini-Mafia
by: Costa, Davi Bastos, et al.
Published: (2025)

Complacent, Not Sycophantic: Reframing Large Language Models and Designing AI Literacy for Complacent Machines
by: Germani, Federico, et al.
Published: (2026)

Are UFOs Driving Innovation? The Illusion of Causality in Large Language Models
by: Carro, María Victoria, et al.
Published: (2024)

Human Attribution of Causality to AI Across Agency, Misuse, and Misalignment
by: Carro, Maria Victoria, et al.
Published: (2026)

TRAPDOC: Deceiving LLM Users by Injecting Imperceptible Phantom Tokens into Documents
by: Jin, Hyundong, et al.
Published: (2025)

Are Your Agents Upward Deceivers?
by: Guo, Dadi, et al.
Published: (2025)

Seeing is Deceiving: Exploitation of Visual Pathways in Multi-Modal Language Models
by: Janowczyk, Pete, et al.
Published: (2024)

HoneyTrap: Deceiving Large Language Model Attackers to Honeypot Traps with Resilient Multi-Agent Defense
by: Li, Siyuan, et al.
Published: (2026)

"I'm Not Sure, But...": Examining the Impact of Large Language Models' Uncertainty Expression on User Reliance and Trust
by: Kim, Sunnie S. Y., et al.
Published: (2024)

Do Large Language Models Show Biases in Causal Learning? Insights from Contingency Judgment
by: Carro, María Victoria, et al.
Published: (2025)

Do Large Language Models Show Biases in Causal Learning?
by: Carro, Maria Victoria, et al.
Published: (2024)

Sycophantic AI Decreases Prosocial Intentions and Promotes Dependence
by: Cheng, Myra, et al.
Published: (2025)

User Behavior Simulation with Large Language Model based Agents
by: Wang, Lei, et al.
Published: (2023)

LIBER: Lifelong User Behavior Modeling Based on Large Language Models
by: Zhu, Chenxu, et al.
Published: (2024)

A Rational Analysis of the Effects of Sycophantic AI
by: Batista, Rafael M., et al.
Published: (2026)

Can Large Language Model Agents Simulate Human Trust Behavior?
by: Xie, Chengxing, et al.
Published: (2024)

Acting Flatterers via LLMs Sycophancy: Combating Clickbait with LLMs Opposing-Stance Reasoning
by: Zhang, Chaowei, et al.
Published: (2026)

Large Language Models and User Trust: Consequence of Self-Referential Learning Loop and the Deskilling of Healthcare Professionals
by: Choudhury, Avishek, et al.
Published: (2024)

Hierarchical Tree Search-based User Lifelong Behavior Modeling on Large Language Model
by: Xia, Yu, et al.
Published: (2025)

FVA-RAG: Falsification-Verification Alignment for Mitigating Sycophantic Hallucinations
by: Ravishankara, Mayank
Published: (2025)

Exploring Trust Calibration in XAI - The Impact of Exposing Model Limitations to Lay Users
by: Ventura, Alfio, et al.
Published: (2026)

Sycophantic Chatbots Cause Delusional Spiraling, Even in Ideal Bayesians
by: Chandra, Kartik, et al.
Published: (2026)

SimUSER: Simulating User Behavior with Large Language Models for Recommender System Evaluation
by: Bougie, Nicolas, et al.
Published: (2025)

To Trust or Not to Trust? Enhancing Large Language Models' Situated Faithfulness to External Contexts
by: Huang, Yukun, et al.
Published: (2024)

Hide or Highlight: Understanding the Impact of Factuality Expression on User Trust
by: Do, Hyo Jin, et al.
Published: (2025)

Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization
by: Yang, Wenkai, et al.
Published: (2024)

User Privacy and Large Language Models: An Analysis of Frontier Developers' Privacy Policies
by: King, Jennifer, et al.
Published: (2025)

LUMOS: Large User MOdels for User Behavior Prediction
by: Nigam, Dhruv, et al.
Published: (2025)

Reasoning and the Trusting Behavior of DeepSeek and GPT: An Experiment Revealing Hidden Fault Lines in Large Language Models
by: Li, Rubing, et al.
Published: (2025)

Deceiving Question-Answering Models: A Hybrid Word-Level Adversarial Approach
by: Li, Jiyao, et al.
Published: (2024)

Objective Decoupling in Social Reinforcement Learning: Recovering Ground Truth from Sycophantic Majorities
by: Ghasemi, Majid, et al.
Published: (2026)

Trust-Oriented Adaptive Guardrails for Large Language Models
by: Hu, Jinwei, et al.
Published: (2024)

Synthetic User Behavior Sequence Generation with Large Language Models for Smart Homes
by: Xu, Zhiyao, et al.
Published: (2025)

Gaslight, Gatekeep, V1-V3: Early Visual Cortex Alignment Shields Vision-Language Models from Sycophantic Manipulation
by: Shah, Arya, et al.
Published: (2026)

Adversarial Magnification to Deceive Deepfake Detection through Super Resolution
by: Coccomini, Davide Alessandro, et al.
Published: (2024)

Simulating Students or Sycophantic Problem Solving? On Misconception Faithfulness of LLM Simulators
by: Do, Heejin, et al.
Published: (2026)

Anthropomorphism and Trust in Human-Large Language Model interactions
by: Kadambi, Akila, et al.
Published: (2026)