Saved in:
| Main Author: | Carro, María Victoria |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2412.02802 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Large Language Models can Strategically Deceive their Users when Put Under Pressure
by: Scheurer, Jérémy, et al.
Published: (2023)
by: Scheurer, Jérémy, et al.
Published: (2023)
When Large Language Models contradict humans? Large Language Models' Sycophantic Behaviour
by: Ranaldi, Leonardo, et al.
Published: (2023)
by: Ranaldi, Leonardo, et al.
Published: (2023)
Sycophantic Anchors: Localizing and Quantifying User Agreement in Reasoning Models
by: Duszenko, Jacek
Published: (2026)
by: Duszenko, Jacek
Published: (2026)
Deceive, Detect, and Disclose: Large Language Models Play Mini-Mafia
by: Costa, Davi Bastos, et al.
Published: (2025)
by: Costa, Davi Bastos, et al.
Published: (2025)
Complacent, Not Sycophantic: Reframing Large Language Models and Designing AI Literacy for Complacent Machines
by: Germani, Federico, et al.
Published: (2026)
by: Germani, Federico, et al.
Published: (2026)
Are UFOs Driving Innovation? The Illusion of Causality in Large Language Models
by: Carro, María Victoria, et al.
Published: (2024)
by: Carro, María Victoria, et al.
Published: (2024)
Human Attribution of Causality to AI Across Agency, Misuse, and Misalignment
by: Carro, Maria Victoria, et al.
Published: (2026)
by: Carro, Maria Victoria, et al.
Published: (2026)
TRAPDOC: Deceiving LLM Users by Injecting Imperceptible Phantom Tokens into Documents
by: Jin, Hyundong, et al.
Published: (2025)
by: Jin, Hyundong, et al.
Published: (2025)
Are Your Agents Upward Deceivers?
by: Guo, Dadi, et al.
Published: (2025)
by: Guo, Dadi, et al.
Published: (2025)
Seeing is Deceiving: Exploitation of Visual Pathways in Multi-Modal Language Models
by: Janowczyk, Pete, et al.
Published: (2024)
by: Janowczyk, Pete, et al.
Published: (2024)
HoneyTrap: Deceiving Large Language Model Attackers to Honeypot Traps with Resilient Multi-Agent Defense
by: Li, Siyuan, et al.
Published: (2026)
by: Li, Siyuan, et al.
Published: (2026)
"I'm Not Sure, But...": Examining the Impact of Large Language Models' Uncertainty Expression on User Reliance and Trust
by: Kim, Sunnie S. Y., et al.
Published: (2024)
by: Kim, Sunnie S. Y., et al.
Published: (2024)
Do Large Language Models Show Biases in Causal Learning? Insights from Contingency Judgment
by: Carro, María Victoria, et al.
Published: (2025)
by: Carro, María Victoria, et al.
Published: (2025)
Do Large Language Models Show Biases in Causal Learning?
by: Carro, Maria Victoria, et al.
Published: (2024)
by: Carro, Maria Victoria, et al.
Published: (2024)
Sycophantic AI Decreases Prosocial Intentions and Promotes Dependence
by: Cheng, Myra, et al.
Published: (2025)
by: Cheng, Myra, et al.
Published: (2025)
User Behavior Simulation with Large Language Model based Agents
by: Wang, Lei, et al.
Published: (2023)
by: Wang, Lei, et al.
Published: (2023)
LIBER: Lifelong User Behavior Modeling Based on Large Language Models
by: Zhu, Chenxu, et al.
Published: (2024)
by: Zhu, Chenxu, et al.
Published: (2024)
A Rational Analysis of the Effects of Sycophantic AI
by: Batista, Rafael M., et al.
Published: (2026)
by: Batista, Rafael M., et al.
Published: (2026)
Can Large Language Model Agents Simulate Human Trust Behavior?
by: Xie, Chengxing, et al.
Published: (2024)
by: Xie, Chengxing, et al.
Published: (2024)
Acting Flatterers via LLMs Sycophancy: Combating Clickbait with LLMs Opposing-Stance Reasoning
by: Zhang, Chaowei, et al.
Published: (2026)
by: Zhang, Chaowei, et al.
Published: (2026)
Large Language Models and User Trust: Consequence of Self-Referential Learning Loop and the Deskilling of Healthcare Professionals
by: Choudhury, Avishek, et al.
Published: (2024)
by: Choudhury, Avishek, et al.
Published: (2024)
Hierarchical Tree Search-based User Lifelong Behavior Modeling on Large Language Model
by: Xia, Yu, et al.
Published: (2025)
by: Xia, Yu, et al.
Published: (2025)
FVA-RAG: Falsification-Verification Alignment for Mitigating Sycophantic Hallucinations
by: Ravishankara, Mayank
Published: (2025)
by: Ravishankara, Mayank
Published: (2025)
Exploring Trust Calibration in XAI - The Impact of Exposing Model Limitations to Lay Users
by: Ventura, Alfio, et al.
Published: (2026)
by: Ventura, Alfio, et al.
Published: (2026)
Sycophantic Chatbots Cause Delusional Spiraling, Even in Ideal Bayesians
by: Chandra, Kartik, et al.
Published: (2026)
by: Chandra, Kartik, et al.
Published: (2026)
SimUSER: Simulating User Behavior with Large Language Models for Recommender System Evaluation
by: Bougie, Nicolas, et al.
Published: (2025)
by: Bougie, Nicolas, et al.
Published: (2025)
To Trust or Not to Trust? Enhancing Large Language Models' Situated Faithfulness to External Contexts
by: Huang, Yukun, et al.
Published: (2024)
by: Huang, Yukun, et al.
Published: (2024)
Hide or Highlight: Understanding the Impact of Factuality Expression on User Trust
by: Do, Hyo Jin, et al.
Published: (2025)
by: Do, Hyo Jin, et al.
Published: (2025)
Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization
by: Yang, Wenkai, et al.
Published: (2024)
by: Yang, Wenkai, et al.
Published: (2024)
User Privacy and Large Language Models: An Analysis of Frontier Developers' Privacy Policies
by: King, Jennifer, et al.
Published: (2025)
by: King, Jennifer, et al.
Published: (2025)
LUMOS: Large User MOdels for User Behavior Prediction
by: Nigam, Dhruv, et al.
Published: (2025)
by: Nigam, Dhruv, et al.
Published: (2025)
Reasoning and the Trusting Behavior of DeepSeek and GPT: An Experiment Revealing Hidden Fault Lines in Large Language Models
by: Li, Rubing, et al.
Published: (2025)
by: Li, Rubing, et al.
Published: (2025)
Deceiving Question-Answering Models: A Hybrid Word-Level Adversarial Approach
by: Li, Jiyao, et al.
Published: (2024)
by: Li, Jiyao, et al.
Published: (2024)
Objective Decoupling in Social Reinforcement Learning: Recovering Ground Truth from Sycophantic Majorities
by: Ghasemi, Majid, et al.
Published: (2026)
by: Ghasemi, Majid, et al.
Published: (2026)
Trust-Oriented Adaptive Guardrails for Large Language Models
by: Hu, Jinwei, et al.
Published: (2024)
by: Hu, Jinwei, et al.
Published: (2024)
Synthetic User Behavior Sequence Generation with Large Language Models for Smart Homes
by: Xu, Zhiyao, et al.
Published: (2025)
by: Xu, Zhiyao, et al.
Published: (2025)
Gaslight, Gatekeep, V1-V3: Early Visual Cortex Alignment Shields Vision-Language Models from Sycophantic Manipulation
by: Shah, Arya, et al.
Published: (2026)
by: Shah, Arya, et al.
Published: (2026)
Adversarial Magnification to Deceive Deepfake Detection through Super Resolution
by: Coccomini, Davide Alessandro, et al.
Published: (2024)
by: Coccomini, Davide Alessandro, et al.
Published: (2024)
Simulating Students or Sycophantic Problem Solving? On Misconception Faithfulness of LLM Simulators
by: Do, Heejin, et al.
Published: (2026)
by: Do, Heejin, et al.
Published: (2026)
Anthropomorphism and Trust in Human-Large Language Model interactions
by: Kadambi, Akila, et al.
Published: (2026)
by: Kadambi, Akila, et al.
Published: (2026)
Similar Items
-
Large Language Models can Strategically Deceive their Users when Put Under Pressure
by: Scheurer, Jérémy, et al.
Published: (2023) -
When Large Language Models contradict humans? Large Language Models' Sycophantic Behaviour
by: Ranaldi, Leonardo, et al.
Published: (2023) -
Sycophantic Anchors: Localizing and Quantifying User Agreement in Reasoning Models
by: Duszenko, Jacek
Published: (2026) -
Deceive, Detect, and Disclose: Large Language Models Play Mini-Mafia
by: Costa, Davi Bastos, et al.
Published: (2025) -
Complacent, Not Sycophantic: Reframing Large Language Models and Designing AI Literacy for Complacent Machines
by: Germani, Federico, et al.
Published: (2026)