:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Volkova, Svitlana, Dupree, Will, Kao, Hsien-Te, Bautista, Peter, Ganberg, Gabe, Beaubien, Jeff, Cassani, Laura
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2511.21749
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Building Resilient Information Ecosystems: Large LLM-Generated Dataset of Persuasion Attacks
by: Kao, Hsien-Te, et al.
Published: (2025)

Cross-Disciplinary Knowledge Retrieval and Synthesis: A Compound AI Architecture for Scientific Discovery
by: Volkova, Svitlana, et al.
Published: (2025)

Imperfectly Cooperative Human-AI Interactions: Comparing the Impacts of Human and AI Attributes in Simulated and User Studies
by: Cohen, Myke C., et al.
Published: (2026)

Exploring Big Five Personality and AI Capability Effects in LLM-Simulated Negotiation Dialogues
by: Cohen, Myke C., et al.
Published: (2025)

Towards Safer Online Spaces: Simulating and Assessing Intervention Strategies for Eating Disorder Discussions
by: Penafiel, Louis, et al.
Published: (2024)

Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals
by: Gerard, Patrick, et al.
Published: (2026)

Exploratory Models of Human-AI Teams: Leveraging Human Digital Twins to Investigate Trust Development
by: Nguyen, Daniel, et al.
Published: (2024)

Community-Aligned Behavior Under Uncertainty: Evidence of Epistemic Stance Transfer in LLMs
by: Gerard, Patrick, et al.
Published: (2025)

Stop Tracking Me! Proactive Defense Against Attribute Inference Attack in LLMs
by: Yan, Dong, et al.
Published: (2026)

‘Who Is Afraid of Fairenesse or Wanton Ladies Appearing in Their Barenesse?’: Laughing at Female Desire in Early Modern English Reception of the Myth of the Trojan War☆
by: Evgeniia Ganberg
Published: (2024)

Uncovering the Persuasive Fingerprint of LLMs in Jailbreaking Attacks
by: Noughabi, Havva Alizadeh, et al.
Published: (2025)

Iterative Prompting with Persuasion Skills in Jailbreaking Large Language Models
by: Ke, Shih-Wen, et al.
Published: (2025)

Redefining Proactivity for Information Seeking Dialogue
by: Lee, Jing Yang, et al.
Published: (2024)

MALicious INTent Dataset and Inoculating LLMs for Enhanced Disinformation Detection
by: Modzelewski, Arkadiusz, et al.
Published: (2026)

Can AI-Generated Persuasion Be Detected? Persuaficial Benchmark and AI vs. Human Linguistic Differences
by: Modzelewski, Arkadiusz, et al.
Published: (2026)

ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions
by: Park, Chan Young, et al.
Published: (2024)

Measuring and Improving Persuasiveness of Large Language Models
by: Singh, Somesh, et al.
Published: (2024)

Evaluating OpenAI GPT Models for Translation of Endangered Uralic Languages: A Comparison of Reasoning and Non-Reasoning Architectures
by: Tereshchenko, Yehor, et al.
Published: (2025)

Commercial Persuasion in AI-Mediated Conversations
by: Salvi, Francesco, et al.
Published: (2026)

Measuring Opinion Bias and Sycophancy via LLM-based Persuasion
by: Nogueira, Rodrigo, et al.
Published: (2026)

LLM-Based Adversarial Persuasion Attacks on Fact-Checking Systems
by: Leite, João A., et al.
Published: (2026)

A Survey on Proactive Defense Strategies Against Misinformation in Large Language Models
by: Liu, Shuliang, et al.
Published: (2025)

Defending Against Social Engineering Attacks in the Age of LLMs
by: Ai, Lin, et al.
Published: (2024)

You Need Better Attention Priors
by: Litman, Elon, et al.
Published: (2026)

Reviving Any-Subset Autoregressive Models with Principled Parallel Sampling and Speculative Decoding
by: Guo, Gabe, et al.
Published: (2025)

The Levers of Political Persuasion with Conversational AI
by: Hackenburg, Kobi, et al.
Published: (2025)

Towards Detecting Persuasion on Social Media: From Model Development to Insights on Persuasion Strategies
by: Meguellati, Elyas, et al.
Published: (2025)

Improving QA Model Performance with Cartographic Inoculation
by: Chen, Allen, et al.
Published: (2024)

PVP: An Image Dataset for Personalized Visual Persuasion with Persuasion Strategies, Viewer Characteristics, and Persuasiveness Ratings
by: Kim, Junseo, et al.
Published: (2025)

AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks
by: Zeng, Yifan, et al.
Published: (2024)

Detecting Winning Arguments with Large Language Models and Persuasion Strategies
by: Labruna, Tiziano, et al.
Published: (2026)

Measuring and Benchmarking Large Language Models' Capabilities to Generate Persuasive Language
by: Pauli, Amalie Brogaard, et al.
Published: (2024)

Adversarial Attacks and Defense for Conversation Entailment Task
by: Yang, Zhenning, et al.
Published: (2024)

Teaching Models to Balance Resisting and Accepting Persuasion
by: Stengel-Eskin, Elias, et al.
Published: (2024)

Verification Required: The Impact of Information Credibility on AI Persuasion
by: Mahmud, Saaduddin, et al.
Published: (2026)

AI for Service: Proactive Assistance with AI Glasses
by: Wen, Zichen, et al.
Published: (2025)

PersuasiveToM: A Benchmark for Evaluating Machine Theory of Mind in Persuasive Dialogues
by: Yu, Fangxu, et al.
Published: (2025)

UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models
by: Lin, Huawei, et al.
Published: (2025)

Defense Against Syntactic Textual Backdoor Attacks with Token Substitution
by: Li, Xinglin, et al.
Published: (2024)

The Best Defense is Attack: Repairing Semantics in Textual Adversarial Examples
by: Yang, Heng, et al.
Published: (2023)