| Main Authors: | Volkova, Svitlana; Dupree, Will; Kao, Hsien-Te; Bautista, Peter; Ganberg, Gabe; Beaubien, Jeff; Cassani, Laura |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2511.21749 |
Similar Items
Building Resilient Information Ecosystems: Large LLM-Generated Dataset of Persuasion Attacks
by: Kao, Hsien-Te, et al.
Published: (2025)
Cross-Disciplinary Knowledge Retrieval and Synthesis: A Compound AI Architecture for Scientific Discovery
by: Volkova, Svitlana, et al.
Published: (2025)
Imperfectly Cooperative Human-AI Interactions: Comparing the Impacts of Human and AI Attributes in Simulated and User Studies
by: Cohen, Myke C., et al.
Published: (2026)
Exploring Big Five Personality and AI Capability Effects in LLM-Simulated Negotiation Dialogues
by: Cohen, Myke C., et al.
Published: (2025)
Towards Safer Online Spaces: Simulating and Assessing Intervention Strategies for Eating Disorder Discussions
by: Penafiel, Louis, et al.
Published: (2024)
Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals
by: Gerard, Patrick, et al.
Published: (2026)
Exploratory Models of Human-AI Teams: Leveraging Human Digital Twins to Investigate Trust Development
by: Nguyen, Daniel, et al.
Published: (2024)
Community-Aligned Behavior Under Uncertainty: Evidence of Epistemic Stance Transfer in LLMs
by: Gerard, Patrick, et al.
Published: (2025)
Stop Tracking Me! Proactive Defense Against Attribute Inference Attack in LLMs
by: Yan, Dong, et al.
Published: (2026)
‘Who Is Afraid of Fairenesse or Wanton Ladies Appearing in Their Barenesse?’: Laughing at Female Desire in Early Modern English Reception of the Myth of the Trojan War
by: Evgeniia Ganberg
Published: (2024)
Uncovering the Persuasive Fingerprint of LLMs in Jailbreaking Attacks
by: Noughabi, Havva Alizadeh, et al.
Published: (2025)
Iterative Prompting with Persuasion Skills in Jailbreaking Large Language Models
by: Ke, Shih-Wen, et al.
Published: (2025)
Redefining Proactivity for Information Seeking Dialogue
by: Lee, Jing Yang, et al.
Published: (2024)
MALicious INTent Dataset and Inoculating LLMs for Enhanced Disinformation Detection
by: Modzelewski, Arkadiusz, et al.
Published: (2026)
Can AI-Generated Persuasion Be Detected? Persuaficial Benchmark and AI vs. Human Linguistic Differences
by: Modzelewski, Arkadiusz, et al.
Published: (2026)
ValueScope: Unveiling Implicit Norms and Values via Return Potential Model of Social Interactions
by: Park, Chan Young, et al.
Published: (2024)
Measuring and Improving Persuasiveness of Large Language Models
by: Singh, Somesh, et al.
Published: (2024)
Evaluating OpenAI GPT Models for Translation of Endangered Uralic Languages: A Comparison of Reasoning and Non-Reasoning Architectures
by: Tereshchenko, Yehor, et al.
Published: (2025)
Commercial Persuasion in AI-Mediated Conversations
by: Salvi, Francesco, et al.
Published: (2026)
Measuring Opinion Bias and Sycophancy via LLM-based Persuasion
by: Nogueira, Rodrigo, et al.
Published: (2026)
LLM-Based Adversarial Persuasion Attacks on Fact-Checking Systems
by: Leite, João A., et al.
Published: (2026)
A Survey on Proactive Defense Strategies Against Misinformation in Large Language Models
by: Liu, Shuliang, et al.
Published: (2025)
Defending Against Social Engineering Attacks in the Age of LLMs
by: Ai, Lin, et al.
Published: (2024)
You Need Better Attention Priors
by: Litman, Elon, et al.
Published: (2026)
Reviving Any-Subset Autoregressive Models with Principled Parallel Sampling and Speculative Decoding
by: Guo, Gabe, et al.
Published: (2025)
The Levers of Political Persuasion with Conversational AI
by: Hackenburg, Kobi, et al.
Published: (2025)
Towards Detecting Persuasion on Social Media: From Model Development to Insights on Persuasion Strategies
by: Meguellati, Elyas, et al.
Published: (2025)
Improving QA Model Performance with Cartographic Inoculation
by: Chen, Allen, et al.
Published: (2024)
PVP: An Image Dataset for Personalized Visual Persuasion with Persuasion Strategies, Viewer Characteristics, and Persuasiveness Ratings
by: Kim, Junseo, et al.
Published: (2025)
AutoDefense: Multi-Agent LLM Defense against Jailbreak Attacks
by: Zeng, Yifan, et al.
Published: (2024)
Detecting Winning Arguments with Large Language Models and Persuasion Strategies
by: Labruna, Tiziano, et al.
Published: (2026)
Measuring and Benchmarking Large Language Models' Capabilities to Generate Persuasive Language
by: Pauli, Amalie Brogaard, et al.
Published: (2024)
Adversarial Attacks and Defense for Conversation Entailment Task
by: Yang, Zhenning, et al.
Published: (2024)
Teaching Models to Balance Resisting and Accepting Persuasion
by: Stengel-Eskin, Elias, et al.
Published: (2024)
Verification Required: The Impact of Information Credibility on AI Persuasion
by: Mahmud, Saaduddin, et al.
Published: (2026)
AI for Service: Proactive Assistance with AI Glasses
by: Wen, Zichen, et al.
Published: (2025)
PersuasiveToM: A Benchmark for Evaluating Machine Theory of Mind in Persuasive Dialogues
by: Yu, Fangxu, et al.
Published: (2025)
UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models
by: Lin, Huawei, et al.
Published: (2025)
Defense Against Syntactic Textual Backdoor Attacks with Token Substitution
by: Li, Xinglin, et al.
Published: (2024)
The Best Defense is Attack: Repairing Semantics in Textual Adversarial Examples
by: Yang, Heng, et al.
Published: (2023)