:: Library Catalog

Imagen de Portada

Guardado en:

Detalles Bibliográficos
Autores principales:	Timm, Jasper, Talele, Chetan, Haimes, Jacob
Formato:	Preprint
Publicado:	2025
Materias:	Computation and Language
Acceso en línea:	https://arxiv.org/abs/2501.17273
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

Ejemplares similares

MEQA: A Meta-Evaluation Framework for Question & Answer LLM Benchmarks
por: Veuthey, Jaime Raldua, et al.
Publicado: (2025)

View From Above: A Framework for Evaluating Distribution Shifts in Model Behavior
por: Chopra, Tanush, et al.
Publicado: (2024)

LLM-Generated Ads: From Personalization Parity to Persuasion Superiority
por: Meguellati, Elyas, et al.
Publicado: (2025)

Using LLM-as-a-Judge/Jury to Advance Scalable, Clinically-Validated Safety Evaluations of Model Responses to Users Demonstrating Psychosis
por: Reese, May Lynn, et al.
Publicado: (2026)

HumanAgencyBench: Scalable Evaluation of Human Agency Support in AI Assistants
por: Sturgeon, Benjamin, et al.
Publicado: (2025)

Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts
por: Haimes, Jacob, et al.
Publicado: (2024)

Debating with More Persuasive LLMs Leads to More Truthful Answers
por: Khan, Akbir, et al.
Publicado: (2024)

PVP: An Image Dataset for Personalized Visual Persuasion with Persuasion Strategies, Viewer Characteristics, and Persuasiveness Ratings
por: Kim, Junseo, et al.
Publicado: (2025)

Truth with a Twist: The Rhetoric of Persuasion in Professional vs. Community-Authored Fact-Checks
por: Razuvayevskaya, Olesya, et al.
Publicado: (2026)

When Persuasion Overrides Truth in Multi-Agent LLM Debates: Introducing a Confidence-Weighted Persuasion Override Rate (CW-POR)
por: Agarwal, Mahak, et al.
Publicado: (2025)

TriAlign: Towards Universal Truth Consistency in Personalized LLM Alignment
por: Nguyen, Thi-Nhung, et al.
Publicado: (2026)

Counterfactual Reasoning Using Predicted Latent Personality Dimensions for Optimizing Persuasion Outcome
por: Zeng, Donghuo, et al.
Publicado: (2024)

Teaching LLM to be Persuasive: Reward-Enhanced Policy Optimization for Alignment from Heterogeneous Rewards
por: Zeng, Xia, et al.
Publicado: (2025)

The Dark Patterns of Personalized Persuasion in Large Language Models: Exposing Persuasive Linguistic Features for Big Five Personality Traits in LLMs Responses
por: Mieleszczenko-Kowszewicz, Wiktoria, et al.
Publicado: (2024)

Truthful or Fabricated? Using Causal Attribution to Mitigate Reward Hacking in Explanations
por: Ferreira, Pedro, et al.
Publicado: (2025)

Personality Modeling for Persuasion of Misinformation using AI Agent
por: Lou, Qianmin, et al.
Publicado: (2025)

TruthTorchLM: A Comprehensive Library for Predicting Truthfulness in LLM Outputs
por: Yaldiz, Duygu Nur, et al.
Publicado: (2025)

TruthEval: A Dataset to Evaluate LLM Truthfulness and Reliability
por: Khatun, Aisha, et al.
Publicado: (2024)

PARROT: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs
por: Çelebi, Yusuf, et al.
Publicado: (2025)

PersonalLLM: Tailoring LLMs to Individual Preferences
por: Zollo, Thomas P., et al.
Publicado: (2024)

Persuasiveness and Bias in LLM: Investigating the Impact of Persuasiveness and Reinforcement of Bias in Language Models
por: Roy, Saumya
Publicado: (2025)

LLM-Based Intelligent Notification Composition: From Static Personalization to Context-Aware Persuasive Messaging
por: Agrawal, Nilesh
Publicado: (2026)

The Anatomy of Speech Persuasion: Linguistic Shifts in LLM-Modified Speeches
por: Barkar, Alisa, et al.
Publicado: (2025)

Can You Trick the Grader? Adversarial Persuasion of LLM Judges
por: Hwang, Yerin, et al.
Publicado: (2025)

Measuring Opinion Bias and Sycophancy via LLM-based Persuasion
por: Nogueira, Rodrigo, et al.
Publicado: (2026)

LLM-Based Persuasion Enables Guardrail Override in Frontier LLMs
por: Nogueira, Rodrigo, et al.
Publicado: (2026)

Generative Framework for Personalized Persuasion: Inferring Causal, Counterfactual, and Latent Knowledge
por: Zeng, Donghuo, et al.
Publicado: (2025)

Zero-shot Persuasive Chatbots with LLM-Generated Strategies and Information Retrieval
por: Furumai, Kazuaki, et al.
Publicado: (2024)

Chord Colourizer: A Near Real-Time System for Visualizing Musical Key
por: Haimes, Paul
Publicado: (2025)

Learning to Retrieve User History and Generate User Profiles for Personalized Persuasiveness Prediction
por: Park, Sejun, et al.
Publicado: (2026)

TruthFlow: Truthful LLM Generation via Representation Flow Correction
por: Wang, Hanyu, et al.
Publicado: (2025)

Approximating Human Preferences Using a Multi-Judge Learned System
por: Sprejer, Eitán, et al.
Publicado: (2025)

Persuasion at Play: Understanding Misinformation Dynamics in Demographic-Aware Human-LLM Interactions
por: Borah, Angana, et al.
Publicado: (2025)

Train Yourself as an LLM: Exploring Effects of AI Literacy on Persuasion via Role-playing LLM Training
por: Fan, Qihui, et al.
Publicado: (2026)

Spontaneous Persuasion: An Audit of Model Persuasiveness in Everyday Conversations
por: Poungpeth, Nalin, et al.
Publicado: (2026)

PersuasiveToM: A Benchmark for Evaluating Machine Theory of Mind in Persuasive Dialogues
por: Yu, Fangxu, et al.
Publicado: (2025)

The Facade of Truth: Uncovering and Mitigating LLM Susceptibility to Deceptive Evidence
por: Wan, Herun, et al.
Publicado: (2026)

Revealing the Truth with ConLLM for Detecting Multi-Modal Deepfakes
por: Kashyap, Gautam Siddharth, et al.
Publicado: (2026)

Causal Discovery and Counterfactual Reasoning to Optimize Persuasive Dialogue Policies
por: Zeng, Donghuo, et al.
Publicado: (2025)

Two Pathways to Truthfulness: On the Intrinsic Encoding of LLM Hallucinations
por: Luo, Wen, et al.
Publicado: (2026)