Guardado en:
| Autores principales: | Timm, Jasper, Talele, Chetan, Haimes, Jacob |
|---|---|
| Formato: | Preprint |
| Publicado: |
2025
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2501.17273 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
MEQA: A Meta-Evaluation Framework for Question & Answer LLM Benchmarks
por: Veuthey, Jaime Raldua, et al.
Publicado: (2025)
por: Veuthey, Jaime Raldua, et al.
Publicado: (2025)
View From Above: A Framework for Evaluating Distribution Shifts in Model Behavior
por: Chopra, Tanush, et al.
Publicado: (2024)
por: Chopra, Tanush, et al.
Publicado: (2024)
LLM-Generated Ads: From Personalization Parity to Persuasion Superiority
por: Meguellati, Elyas, et al.
Publicado: (2025)
por: Meguellati, Elyas, et al.
Publicado: (2025)
Using LLM-as-a-Judge/Jury to Advance Scalable, Clinically-Validated Safety Evaluations of Model Responses to Users Demonstrating Psychosis
por: Reese, May Lynn, et al.
Publicado: (2026)
por: Reese, May Lynn, et al.
Publicado: (2026)
HumanAgencyBench: Scalable Evaluation of Human Agency Support in AI Assistants
por: Sturgeon, Benjamin, et al.
Publicado: (2025)
por: Sturgeon, Benjamin, et al.
Publicado: (2025)
Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts
por: Haimes, Jacob, et al.
Publicado: (2024)
por: Haimes, Jacob, et al.
Publicado: (2024)
Debating with More Persuasive LLMs Leads to More Truthful Answers
por: Khan, Akbir, et al.
Publicado: (2024)
por: Khan, Akbir, et al.
Publicado: (2024)
PVP: An Image Dataset for Personalized Visual Persuasion with Persuasion Strategies, Viewer Characteristics, and Persuasiveness Ratings
por: Kim, Junseo, et al.
Publicado: (2025)
por: Kim, Junseo, et al.
Publicado: (2025)
Truth with a Twist: The Rhetoric of Persuasion in Professional vs. Community-Authored Fact-Checks
por: Razuvayevskaya, Olesya, et al.
Publicado: (2026)
por: Razuvayevskaya, Olesya, et al.
Publicado: (2026)
When Persuasion Overrides Truth in Multi-Agent LLM Debates: Introducing a Confidence-Weighted Persuasion Override Rate (CW-POR)
por: Agarwal, Mahak, et al.
Publicado: (2025)
por: Agarwal, Mahak, et al.
Publicado: (2025)
TriAlign: Towards Universal Truth Consistency in Personalized LLM Alignment
por: Nguyen, Thi-Nhung, et al.
Publicado: (2026)
por: Nguyen, Thi-Nhung, et al.
Publicado: (2026)
Counterfactual Reasoning Using Predicted Latent Personality Dimensions for Optimizing Persuasion Outcome
por: Zeng, Donghuo, et al.
Publicado: (2024)
por: Zeng, Donghuo, et al.
Publicado: (2024)
Teaching LLM to be Persuasive: Reward-Enhanced Policy Optimization for Alignment from Heterogeneous Rewards
por: Zeng, Xia, et al.
Publicado: (2025)
por: Zeng, Xia, et al.
Publicado: (2025)
The Dark Patterns of Personalized Persuasion in Large Language Models: Exposing Persuasive Linguistic Features for Big Five Personality Traits in LLMs Responses
por: Mieleszczenko-Kowszewicz, Wiktoria, et al.
Publicado: (2024)
por: Mieleszczenko-Kowszewicz, Wiktoria, et al.
Publicado: (2024)
Truthful or Fabricated? Using Causal Attribution to Mitigate Reward Hacking in Explanations
por: Ferreira, Pedro, et al.
Publicado: (2025)
por: Ferreira, Pedro, et al.
Publicado: (2025)
Personality Modeling for Persuasion of Misinformation using AI Agent
por: Lou, Qianmin, et al.
Publicado: (2025)
por: Lou, Qianmin, et al.
Publicado: (2025)
TruthTorchLM: A Comprehensive Library for Predicting Truthfulness in LLM Outputs
por: Yaldiz, Duygu Nur, et al.
Publicado: (2025)
por: Yaldiz, Duygu Nur, et al.
Publicado: (2025)
TruthEval: A Dataset to Evaluate LLM Truthfulness and Reliability
por: Khatun, Aisha, et al.
Publicado: (2024)
por: Khatun, Aisha, et al.
Publicado: (2024)
PARROT: Persuasion and Agreement Robustness Rating of Output Truth -- A Sycophancy Robustness Benchmark for LLMs
por: Çelebi, Yusuf, et al.
Publicado: (2025)
por: Çelebi, Yusuf, et al.
Publicado: (2025)
PersonalLLM: Tailoring LLMs to Individual Preferences
por: Zollo, Thomas P., et al.
Publicado: (2024)
por: Zollo, Thomas P., et al.
Publicado: (2024)
Persuasiveness and Bias in LLM: Investigating the Impact of Persuasiveness and Reinforcement of Bias in Language Models
por: Roy, Saumya
Publicado: (2025)
por: Roy, Saumya
Publicado: (2025)
LLM-Based Intelligent Notification Composition: From Static Personalization to Context-Aware Persuasive Messaging
por: Agrawal, Nilesh
Publicado: (2026)
por: Agrawal, Nilesh
Publicado: (2026)
The Anatomy of Speech Persuasion: Linguistic Shifts in LLM-Modified Speeches
por: Barkar, Alisa, et al.
Publicado: (2025)
por: Barkar, Alisa, et al.
Publicado: (2025)
Can You Trick the Grader? Adversarial Persuasion of LLM Judges
por: Hwang, Yerin, et al.
Publicado: (2025)
por: Hwang, Yerin, et al.
Publicado: (2025)
Measuring Opinion Bias and Sycophancy via LLM-based Persuasion
por: Nogueira, Rodrigo, et al.
Publicado: (2026)
por: Nogueira, Rodrigo, et al.
Publicado: (2026)
LLM-Based Persuasion Enables Guardrail Override in Frontier LLMs
por: Nogueira, Rodrigo, et al.
Publicado: (2026)
por: Nogueira, Rodrigo, et al.
Publicado: (2026)
Generative Framework for Personalized Persuasion: Inferring Causal, Counterfactual, and Latent Knowledge
por: Zeng, Donghuo, et al.
Publicado: (2025)
por: Zeng, Donghuo, et al.
Publicado: (2025)
Zero-shot Persuasive Chatbots with LLM-Generated Strategies and Information Retrieval
por: Furumai, Kazuaki, et al.
Publicado: (2024)
por: Furumai, Kazuaki, et al.
Publicado: (2024)
Chord Colourizer: A Near Real-Time System for Visualizing Musical Key
por: Haimes, Paul
Publicado: (2025)
por: Haimes, Paul
Publicado: (2025)
Learning to Retrieve User History and Generate User Profiles for Personalized Persuasiveness Prediction
por: Park, Sejun, et al.
Publicado: (2026)
por: Park, Sejun, et al.
Publicado: (2026)
TruthFlow: Truthful LLM Generation via Representation Flow Correction
por: Wang, Hanyu, et al.
Publicado: (2025)
por: Wang, Hanyu, et al.
Publicado: (2025)
Approximating Human Preferences Using a Multi-Judge Learned System
por: Sprejer, Eitán, et al.
Publicado: (2025)
por: Sprejer, Eitán, et al.
Publicado: (2025)
Persuasion at Play: Understanding Misinformation Dynamics in Demographic-Aware Human-LLM Interactions
por: Borah, Angana, et al.
Publicado: (2025)
por: Borah, Angana, et al.
Publicado: (2025)
Train Yourself as an LLM: Exploring Effects of AI Literacy on Persuasion via Role-playing LLM Training
por: Fan, Qihui, et al.
Publicado: (2026)
por: Fan, Qihui, et al.
Publicado: (2026)
Spontaneous Persuasion: An Audit of Model Persuasiveness in Everyday Conversations
por: Poungpeth, Nalin, et al.
Publicado: (2026)
por: Poungpeth, Nalin, et al.
Publicado: (2026)
PersuasiveToM: A Benchmark for Evaluating Machine Theory of Mind in Persuasive Dialogues
por: Yu, Fangxu, et al.
Publicado: (2025)
por: Yu, Fangxu, et al.
Publicado: (2025)
The Facade of Truth: Uncovering and Mitigating LLM Susceptibility to Deceptive Evidence
por: Wan, Herun, et al.
Publicado: (2026)
por: Wan, Herun, et al.
Publicado: (2026)
Revealing the Truth with ConLLM for Detecting Multi-Modal Deepfakes
por: Kashyap, Gautam Siddharth, et al.
Publicado: (2026)
por: Kashyap, Gautam Siddharth, et al.
Publicado: (2026)
Causal Discovery and Counterfactual Reasoning to Optimize Persuasive Dialogue Policies
por: Zeng, Donghuo, et al.
Publicado: (2025)
por: Zeng, Donghuo, et al.
Publicado: (2025)
Two Pathways to Truthfulness: On the Intrinsic Encoding of LLM Hallucinations
por: Luo, Wen, et al.
Publicado: (2026)
por: Luo, Wen, et al.
Publicado: (2026)
Ejemplares similares
-
MEQA: A Meta-Evaluation Framework for Question & Answer LLM Benchmarks
por: Veuthey, Jaime Raldua, et al.
Publicado: (2025) -
View From Above: A Framework for Evaluating Distribution Shifts in Model Behavior
por: Chopra, Tanush, et al.
Publicado: (2024) -
LLM-Generated Ads: From Personalization Parity to Persuasion Superiority
por: Meguellati, Elyas, et al.
Publicado: (2025) -
Using LLM-as-a-Judge/Jury to Advance Scalable, Clinically-Validated Safety Evaluations of Model Responses to Users Demonstrating Psychosis
por: Reese, May Lynn, et al.
Publicado: (2026) -
HumanAgencyBench: Scalable Evaluation of Human Agency Support in AI Assistants
por: Sturgeon, Benjamin, et al.
Publicado: (2025)