Saved in:
| Main Author: | Steinle, Sean |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.03399 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Understanding AI Trustworthiness: A Scoping Review of AIES & FAccT Articles
by: Mehrotra, Siddharth, et al.
Published: (2025)
by: Mehrotra, Siddharth, et al.
Published: (2025)
Ran Score: a LLM-based Evaluation Score for Radiology Report Generation
by: Zhang, Ran, et al.
Published: (2026)
by: Zhang, Ran, et al.
Published: (2026)
The Journey to Trustworthy AI: Pursuit of Pragmatic Frameworks
by: Nasr-Azadani, Mohamad M, et al.
Published: (2024)
by: Nasr-Azadani, Mohamad M, et al.
Published: (2024)
Designing The Internet of Agents: A Framework for Trustworthy, Transparent, and Collaborative Human-Agent Interaction (HAX)
by: Scibelli, Marc, et al.
Published: (2025)
by: Scibelli, Marc, et al.
Published: (2025)
Learning to Plan with Personalized Preferences
by: Xu, Manjie, et al.
Published: (2025)
by: Xu, Manjie, et al.
Published: (2025)
Creating 'Full-Stack' Hybrid Reasoning Systems that Prioritize and Enhance Human Intelligence
by: Koon, Sean
Published: (2025)
by: Koon, Sean
Published: (2025)
A Beautiful Mind: Principles and Strategies for AI-Augmented Human Reasoning
by: Koon, Sean
Published: (2025)
by: Koon, Sean
Published: (2025)
PILAR: Personalizing Augmented Reality Interactions with LLM-based Human-Centric and Trustworthy Explanations for Daily Use Cases
by: Kundu, Ripan Kumar, et al.
Published: (2025)
by: Kundu, Ripan Kumar, et al.
Published: (2025)
Bidirectional Human-AI Alignment in Education for Trustworthy Learning Environments
by: Shen, Hua
Published: (2025)
by: Shen, Hua
Published: (2025)
FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation
by: Lyu, Hanfang, et al.
Published: (2024)
by: Lyu, Hanfang, et al.
Published: (2024)
Harmonic LLMs are Trustworthy
by: Kersting, Nicholas S., et al.
Published: (2024)
by: Kersting, Nicholas S., et al.
Published: (2024)
Aligning Trustworthy AI with Democracy: A Dual Taxonomy of Opportunities and Risks
by: Mentxaka, Oier, et al.
Published: (2025)
by: Mentxaka, Oier, et al.
Published: (2025)
A Checklist for Trustworthy, Safe, and User-Friendly Mental Health Chatbots
by: Haran, Shreya, et al.
Published: (2026)
by: Haran, Shreya, et al.
Published: (2026)
Visual Analytics for Explainable and Trustworthy Artificial Intelligence
by: Chatzimparmpas, Angelos
Published: (2025)
by: Chatzimparmpas, Angelos
Published: (2025)
Modeling User Preferences via Brain-Computer Interfacing
by: Leiva, Luis A., et al.
Published: (2024)
by: Leiva, Luis A., et al.
Published: (2024)
Advancing Trustworthy AI for Sustainable Development: Recommendations for Standardising AI Incident Reporting
by: Agarwal, Avinash, et al.
Published: (2025)
by: Agarwal, Avinash, et al.
Published: (2025)
Steerable Chatbots: Personalizing LLMs with Preference-Based Activation Steering
by: Bo, Jessica Y., et al.
Published: (2025)
by: Bo, Jessica Y., et al.
Published: (2025)
In Pursuit of Predictive Models of Human Preferences Toward AI Teammates
by: Siu, Ho Chit, et al.
Published: (2025)
by: Siu, Ho Chit, et al.
Published: (2025)
Problem Solving Through Human-AI Preference-Based Cooperation
by: Dutta, Subhabrata, et al.
Published: (2024)
by: Dutta, Subhabrata, et al.
Published: (2024)
Trustworthy and Practical AI for Healthcare: A Guided Deferral System with Large Language Models
by: Strong, Joshua, et al.
Published: (2024)
by: Strong, Joshua, et al.
Published: (2024)
Facilitating Human-LLM Collaboration through Factuality Scores and Source Attributions
by: Do, Hyo Jin, et al.
Published: (2024)
by: Do, Hyo Jin, et al.
Published: (2024)
LAPPI: Interactive Optimization with LLM-Assisted Preference-Based Problem Instantiation
by: Kuroki, So, et al.
Published: (2025)
by: Kuroki, So, et al.
Published: (2025)
Communication Styles and Reader Preferences of LLM and Human Experts in Explaining Health Information
by: Zhou, Jiawei, et al.
Published: (2025)
by: Zhou, Jiawei, et al.
Published: (2025)
PREDICT: Preference Reasoning by Evaluating Decomposed preferences Inferred from Candidate Trajectories
by: Aroca-Ouellette, Stephane, et al.
Published: (2024)
by: Aroca-Ouellette, Stephane, et al.
Published: (2024)
Explainable AI for Maritime Autonomous Surface Ships (MASS): Adaptive Interfaces and Trustworthy Human-AI Collaboration
by: Zhang, Zhuoyue, et al.
Published: (2025)
by: Zhang, Zhuoyue, et al.
Published: (2025)
TRACE: A Metrologically-Grounded Engineering Framework for Trustworthy Agentic AI Systems in Operationally Critical Domains
by: Zabolotnii, Serhii
Published: (2026)
by: Zabolotnii, Serhii
Published: (2026)
Progressive Autonomy as Preference Learning: A Formalization of Trust Calibration for Agentic Tool Use
by: Ou, Changkun
Published: (2026)
by: Ou, Changkun
Published: (2026)
Who Validates the Validators? Aligning LLM-Assisted Evaluation of LLM Outputs with Human Preferences
by: Shankar, Shreya, et al.
Published: (2024)
by: Shankar, Shreya, et al.
Published: (2024)
Evaluating the Effectiveness of Large Language Models in Solving Simple Programming Tasks: A User-Centered Study
by: Deng, Kai
Published: (2025)
by: Deng, Kai
Published: (2025)
The Role of AI in Peer Support for Young People: A Study of Preferences for Human- and AI-Generated Responses
by: Young, Jordyn, et al.
Published: (2024)
by: Young, Jordyn, et al.
Published: (2024)
World of ScoreCraft: Novel Multi Scorer Experiment on the Impact of a Decision Support System in Sleep Staging
by: Holm, Benedikt, et al.
Published: (2025)
by: Holm, Benedikt, et al.
Published: (2025)
Aligning LLMs with Individual Preferences via Interaction
by: Wu, Shujin, et al.
Published: (2024)
by: Wu, Shujin, et al.
Published: (2024)
Vibrotactile Preference Learning: Uncertainty-Aware Preference Learning for Personalized Vibration Feedback
by: Zhang, Rongtao, et al.
Published: (2026)
by: Zhang, Rongtao, et al.
Published: (2026)
Towards Balancing Preference and Performance through Adaptive Personalized Explainability
by: Silva, Andrew, et al.
Published: (2025)
by: Silva, Andrew, et al.
Published: (2025)
Trustworthy AI Psychotherapy: Multi-Agent LLM Workflow for Counseling and Explainable Mental Disorder Diagnosis
by: Ozgun, Mithat Can, et al.
Published: (2025)
by: Ozgun, Mithat Can, et al.
Published: (2025)
Improving Health Professionals' Onboarding with AI and XAI for Trustworthy Human-AI Collaborative Decision Making
by: Lee, Min Hun, et al.
Published: (2024)
by: Lee, Min Hun, et al.
Published: (2024)
Same Words, Different Judgments: How Preferences Vary Across Modalities
by: Broukhim, Aaron, et al.
Published: (2026)
by: Broukhim, Aaron, et al.
Published: (2026)
Learning Reward and Policy Jointly from Demonstration and Preference Improves Alignment
by: Li, Chenliang, et al.
Published: (2024)
by: Li, Chenliang, et al.
Published: (2024)
On The Stability of Moral Preferences: A Problem with Computational Elicitation Methods
by: Boerstler, Kyle, et al.
Published: (2024)
by: Boerstler, Kyle, et al.
Published: (2024)
De-skilling, Cognitive Offloading, and Misplaced Responsibilities: Potential Ironies of AI-Assisted Design
by: Shukla, Prakash, et al.
Published: (2025)
by: Shukla, Prakash, et al.
Published: (2025)
Similar Items
-
Understanding AI Trustworthiness: A Scoping Review of AIES & FAccT Articles
by: Mehrotra, Siddharth, et al.
Published: (2025) -
Ran Score: a LLM-based Evaluation Score for Radiology Report Generation
by: Zhang, Ran, et al.
Published: (2026) -
The Journey to Trustworthy AI: Pursuit of Pragmatic Frameworks
by: Nasr-Azadani, Mohamad M, et al.
Published: (2024) -
Designing The Internet of Agents: A Framework for Trustworthy, Transparent, and Collaborative Human-Agent Interaction (HAX)
by: Scibelli, Marc, et al.
Published: (2025) -
Learning to Plan with Personalized Preferences
by: Xu, Manjie, et al.
Published: (2025)