Saved in:
| Main Authors: | Jelínek, Matouš, Schlicker, Nadine, de Visser, Ewart |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.02371 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Explanation format does not matter; but explanations do -- An Eggsbert study on explaining Bayesian Optimisation tasks
by: Chakraborty, Tanmay, et al.
Published: (2025)
by: Chakraborty, Tanmay, et al.
Published: (2025)
Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents
by: Chen, Chaoran, et al.
Published: (2025)
by: Chen, Chaoran, et al.
Published: (2025)
Creating an African American-Sounding TTS: Guidelines, Technical Challenges,and Surprising Evaluations
by: Pinhanez, Claudio, et al.
Published: (2024)
by: Pinhanez, Claudio, et al.
Published: (2024)
Aligning Stuttered-Speech Research with End-User Needs: Scoping Review, Survey, and Guidelines
by: Toyin, Hawau Olamide, et al.
Published: (2026)
by: Toyin, Hawau Olamide, et al.
Published: (2026)
Engineering Trustworthy Automation: Design Principles and Evaluation for AutoML Tools for Novices
by: Thys, Jarne, et al.
Published: (2025)
by: Thys, Jarne, et al.
Published: (2025)
Gaze-informed Signatures of Trust and Collaboration in Human-Autonomy Teams
by: Ries, Anthony J., et al.
Published: (2024)
by: Ries, Anthony J., et al.
Published: (2024)
Harmonic LLMs are Trustworthy
by: Kersting, Nicholas S., et al.
Published: (2024)
by: Kersting, Nicholas S., et al.
Published: (2024)
Guidelines for Integrating Value Sensitive Design in Responsible AI Toolkits
by: Sadek, Malak, et al.
Published: (2024)
by: Sadek, Malak, et al.
Published: (2024)
Exploring the Ethical Concerns in User Reviews of Mental Health Apps using Topic Modeling and Sentiment Analysis
by: Rahman, Mohammad Masudur, et al.
Published: (2026)
by: Rahman, Mohammad Masudur, et al.
Published: (2026)
Toward a Human-Centered Evaluation Framework for Trustworthy LLM-Powered GUI Agents
by: Chen, Chaoran, et al.
Published: (2025)
by: Chen, Chaoran, et al.
Published: (2025)
Trustworthy and Practical AI for Healthcare: A Guided Deferral System with Large Language Models
by: Strong, Joshua, et al.
Published: (2024)
by: Strong, Joshua, et al.
Published: (2024)
IELTS Writing Revision Platform with Automated Essay Scoring and Adaptive Feedback
by: Ramancauskas, Titas, et al.
Published: (2025)
by: Ramancauskas, Titas, et al.
Published: (2025)
Marked Pedagogies: Examining Linguistic Biases in Personalized Automated Writing Feedback
by: Tan, Mei, et al.
Published: (2026)
by: Tan, Mei, et al.
Published: (2026)
Understanding Learner-LLM Chatbot Interactions and the Impact of Prompting Guidelines
by: Koyuturk, Cansu, et al.
Published: (2025)
by: Koyuturk, Cansu, et al.
Published: (2025)
Automated Bias Assessment in AI-Generated Educational Content Using CEAT Framework
by: Peng, Jingyang, et al.
Published: (2025)
by: Peng, Jingyang, et al.
Published: (2025)
Automated Coding of Communications in Collaborative Problem-solving Tasks Using ChatGPT
by: Hao, Jiangang, et al.
Published: (2024)
by: Hao, Jiangang, et al.
Published: (2024)
TRACE: A Metrologically-Grounded Engineering Framework for Trustworthy Agentic AI Systems in Operationally Critical Domains
by: Zabolotnii, Serhii
Published: (2026)
by: Zabolotnii, Serhii
Published: (2026)
AgentA/B: Automated and Scalable Web A/BTesting with Interactive LLM Agents
by: Lu, Yuxuan, et al.
Published: (2025)
by: Lu, Yuxuan, et al.
Published: (2025)
Toward Automated Qualitative Analysis: Leveraging Large Language Models for Tutoring Dialogue Evaluation
by: Gu, Megan, et al.
Published: (2025)
by: Gu, Megan, et al.
Published: (2025)
S-DAT: A Multilingual, GenAI-Driven Framework for Automated Divergent Thinking Assessment
by: Haase, Jennifer, et al.
Published: (2025)
by: Haase, Jennifer, et al.
Published: (2025)
An Iterative Associative Memory Model for Empathetic Response Generation
by: Yang, Zhou, et al.
Published: (2024)
by: Yang, Zhou, et al.
Published: (2024)
Exploring Automated Keyword Mnemonics Generation with Large Language Models via Overgenerate-and-Rank
by: Lee, Jaewook, et al.
Published: (2024)
by: Lee, Jaewook, et al.
Published: (2024)
Capturing Visualization Design Rationale
by: Hutchinson, Maeve, et al.
Published: (2025)
by: Hutchinson, Maeve, et al.
Published: (2025)
Think Outside the Data: Colonial Biases and Systemic Issues in Automated Moderation Pipelines for Low-Resource Languages
by: Shahid, Farhana, et al.
Published: (2025)
by: Shahid, Farhana, et al.
Published: (2025)
Do Ethical AI Principles Matter to Users? A Large-Scale Analysis of User Sentiment and Satisfaction
by: Pasch, Stefan, et al.
Published: (2025)
by: Pasch, Stefan, et al.
Published: (2025)
RELIC: Investigating Large Language Model Responses using Self-Consistency
by: Cheng, Furui, et al.
Published: (2023)
by: Cheng, Furui, et al.
Published: (2023)
Improving Factuality for Dialogue Response Generation via Graph-Based Knowledge Augmentation
by: Chen, Xiangyan, et al.
Published: (2025)
by: Chen, Xiangyan, et al.
Published: (2025)
Word Synchronization Challenge: A Benchmark for Word Association Responses for Large Language Models
by: Cazalets, Tanguy, et al.
Published: (2025)
by: Cazalets, Tanguy, et al.
Published: (2025)
Designing and Evaluating Chain-of-Hints for Scientific Question Answering
by: Jangra, Anubhav, et al.
Published: (2025)
by: Jangra, Anubhav, et al.
Published: (2025)
A Design Space for Intelligent and Interactive Writing Assistants
by: Lee, Mina, et al.
Published: (2024)
by: Lee, Mina, et al.
Published: (2024)
Chaplains' Reflections on the Design and Usage of AI for Conversational Care
by: Wester, Joel, et al.
Published: (2026)
by: Wester, Joel, et al.
Published: (2026)
PersoPilot: An Adaptive AI-Copilot for Transparent Contextualized Persona Classification and Personalized Response Generation
by: Afzoon, Saleh, et al.
Published: (2026)
by: Afzoon, Saleh, et al.
Published: (2026)
Think Twice: A Human-like Two-stage Conversational Agent for Emotional Response Generation
by: Qian, Yushan, et al.
Published: (2023)
by: Qian, Yushan, et al.
Published: (2023)
Disentangling Prompt Element Level Risk Factors for Hallucinations and Omissions in Mental Health LLM Responses
by: Ni, Congning, et al.
Published: (2026)
by: Ni, Congning, et al.
Published: (2026)
AI vs. Human Judgment of Content Moderation: LLM-as-a-Judge and Ethics-Based Response Refusals
by: Pasch, Stefan
Published: (2025)
by: Pasch, Stefan
Published: (2025)
Are You the A-hole? A Fair, Multi-Perspective Ethical Reasoning Framework
by: Munir, Sheza, et al.
Published: (2026)
by: Munir, Sheza, et al.
Published: (2026)
MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL
by: Askari, Arian, et al.
Published: (2024)
by: Askari, Arian, et al.
Published: (2024)
UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design
by: Lu, Yuxuan, et al.
Published: (2025)
by: Lu, Yuxuan, et al.
Published: (2025)
Risks and NLP Design: A Case Study on Procedural Document QA
by: Haduong, Nikita, et al.
Published: (2024)
by: Haduong, Nikita, et al.
Published: (2024)
UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents
by: Lu, Yuxuan, et al.
Published: (2025)
by: Lu, Yuxuan, et al.
Published: (2025)
Similar Items
-
Explanation format does not matter; but explanations do -- An Eggsbert study on explaining Bayesian Optimisation tasks
by: Chakraborty, Tanmay, et al.
Published: (2025) -
Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents
by: Chen, Chaoran, et al.
Published: (2025) -
Creating an African American-Sounding TTS: Guidelines, Technical Challenges,and Surprising Evaluations
by: Pinhanez, Claudio, et al.
Published: (2024) -
Aligning Stuttered-Speech Research with End-User Needs: Scoping Review, Survey, and Guidelines
by: Toyin, Hawau Olamide, et al.
Published: (2026) -
Engineering Trustworthy Automation: Design Principles and Evaluation for AutoML Tools for Novices
by: Thys, Jarne, et al.
Published: (2025)