:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Jelínek, Matouš, Schlicker, Nadine, de Visser, Ewart
Format:	Preprint
Published:	2025
Subjects:	Human-Computer Interaction Computation and Language
Online Access:	https://arxiv.org/abs/2508.02371
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Explanation format does not matter; but explanations do -- An Eggsbert study on explaining Bayesian Optimisation tasks
by: Chakraborty, Tanmay, et al.
Published: (2025)

Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents
by: Chen, Chaoran, et al.
Published: (2025)

Creating an African American-Sounding TTS: Guidelines, Technical Challenges,and Surprising Evaluations
by: Pinhanez, Claudio, et al.
Published: (2024)

Aligning Stuttered-Speech Research with End-User Needs: Scoping Review, Survey, and Guidelines
by: Toyin, Hawau Olamide, et al.
Published: (2026)

Engineering Trustworthy Automation: Design Principles and Evaluation for AutoML Tools for Novices
by: Thys, Jarne, et al.
Published: (2025)

Gaze-informed Signatures of Trust and Collaboration in Human-Autonomy Teams
by: Ries, Anthony J., et al.
Published: (2024)

Harmonic LLMs are Trustworthy
by: Kersting, Nicholas S., et al.
Published: (2024)

Guidelines for Integrating Value Sensitive Design in Responsible AI Toolkits
by: Sadek, Malak, et al.
Published: (2024)

Exploring the Ethical Concerns in User Reviews of Mental Health Apps using Topic Modeling and Sentiment Analysis
by: Rahman, Mohammad Masudur, et al.
Published: (2026)

Toward a Human-Centered Evaluation Framework for Trustworthy LLM-Powered GUI Agents
by: Chen, Chaoran, et al.
Published: (2025)

Trustworthy and Practical AI for Healthcare: A Guided Deferral System with Large Language Models
by: Strong, Joshua, et al.
Published: (2024)

IELTS Writing Revision Platform with Automated Essay Scoring and Adaptive Feedback
by: Ramancauskas, Titas, et al.
Published: (2025)

Marked Pedagogies: Examining Linguistic Biases in Personalized Automated Writing Feedback
by: Tan, Mei, et al.
Published: (2026)

Understanding Learner-LLM Chatbot Interactions and the Impact of Prompting Guidelines
by: Koyuturk, Cansu, et al.
Published: (2025)

Automated Bias Assessment in AI-Generated Educational Content Using CEAT Framework
by: Peng, Jingyang, et al.
Published: (2025)

Automated Coding of Communications in Collaborative Problem-solving Tasks Using ChatGPT
by: Hao, Jiangang, et al.
Published: (2024)

TRACE: A Metrologically-Grounded Engineering Framework for Trustworthy Agentic AI Systems in Operationally Critical Domains
by: Zabolotnii, Serhii
Published: (2026)

AgentA/B: Automated and Scalable Web A/BTesting with Interactive LLM Agents
by: Lu, Yuxuan, et al.
Published: (2025)

Toward Automated Qualitative Analysis: Leveraging Large Language Models for Tutoring Dialogue Evaluation
by: Gu, Megan, et al.
Published: (2025)

S-DAT: A Multilingual, GenAI-Driven Framework for Automated Divergent Thinking Assessment
by: Haase, Jennifer, et al.
Published: (2025)

An Iterative Associative Memory Model for Empathetic Response Generation
by: Yang, Zhou, et al.
Published: (2024)

Exploring Automated Keyword Mnemonics Generation with Large Language Models via Overgenerate-and-Rank
by: Lee, Jaewook, et al.
Published: (2024)

Capturing Visualization Design Rationale
by: Hutchinson, Maeve, et al.
Published: (2025)

Think Outside the Data: Colonial Biases and Systemic Issues in Automated Moderation Pipelines for Low-Resource Languages
by: Shahid, Farhana, et al.
Published: (2025)

Do Ethical AI Principles Matter to Users? A Large-Scale Analysis of User Sentiment and Satisfaction
by: Pasch, Stefan, et al.
Published: (2025)

RELIC: Investigating Large Language Model Responses using Self-Consistency
by: Cheng, Furui, et al.
Published: (2023)

Improving Factuality for Dialogue Response Generation via Graph-Based Knowledge Augmentation
by: Chen, Xiangyan, et al.
Published: (2025)

Word Synchronization Challenge: A Benchmark for Word Association Responses for Large Language Models
by: Cazalets, Tanguy, et al.
Published: (2025)

Designing and Evaluating Chain-of-Hints for Scientific Question Answering
by: Jangra, Anubhav, et al.
Published: (2025)

A Design Space for Intelligent and Interactive Writing Assistants
by: Lee, Mina, et al.
Published: (2024)

Chaplains' Reflections on the Design and Usage of AI for Conversational Care
by: Wester, Joel, et al.
Published: (2026)

PersoPilot: An Adaptive AI-Copilot for Transparent Contextualized Persona Classification and Personalized Response Generation
by: Afzoon, Saleh, et al.
Published: (2026)

Think Twice: A Human-like Two-stage Conversational Agent for Emotional Response Generation
by: Qian, Yushan, et al.
Published: (2023)

Disentangling Prompt Element Level Risk Factors for Hallucinations and Omissions in Mental Health LLM Responses
by: Ni, Congning, et al.
Published: (2026)

AI vs. Human Judgment of Content Moderation: LLM-as-a-Judge and Ethics-Based Response Refusals
by: Pasch, Stefan
Published: (2025)

Are You the A-hole? A Fair, Multi-Perspective Ethical Reasoning Framework
by: Munir, Sheza, et al.
Published: (2026)

MAGIC: Generating Self-Correction Guideline for In-Context Text-to-SQL
by: Askari, Arian, et al.
Published: (2024)

UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design
by: Lu, Yuxuan, et al.
Published: (2025)

Risks and NLP Design: A Case Study on Procedural Document QA
by: Haduong, Nikita, et al.
Published: (2024)

UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents
by: Lu, Yuxuan, et al.
Published: (2025)