| Main Authors: | Navarro, David Fraile, Magrabi, Farah, Coiera, Enrico |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.11413 |
Similar Items
A Model for Intelligible Interaction Between Agents That Predict and Explain
by: Baskar, A., et al.
Published: (2023)
One-shot emergency psychiatric triage across 15 frontier AI chatbots
by: Weilnhammer, Veith, et al.
Published: (2026)
Behavioural gap assessment of human-vehicle interaction in real and virtual reality-based scenarios in autonomous driving
by: Serrano, Sergio Martín, et al.
Published: (2024)
Conversational AI for Automated Patient Questionnaire Completion: Development Insights and Design Principles
by: Navarro, David Fraile, et al.
Published: (2026)
Synthetic Heuristic Evaluation: A Comparison between AI- and Human-Powered Usability Evaluation
by: Zhong, Ruican, et al.
Published: (2025)
An Empirical Examination of the Evaluative AI Framework
by: Kornowicz, Jaroslaw
Published: (2024)
Evaluation of Architectural Synthesis Using Generative AI
by: Huang, Jingfei, et al.
Published: (2025)
Evaluating the Effects of AI Directors for Quest Selection
by: Yu, Kristen K., et al.
Published: (2024)
From Evidence to Decision: Exploring Evaluative AI
by: Le, Thao, et al.
Published: (2024)
SPHERE: An Evaluation Card for Human-AI Systems
by: Ma, Qianou, et al.
Published: (2025)
Towards Human-AI Deliberation: Design and Evaluation of LLM-Empowered Deliberative AI for AI-Assisted Decision-Making
by: Ma, Shuai, et al.
Published: (2024)
Towards culturally-appropriate conversational AI for health in the majority world: An exploratory study with citizens and professionals in Latin America
by: Peters, Dorian, et al.
Published: (2025)
Interactive AI Alignment: Specification, Process, and Evaluation Alignment
by: Terry, Michael, et al.
Published: (2023)
Creativity in the Age of AI: Evaluating the Impact of Generative AI on Design Outputs and Designers' Creative Thinking
by: Fu, Yue, et al.
Published: (2024)
Culling Misinformation from Gen AI: Toward Ethical Curation and Refinement
by: Khatiwada, Prerana, et al.
Published: (2025)
Evaluation and Incident Prevention in an Enterprise AI Assistant
by: Maharaj, Akash V., et al.
Published: (2025)
Interoceptive Divergence in Aesthetic Evaluation and Implications for Human-AI Alignment
by: Abe, Yoshia, et al.
Published: (2026)
Evaluating AI for Law: Bridging the Gap with Open-Source Solutions
by: Bhambhoria, Rohan, et al.
Published: (2024)
Evaluating graph-based explanations for AI-based recommender systems
by: Delarue, Simon, et al.
Published: (2024)
Wikibench: Community-Driven Data Curation for AI Evaluation on Wikipedia
by: Kuo, Tzu-Sheng, et al.
Published: (2024)
Evaluating Human-AI Collaboration: A Review and Methodological Framework
by: Fragiadakis, George, et al.
Published: (2024)
An Approach to Grounding AI Model Evaluations in Human-derived Criteria
by: Mitts, Sasha
Published: (2025)
PASTA: A Scalable Framework for Multi-Policy AI Compliance Evaluation
by: Yang, Yu, et al.
Published: (2026)
Useful for Exploration, Risky for Precision: Evaluating AI Tools in Academic Research
by: Dathe, Anthea, et al.
Published: (2026)
Towards a Comprehensive Human-Centred Evaluation Framework for Explainable AI
by: Donoso-Guzmán, Ivania, et al.
Published: (2023)
Privacy in Human-AI Romantic Relationships: Concerns, Boundaries, and Agency
by: Ma, Rongjun, et al.
Published: (2026)
Gaze-Aware AI: Mathematical modeling of epistemic experience of the Marginalized for Human-Computer Interaction & AI Systems
by: Hatti, Omkar Suresh
Published: (2025)
Learning to Generate and Evaluate Fact-checking Explanations with Transformers
by: Feher, Darius, et al.
Published: (2024)
Expert Evaluation and the Limits of Human Feedback in Mental Health AI Safety Testing
by: Jafari, Kiana, et al.
Published: (2026)
Designing and Evaluating Malinowski's Lens: An AI-Native Educational Game for Ethnographic Learning
by: Hoffmann, Michael, et al.
Published: (2025)
Evaluating Trust in AI, Human, and Co-produced Feedback Among Undergraduate Students
by: Zhang, Audrey, et al.
Published: (2025)
GenLens: A Systematic Evaluation of Visual GenAI Model Outputs
by: Lin, Tica, et al.
Published: (2024)
How Human-Centered Explainable AI Interface Are Designed and Evaluated: A Systematic Survey
by: Nguyen, Thu, et al.
Published: (2024)
Evaluating the Impact of AI-Powered Audiovisual Personalization on Learner Emotion, Focus, and Learning Outcomes
by: Wang, George Xi, et al.
Published: (2025)
Levels of Autonomy for AI Agents
by: Feng, K. J. Kevin, et al.
Published: (2025)
EUDAIMONIA: Evaluating Undesirable Dynamics in AI
by: Huang, Jun Rui, et al.
Published: (2026)
StreetDesignAI: Broadening Designer Perspectives Through Multi-Persona Evaluation of Cycling Infrastructure
by: Wang, Ziyi, et al.
Published: (2026)
Evaluating Machine Expertise: How Graduate Students Develop Frameworks for Assessing GenAI Content
by: Chen, Celia, et al.
Published: (2025)
Evaluating the Impact of LLM-guided Reflection on Learning Outcomes with Interactive AI-Generated Educational Podcasts
by: Menon, Vishnu, et al.
Published: (2025)
"My Grade is Wrong!": A Contestable AI Framework for Interactive Feedback in Evaluating Student Essays
by: Hong, Shengxin, et al.
Published: (2024)