Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Kulkarni, Shubham, Lyzhov, Alexander, Chaitanya, Shiva, Joshi, Preetam
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2601.08690
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912820348583936
author	Kulkarni, Shubham Lyzhov, Alexander Chaitanya, Shiva Joshi, Preetam
author_facet	Kulkarni, Shubham Lyzhov, Alexander Chaitanya, Shiva Joshi, Preetam
contents	Conversational AI is starting to support real clinical work, but most evaluation methods miss how compliance depends on the full course of a conversation. We introduce Obligatory-Information Phase Structured Compliance Evaluation (OIP-SCE), an evaluation method that checks whether every required clinical obligation is met, in the right order, with clear evidence for clinicians to review. This makes complex rules practical and auditable, helping close the gap between technical progress and what healthcare actually needs. We demonstrate the method in two case studies (respiratory history, benefits verification) and show how phase-level evidence turns policy into shared, actionable steps. By giving clinicians control over what to check and engineers a clear specification to implement, OIP-SCE provides a single, auditable evaluation surface that aligns AI capability with clinical workflow and supports routine, safe use.
format	Preprint
id	arxiv_https___arxiv_org_abs_2601_08690
institution	arXiv
publishDate	2026
record_format	arxiv
spellingShingle	All Required, In Order: Phase-Level Evaluation for AI-Human Dialogue in Healthcare and Beyond Kulkarni, Shubham Lyzhov, Alexander Chaitanya, Shiva Joshi, Preetam Artificial Intelligence Conversational AI is starting to support real clinical work, but most evaluation methods miss how compliance depends on the full course of a conversation. We introduce Obligatory-Information Phase Structured Compliance Evaluation (OIP-SCE), an evaluation method that checks whether every required clinical obligation is met, in the right order, with clear evidence for clinicians to review. This makes complex rules practical and auditable, helping close the gap between technical progress and what healthcare actually needs. We demonstrate the method in two case studies (respiratory history, benefits verification) and show how phase-level evidence turns policy into shared, actionable steps. By giving clinicians control over what to check and engineers a clear specification to implement, OIP-SCE provides a single, auditable evaluation surface that aligns AI capability with clinical workflow and supports routine, safe use.
title	All Required, In Order: Phase-Level Evaluation for AI-Human Dialogue in Healthcare and Beyond
topic	Artificial Intelligence
url	https://arxiv.org/abs/2601.08690

Similar Items