Saved in:
Bibliographic Details
Main Authors: Kulkarni, Shubham, Lyzhov, Alexander, Chaitanya, Shiva, Joshi, Preetam
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.08690
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912820348583936
author Kulkarni, Shubham
Lyzhov, Alexander
Chaitanya, Shiva
Joshi, Preetam
author_facet Kulkarni, Shubham
Lyzhov, Alexander
Chaitanya, Shiva
Joshi, Preetam
contents Conversational AI is starting to support real clinical work, but most evaluation methods miss how compliance depends on the full course of a conversation. We introduce Obligatory-Information Phase Structured Compliance Evaluation (OIP-SCE), an evaluation method that checks whether every required clinical obligation is met, in the right order, with clear evidence for clinicians to review. This makes complex rules practical and auditable, helping close the gap between technical progress and what healthcare actually needs. We demonstrate the method in two case studies (respiratory history, benefits verification) and show how phase-level evidence turns policy into shared, actionable steps. By giving clinicians control over what to check and engineers a clear specification to implement, OIP-SCE provides a single, auditable evaluation surface that aligns AI capability with clinical workflow and supports routine, safe use.
format Preprint
id arxiv_https___arxiv_org_abs_2601_08690
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle All Required, In Order: Phase-Level Evaluation for AI-Human Dialogue in Healthcare and Beyond
Kulkarni, Shubham
Lyzhov, Alexander
Chaitanya, Shiva
Joshi, Preetam
Artificial Intelligence
Conversational AI is starting to support real clinical work, but most evaluation methods miss how compliance depends on the full course of a conversation. We introduce Obligatory-Information Phase Structured Compliance Evaluation (OIP-SCE), an evaluation method that checks whether every required clinical obligation is met, in the right order, with clear evidence for clinicians to review. This makes complex rules practical and auditable, helping close the gap between technical progress and what healthcare actually needs. We demonstrate the method in two case studies (respiratory history, benefits verification) and show how phase-level evidence turns policy into shared, actionable steps. By giving clinicians control over what to check and engineers a clear specification to implement, OIP-SCE provides a single, auditable evaluation surface that aligns AI capability with clinical workflow and supports routine, safe use.
title All Required, In Order: Phase-Level Evaluation for AI-Human Dialogue in Healthcare and Beyond
topic Artificial Intelligence
url https://arxiv.org/abs/2601.08690