Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Iftikhar, Zainab, Ransom, Sean, Xiao, Amy, Nugent, Nicole, Huang, Jeff
Format:	Preprint
Published:	2024
Subjects:	Human-Computer Interaction Computation and Language I.2.7; J.4
Online Access:	https://arxiv.org/abs/2409.02244
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866913910634840064
author	Iftikhar, Zainab Ransom, Sean Xiao, Amy Nugent, Nicole Huang, Jeff
author_facet	Iftikhar, Zainab Ransom, Sean Xiao, Amy Nugent, Nicole Huang, Jeff
contents	Large language models (LLMs) are being used as ad-hoc therapists. Research suggests that LLMs outperform human counselors when generating a single, isolated empathetic response; however, their session-level behavior remains understudied. In this study, we compare the session-level behaviors of human counselors with those of an LLM prompted by a team of peer counselors to deliver single-session Cognitive Behavioral Therapy (CBT). Our three-stage, mixed-methods study involved: a) a year-long ethnography of a text-based support platform where seven counselors iteratively refined CBT prompts through self-counseling and weekly focus groups; b) the manual simulation of human counselor sessions with a CBT-prompted LLM, given the full patient dialogue and contextual notes; and c) session evaluations of both human and LLM sessions by three licensed clinical psychologists using CBT competence measures. Our results show a clear trade-off. Human counselors excel at relational strategies -- small talk, self-disclosure, and culturally situated language -- that lead to higher empathy, collaboration, and deeper user reflection. LLM counselors demonstrate higher procedural adherence to CBT techniques but struggle to sustain collaboration, misread cultural cues, and sometimes produce "deceptive empathy," i.e., formulaic warmth that can inflate users' expectations of genuine human care. Taken together, our findings imply that while LLMs might outperform counselors in generating single empathetic responses, their ability to lead sessions is more limited, highlighting that therapy cannot be reduced to a standalone natural language processing (NLP) task. We call for carefully designed human-AI workflows in scalable support: LLMs can scaffold evidence-based techniques, while peers provide relational support. We conclude by mapping concrete design opportunities and ethical guardrails for such hybrid systems.
format	Preprint
id	arxiv_https___arxiv_org_abs_2409_02244
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Therapy as an NLP Task: Psychologists' Comparison of LLMs and Human Peers in CBT Iftikhar, Zainab Ransom, Sean Xiao, Amy Nugent, Nicole Huang, Jeff Human-Computer Interaction Computation and Language I.2.7; J.4 Large language models (LLMs) are being used as ad-hoc therapists. Research suggests that LLMs outperform human counselors when generating a single, isolated empathetic response; however, their session-level behavior remains understudied. In this study, we compare the session-level behaviors of human counselors with those of an LLM prompted by a team of peer counselors to deliver single-session Cognitive Behavioral Therapy (CBT). Our three-stage, mixed-methods study involved: a) a year-long ethnography of a text-based support platform where seven counselors iteratively refined CBT prompts through self-counseling and weekly focus groups; b) the manual simulation of human counselor sessions with a CBT-prompted LLM, given the full patient dialogue and contextual notes; and c) session evaluations of both human and LLM sessions by three licensed clinical psychologists using CBT competence measures. Our results show a clear trade-off. Human counselors excel at relational strategies -- small talk, self-disclosure, and culturally situated language -- that lead to higher empathy, collaboration, and deeper user reflection. LLM counselors demonstrate higher procedural adherence to CBT techniques but struggle to sustain collaboration, misread cultural cues, and sometimes produce "deceptive empathy," i.e., formulaic warmth that can inflate users' expectations of genuine human care. Taken together, our findings imply that while LLMs might outperform counselors in generating single empathetic responses, their ability to lead sessions is more limited, highlighting that therapy cannot be reduced to a standalone natural language processing (NLP) task. We call for carefully designed human-AI workflows in scalable support: LLMs can scaffold evidence-based techniques, while peers provide relational support. We conclude by mapping concrete design opportunities and ethical guardrails for such hybrid systems.
title	Therapy as an NLP Task: Psychologists' Comparison of LLMs and Human Peers in CBT
topic	Human-Computer Interaction Computation and Language I.2.7; J.4
url	https://arxiv.org/abs/2409.02244

Similar Items