Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Xu, Chen, Lyu, Zhenyu, Lan, Tian, Yang, Yi, Ji, Yu, Ji, Luyao, Shen, Jian, Wang, Zhihua, Cui, Leyang, Zhang, Jieshuo, Dong, Qunxi, Yang, Minqiang, Wang, Juan, Liu, Xiuling, Hu, Bin
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2508.09042
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917478564626432
author	Xu, Chen Lyu, Zhenyu Lan, Tian Yang, Yi Ji, Yu Ji, Luyao Shen, Jian Wang, Zhihua Cui, Leyang Zhang, Jieshuo Dong, Qunxi Yang, Minqiang Wang, Juan Liu, Xiuling Hu, Bin
author_facet	Xu, Chen Lyu, Zhenyu Lan, Tian Yang, Yi Ji, Yu Ji, Luyao Shen, Jian Wang, Zhihua Cui, Leyang Zhang, Jieshuo Dong, Qunxi Yang, Minqiang Wang, Juan Liu, Xiuling Hu, Bin
contents	The most dangerous mistakes a novice counselor makes are not the obvious ones: they are utterances that sound caring while quietly violating professional ethics and leaving vulnerable clients less protected. We build an AI supervisor that does not replace novice counselors, but grows them-teaching them to internalize ethical violations they would otherwise never notice. What makes this supervisor non-trivial is not detection but teaching: it must locate the ethical-violating utterance, diagnose the ethical violation against APA principles, and deliver feedback that explains not just what went wrong, but why it is risky and how to respond differently. The core obstacle is that (1) ethical violations are by nature unlabeled in real clinical data, and (2) existing AI counselors trained only to match correct answers will never learn to teach. We resolve both at once: a controllable AI novice that intentionally enacts predefined mistake categories makes supervision labels a natural byproduct of generation, yielding ETHICSCAFF, a 9,915-instance human-in-the-loop dataset; and GRPO under a Novice Growth Reward (NGR) optimizes the supervisor not for answer correctness but for whether a weaker novice model actually improves after reading its explanation. Experiments show that a novice guided by our supervisor outperforms an unguided peer on clinical metrics, and that teaching-oriented optimization via NGR further sharpens the supervisor's own ethical detection. In a user study with novice counseling-psychology students, participants show significant self-efficacy gains across all eight assessed competencies after receiving AI supervisory feedback, demonstrating that the scaffold transfers from simulation to real-world practice.
format	Preprint
id	arxiv_https___arxiv_org_abs_2508_09042
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	First, Do No Harm: AI Supervisor Scaffolds Novice Growth in Counselor Education Xu, Chen Lyu, Zhenyu Lan, Tian Yang, Yi Ji, Yu Ji, Luyao Shen, Jian Wang, Zhihua Cui, Leyang Zhang, Jieshuo Dong, Qunxi Yang, Minqiang Wang, Juan Liu, Xiuling Hu, Bin Computation and Language The most dangerous mistakes a novice counselor makes are not the obvious ones: they are utterances that sound caring while quietly violating professional ethics and leaving vulnerable clients less protected. We build an AI supervisor that does not replace novice counselors, but grows them-teaching them to internalize ethical violations they would otherwise never notice. What makes this supervisor non-trivial is not detection but teaching: it must locate the ethical-violating utterance, diagnose the ethical violation against APA principles, and deliver feedback that explains not just what went wrong, but why it is risky and how to respond differently. The core obstacle is that (1) ethical violations are by nature unlabeled in real clinical data, and (2) existing AI counselors trained only to match correct answers will never learn to teach. We resolve both at once: a controllable AI novice that intentionally enacts predefined mistake categories makes supervision labels a natural byproduct of generation, yielding ETHICSCAFF, a 9,915-instance human-in-the-loop dataset; and GRPO under a Novice Growth Reward (NGR) optimizes the supervisor not for answer correctness but for whether a weaker novice model actually improves after reading its explanation. Experiments show that a novice guided by our supervisor outperforms an unguided peer on clinical metrics, and that teaching-oriented optimization via NGR further sharpens the supervisor's own ethical detection. In a user study with novice counseling-psychology students, participants show significant self-efficacy gains across all eight assessed competencies after receiving AI supervisory feedback, demonstrating that the scaffold transfers from simulation to real-world practice.
title	First, Do No Harm: AI Supervisor Scaffolds Novice Growth in Counselor Education
topic	Computation and Language
url	https://arxiv.org/abs/2508.09042

Similar Items