Saved in:
Bibliographic Details
Main Authors: Cao, Zhiyong, Liu, Dunqiang, Dai, Qi, Xu, Haojun, Xu, Huaiyan, He, Huan, Liu, Yafei, Liu, Siyuan, Lin, XiaoLin, Ma, Ke, Shi, Ruqian, Yao, Sijia, Wang, Hao, Zhou, Sicheng
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.02871
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Task-oriented proactive dialogue agents play a pivotal role in recruitment, particularly for steering conversations towards specific business outcomes, such as acquiring social-media contacts for private-channel conversion. Although supervised fine-tuning and reinforcement learning have proven effective for training such agents, their performance is heavily constrained by the scarcity of high-quality, goal-oriented domain-specific training data. To address this challenge, we propose SimRPD, a three-stage framework for training recruitment proactive dialogue agents. First, we develop a high-fidelity user simulator to synthesize large-scale conversational data through multi-turn online dialogue. Then we introduce a multi-dimensional evaluation framework based on Chain-of-Intention (CoI) to comprehensively assess the simulator and effectively select high-quality data, incorporating both global-level and instance-level metrics. Finally, we train the recruitment proactive dialogue agent on the selected dataset. Experiments in a real-world recruitment scenario demonstrate that SimRPD outperforms existing simulator-based data selection strategies, highlighting its practical value for industrial deployment and its potential applicability to other business-oriented dialogue scenarios.