Saved in:
Bibliographic Details
Main Authors: Mehndiratta, Akanksha, Asawa, Krishna
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2504.09073
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909576730771456
author Mehndiratta, Akanksha
Asawa, Krishna
author_facet Mehndiratta, Akanksha
Asawa, Krishna
contents Multiturn dialogue models aim to generate human-like responses by leveraging conversational context, consisting of utterances from previous exchanges. Existing methods often neglect the interactions between these utterances or treat all of them as equally significant. This paper introduces a discourse-aware framework for response selection in retrieval-based dialogue systems. The proposed model first encodes each utterance and response with contextual, positional, and syntactic features using Multi-view Canonical Correlation Analysis (MCCA). It then learns discourse tokens that capture relationships between an utterance and its surrounding turns in a shared subspace via Canonical Correlation Analysis (CCA). This two-step approach effectively integrates semantic and syntactic features to build discourse-level understanding. Experiments on the Ubuntu Dialogue Corpus demonstrate that our model achieves significant improvements in automatic evaluation metrics, highlighting its effectiveness in response selection.
format Preprint
id arxiv_https___arxiv_org_abs_2504_09073
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle A Multi-view Discourse Framework for Integrating Semantic and Syntactic Features in Dialog Agents
Mehndiratta, Akanksha
Asawa, Krishna
Computation and Language
Multiturn dialogue models aim to generate human-like responses by leveraging conversational context, consisting of utterances from previous exchanges. Existing methods often neglect the interactions between these utterances or treat all of them as equally significant. This paper introduces a discourse-aware framework for response selection in retrieval-based dialogue systems. The proposed model first encodes each utterance and response with contextual, positional, and syntactic features using Multi-view Canonical Correlation Analysis (MCCA). It then learns discourse tokens that capture relationships between an utterance and its surrounding turns in a shared subspace via Canonical Correlation Analysis (CCA). This two-step approach effectively integrates semantic and syntactic features to build discourse-level understanding. Experiments on the Ubuntu Dialogue Corpus demonstrate that our model achieves significant improvements in automatic evaluation metrics, highlighting its effectiveness in response selection.
title A Multi-view Discourse Framework for Integrating Semantic and Syntactic Features in Dialog Agents
topic Computation and Language
url https://arxiv.org/abs/2504.09073