Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Sanchez-Karhunen, Eduardo, Quesada-Moreno, Jose F., Gutiérrez-Naranjo, Miguel A.
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Computation and Language
Online Access:	https://arxiv.org/abs/2408.02838
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866909280473448448
author	Sanchez-Karhunen, Eduardo Quesada-Moreno, Jose F. Gutiérrez-Naranjo, Miguel A.
author_facet	Sanchez-Karhunen, Eduardo Quesada-Moreno, Jose F. Gutiérrez-Naranjo, Miguel A.
contents	Intent detection is a text classification task whose aim is to recognize and label the semantics behind a users query. It plays a critical role in various business applications. The output of the intent detection module strongly conditions the behavior of the whole system. This sequence analysis task is mainly tackled using deep learning techniques. Despite the widespread use of these techniques, the internal mechanisms used by networks to solve the problem are poorly understood. Recent lines of work have analyzed the computational mechanisms learned by RNNs from a dynamical systems perspective. In this work, we investigate how different RNN architectures solve the SNIPS intent detection problem. Sentences injected into trained networks can be interpreted as trajectories traversing a hidden state space. This space is constrained to a low-dimensional manifold whose dimensionality is related to the embedding and hidden layer sizes. To generate predictions, RNN steers the trajectories towards concrete regions, spatially aligned with the output layer matrix rows directions. Underlying the system dynamics, an unexpected fixed point topology has been identified with a limited number of attractors. Our results provide new insights into the inner workings of networks that solve the intent detection task.
format	Preprint
id	arxiv_https___arxiv_org_abs_2408_02838
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Interpretation of the Intent Detection Problem as Dynamics in a Low-dimensional Space Sanchez-Karhunen, Eduardo Quesada-Moreno, Jose F. Gutiérrez-Naranjo, Miguel A. Machine Learning Computation and Language Intent detection is a text classification task whose aim is to recognize and label the semantics behind a users query. It plays a critical role in various business applications. The output of the intent detection module strongly conditions the behavior of the whole system. This sequence analysis task is mainly tackled using deep learning techniques. Despite the widespread use of these techniques, the internal mechanisms used by networks to solve the problem are poorly understood. Recent lines of work have analyzed the computational mechanisms learned by RNNs from a dynamical systems perspective. In this work, we investigate how different RNN architectures solve the SNIPS intent detection problem. Sentences injected into trained networks can be interpreted as trajectories traversing a hidden state space. This space is constrained to a low-dimensional manifold whose dimensionality is related to the embedding and hidden layer sizes. To generate predictions, RNN steers the trajectories towards concrete regions, spatially aligned with the output layer matrix rows directions. Underlying the system dynamics, an unexpected fixed point topology has been identified with a limited number of attractors. Our results provide new insights into the inner workings of networks that solve the intent detection task.
title	Interpretation of the Intent Detection Problem as Dynamics in a Low-dimensional Space
topic	Machine Learning Computation and Language
url	https://arxiv.org/abs/2408.02838

Similar Items