Enregistré dans:
Détails bibliographiques
Auteurs principaux: Reuter, Arik, Khadka, Bishnu, Thielmann, Anton, Weisser, Christoph, Fischer, Sebastian, Säfken, Benjamin
Format: Preprint
Publié: 2024
Sujets:
Accès en ligne:https://arxiv.org/abs/2403.03628
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
_version_ 1866915627523899392
author Reuter, Arik
Khadka, Bishnu
Thielmann, Anton
Weisser, Christoph
Fischer, Sebastian
Säfken, Benjamin
author_facet Reuter, Arik
Khadka, Bishnu
Thielmann, Anton
Weisser, Christoph
Fischer, Sebastian
Säfken, Benjamin
contents Topic modeling seems to be almost synonymous with generating lists of top words to represent topics within large text corpora. However, deducing a topic from such list of individual terms can require substantial expertise and experience, making topic modelling less accessible to people unfamiliar with the particularities and pitfalls of top-word interpretation. A topic representation limited to top-words might further fall short of offering a comprehensive and easily accessible characterization of the various aspects, facets and nuances a topic might have. To address these challenges, we introduce GPTopic, a software package that leverages Large Language Models (LLMs) to create dynamic, interactive topic representations. GPTopic provides an intuitive chat interface for users to explore, analyze, and refine topics interactively, making topic modeling more accessible and comprehensive. The corresponding code is available here: https://github.com/ArikReuter/TopicGPT.
format Preprint
id arxiv_https___arxiv_org_abs_2403_03628
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle GPTopic: Dynamic and Interactive Topic Representations
Reuter, Arik
Khadka, Bishnu
Thielmann, Anton
Weisser, Christoph
Fischer, Sebastian
Säfken, Benjamin
Computation and Language
Topic modeling seems to be almost synonymous with generating lists of top words to represent topics within large text corpora. However, deducing a topic from such list of individual terms can require substantial expertise and experience, making topic modelling less accessible to people unfamiliar with the particularities and pitfalls of top-word interpretation. A topic representation limited to top-words might further fall short of offering a comprehensive and easily accessible characterization of the various aspects, facets and nuances a topic might have. To address these challenges, we introduce GPTopic, a software package that leverages Large Language Models (LLMs) to create dynamic, interactive topic representations. GPTopic provides an intuitive chat interface for users to explore, analyze, and refine topics interactively, making topic modeling more accessible and comprehensive. The corresponding code is available here: https://github.com/ArikReuter/TopicGPT.
title GPTopic: Dynamic and Interactive Topic Representations
topic Computation and Language
url https://arxiv.org/abs/2403.03628