Saved in:
Bibliographic Details
Main Authors: Fukuma, Tomoki, Noda, Koki, Hoso, Toshihide Ubukata Kousuke, Ichikawa, Yoshiharu, Kambe, Kyosuke, Masubuch, Yu, Toriumi, Fujio
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2404.18371
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916227727753216
author Fukuma, Tomoki
Noda, Koki
Hoso, Toshihide Ubukata Kousuke
Ichikawa, Yoshiharu
Kambe, Kyosuke
Masubuch, Yu
Toriumi, Fujio
author_facet Fukuma, Tomoki
Noda, Koki
Hoso, Toshihide Ubukata Kousuke
Ichikawa, Yoshiharu
Kambe, Kyosuke
Masubuch, Yu
Toriumi, Fujio
contents The proliferation of social media has led to information overload and increased interest in opinion mining. We propose "Question-Answering Network Analysis" (QANA), a novel opinion mining framework that utilizes Large Language Models (LLMs) to generate questions from users' comments, constructs a bipartite graph based on the comments' answerability to the questions, and applies centrality measures to examine the importance of opinions. We investigate the impact of question generation styles, LLM selections, and the choice of embedding model on the quality of the constructed QA networks by comparing them with annotated Key Point Analysis datasets. QANA achieves comparable performance to previous state-of-the-art supervised models in a zero-shot manner for Key Point Matching task, also reducing the computational cost from quadratic to linear. For Key Point Generation, questions with high PageRank or degree centrality align well with manually annotated key points. Notably, QANA enables analysts to assess the importance of key points from various aspects according to their selection of centrality measure. QANA's primary contribution lies in its flexibility to extract key points from a wide range of perspectives, which enhances the quality and impartiality of opinion mining.
format Preprint
id arxiv_https___arxiv_org_abs_2404_18371
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle QANA: LLM-based Question Generation and Network Analysis for Zero-shot Key Point Analysis and Beyond
Fukuma, Tomoki
Noda, Koki
Hoso, Toshihide Ubukata Kousuke
Ichikawa, Yoshiharu
Kambe, Kyosuke
Masubuch, Yu
Toriumi, Fujio
Computation and Language
The proliferation of social media has led to information overload and increased interest in opinion mining. We propose "Question-Answering Network Analysis" (QANA), a novel opinion mining framework that utilizes Large Language Models (LLMs) to generate questions from users' comments, constructs a bipartite graph based on the comments' answerability to the questions, and applies centrality measures to examine the importance of opinions. We investigate the impact of question generation styles, LLM selections, and the choice of embedding model on the quality of the constructed QA networks by comparing them with annotated Key Point Analysis datasets. QANA achieves comparable performance to previous state-of-the-art supervised models in a zero-shot manner for Key Point Matching task, also reducing the computational cost from quadratic to linear. For Key Point Generation, questions with high PageRank or degree centrality align well with manually annotated key points. Notably, QANA enables analysts to assess the importance of key points from various aspects according to their selection of centrality measure. QANA's primary contribution lies in its flexibility to extract key points from a wide range of perspectives, which enhances the quality and impartiality of opinion mining.
title QANA: LLM-based Question Generation and Network Analysis for Zero-shot Key Point Analysis and Beyond
topic Computation and Language
url https://arxiv.org/abs/2404.18371