Saved in:
Bibliographic Details
Main Authors: Zhan, Baohua, Huang, Yongyi, Cui, Wenyao, Zhang, Huaping, Shang, Jianyun
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.08545
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866929537829306368
author Zhan, Baohua
Huang, Yongyi
Cui, Wenyao
Zhang, Huaping
Shang, Jianyun
author_facet Zhan, Baohua
Huang, Yongyi
Cui, Wenyao
Zhang, Huaping
Shang, Jianyun
contents Questionnaires are a common method for detecting the personality of Large Language Models (LLMs). However, their reliability is often compromised by two main issues: hallucinations (where LLMs produce inaccurate or irrelevant responses) and the sensitivity of responses to the order of the presented options. To address these issues, we propose combining text mining with questionnaires method. Text mining can extract psychological features from the LLMs' responses without being affected by the order of options. Furthermore, because this method does not rely on specific answers, it reduces the influence of hallucinations. By normalizing the scores from both methods and calculating the root mean square error, our experiment results confirm the effectiveness of this approach. To further investigate the origins of personality traits in LLMs, we conduct experiments on both pre-trained language models (PLMs), such as BERT and GPT, as well as conversational models (ChatLLMs), such as ChatGPT. The results show that LLMs do contain certain personalities, for example, ChatGPT and ChatGLM exhibit the personality traits of 'Conscientiousness'. Additionally, we find that the personalities of LLMs are derived from their pre-trained data. The instruction data used to train ChatLLMs can enhance the generation of data containing personalities and expose their hidden personality. We compare the results with the human average personality score, and we find that the personality of FLAN-T5 in PLMs and ChatGPT in ChatLLMs is more similar to that of a human, with score differences of 0.34 and 0.22, respectively.
format Preprint
id arxiv_https___arxiv_org_abs_2410_08545
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Humanity in AI: Detecting the Personality of Large Language Models
Zhan, Baohua
Huang, Yongyi
Cui, Wenyao
Zhang, Huaping
Shang, Jianyun
Computation and Language
Artificial Intelligence
Questionnaires are a common method for detecting the personality of Large Language Models (LLMs). However, their reliability is often compromised by two main issues: hallucinations (where LLMs produce inaccurate or irrelevant responses) and the sensitivity of responses to the order of the presented options. To address these issues, we propose combining text mining with questionnaires method. Text mining can extract psychological features from the LLMs' responses without being affected by the order of options. Furthermore, because this method does not rely on specific answers, it reduces the influence of hallucinations. By normalizing the scores from both methods and calculating the root mean square error, our experiment results confirm the effectiveness of this approach. To further investigate the origins of personality traits in LLMs, we conduct experiments on both pre-trained language models (PLMs), such as BERT and GPT, as well as conversational models (ChatLLMs), such as ChatGPT. The results show that LLMs do contain certain personalities, for example, ChatGPT and ChatGLM exhibit the personality traits of 'Conscientiousness'. Additionally, we find that the personalities of LLMs are derived from their pre-trained data. The instruction data used to train ChatLLMs can enhance the generation of data containing personalities and expose their hidden personality. We compare the results with the human average personality score, and we find that the personality of FLAN-T5 in PLMs and ChatGPT in ChatLLMs is more similar to that of a human, with score differences of 0.34 and 0.22, respectively.
title Humanity in AI: Detecting the Personality of Large Language Models
topic Computation and Language
Artificial Intelligence
url https://arxiv.org/abs/2410.08545