Saved in:
Bibliographic Details
Main Authors: Le, Triet M., Chandra, Arjun, Rytting, C. Anton, Karuzis, Valerie P., Rife, Vladimir, Simpson, William A.
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2511.09426
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911261971709952
author Le, Triet M.
Chandra, Arjun
Rytting, C. Anton
Karuzis, Valerie P.
Rife, Vladimir
Simpson, William A.
author_facet Le, Triet M.
Chandra, Arjun
Rytting, C. Anton
Karuzis, Valerie P.
Rife, Vladimir
Simpson, William A.
contents Predicting an individual's personalities from their generated texts is a challenging task, especially when the text volume is large. In this paper, we introduce a straightforward yet effective novel strategy called targeted preselection of texts (TPoT). This method semantically filters the texts as input to a deep learning model, specifically designed to predict a Big Five personality trait, facet, or item, referred to as the BIG5-TPoT model. By selecting texts that are semantically relevant to a particular trait, facet, or item, this strategy not only addresses the issue of input text limits in large language models but also improves the Mean Absolute Error and accuracy metrics in predictions for the Stream of Consciousness Essays dataset.
format Preprint
id arxiv_https___arxiv_org_abs_2511_09426
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle BIG5-TPoT: Predicting BIG Five Personality Traits, Facets, and Items Through Targeted Preselection of Texts
Le, Triet M.
Chandra, Arjun
Rytting, C. Anton
Karuzis, Valerie P.
Rife, Vladimir
Simpson, William A.
Computation and Language
Machine Learning
Predicting an individual's personalities from their generated texts is a challenging task, especially when the text volume is large. In this paper, we introduce a straightforward yet effective novel strategy called targeted preselection of texts (TPoT). This method semantically filters the texts as input to a deep learning model, specifically designed to predict a Big Five personality trait, facet, or item, referred to as the BIG5-TPoT model. By selecting texts that are semantically relevant to a particular trait, facet, or item, this strategy not only addresses the issue of input text limits in large language models but also improves the Mean Absolute Error and accuracy metrics in predictions for the Stream of Consciousness Essays dataset.
title BIG5-TPoT: Predicting BIG Five Personality Traits, Facets, and Items Through Targeted Preselection of Texts
topic Computation and Language
Machine Learning
url https://arxiv.org/abs/2511.09426