Saved in:
Bibliographic Details
Main Authors: Kong, InJin, Kang, Shinyee, Park, Yuna, Kim, Sooyong, Park, Sanghyun
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.08668
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909490382635008
author Kong, InJin
Kang, Shinyee
Park, Yuna
Kim, Sooyong
Park, Sanghyun
author_facet Kong, InJin
Kang, Shinyee
Park, Yuna
Kim, Sooyong
Park, Sanghyun
contents This study investigates the stylistic differences among various Bible translations using a Variational Autoencoder (VAE) model. By embedding textual data into high-dimensional vectors, the study aims to detect and analyze stylistic variations between translations, with a specific focus on distinguishing the American Standard Version (ASV) from other translations. The results demonstrate that each translation exhibits a unique stylistic distribution, which can be effectively identified using the VAE model. These findings suggest that the VAE model is proficient in capturing and differentiating textual styles, although it is primarily optimized for distinguishing a single style. The study highlights the model's potential for broader applications in AI-based text generation and stylistic analysis, while also acknowledging the need for further model refinement to address the complexity of multi-dimensional stylistic relationships. Future research could extend this methodology to other text domains, offering deeper insights into the stylistic features embedded within various types of textual data.
format Preprint
id arxiv_https___arxiv_org_abs_2502_08668
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Style Extraction on Text Embeddings Using VAE and Parallel Dataset
Kong, InJin
Kang, Shinyee
Park, Yuna
Kim, Sooyong
Park, Sanghyun
Computation and Language
This study investigates the stylistic differences among various Bible translations using a Variational Autoencoder (VAE) model. By embedding textual data into high-dimensional vectors, the study aims to detect and analyze stylistic variations between translations, with a specific focus on distinguishing the American Standard Version (ASV) from other translations. The results demonstrate that each translation exhibits a unique stylistic distribution, which can be effectively identified using the VAE model. These findings suggest that the VAE model is proficient in capturing and differentiating textual styles, although it is primarily optimized for distinguishing a single style. The study highlights the model's potential for broader applications in AI-based text generation and stylistic analysis, while also acknowledging the need for further model refinement to address the complexity of multi-dimensional stylistic relationships. Future research could extend this methodology to other text domains, offering deeper insights into the stylistic features embedded within various types of textual data.
title Style Extraction on Text Embeddings Using VAE and Parallel Dataset
topic Computation and Language
url https://arxiv.org/abs/2502.08668