Bibliographic Details
Main Authors: Bartnicki, Krzysztof, Drożdż, Stanisław, Kwapień, Jarosław, Stanisz, Tomasz
Format: Preprint
Published: 2025
Subjects: Computation and Language
Online Access: https://arxiv.org/abs/2501.12954
author Bartnicki, Krzysztof
Drożdż, Stanisław
Kwapień, Jarosław
Stanisz, Tomasz
contents The complexity characteristics of texts written in natural languages are significantly related to the rules of punctuation. In particular, the distances between consecutive punctuation marks, measured in words, quite universally follow the family of Weibull distributions known from survival analysis. However, the values of the two parameters that specify the form of these distributions distinguish individual languages. This constraint is so strong that the punctuation distributions of translated texts adopt the quantitative characteristics of the target language. All these changes take place within Weibull distributions whose hazard functions are always increasing. Recent research shows that James Joyce's famous "Finnegans Wake" follows such an extreme member of the Weibull family that the corresponding hazard function is clearly decreasing. At the same time, the distances between sentence-ending punctuation marks, which determine the variability of sentence length, have an almost perfect multifractal organization, to an extent so far found nowhere else in the literature. The present contribution, based on several available translations of "Finnegans Wake" (Dutch, French, German, Polish, Russian), shows that the punctuation characteristics of this work remain largely translation-invariant, contrary to the common case. These observations may constitute further evidence that "Finnegans Wake" is a translinguistic work in this respect as well, in line with Joyce's original intention.
format Preprint
id arxiv_https___arxiv_org_abs_2501_12954
institution arXiv
publishDate 2025
record_format arxiv
title Punctuation patterns in "Finnegans Wake" by James Joyce are largely translation-invariant
topic Computation and Language
url https://arxiv.org/abs/2501.12954
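
The abstract's contrast between increasing and decreasing hazard functions can be illustrated with a minimal Python sketch. It is not taken from the paper: it assumes the common discrete Weibull parametrization with survival function (1 - p)^(k^beta), and the function names, the tokenizing regular expression, and the set of punctuation marks are hypothetical choices made here for illustration. Under that assumption, beta > 1 gives an increasing hazard, beta = 1 a constant one, and beta < 1 a decreasing one.

```python
import re

# Assumed set of punctuation marks; the paper may use a different set.
MARKS = ".,;:!?"

def punctuation_distances(text):
    """Return the number of words between consecutive punctuation marks."""
    # A word is a run of word characters, optionally joined by ' or -.
    tokens = re.findall(r"\w+(?:['’-]\w+)*|[.,;:!?]", text)
    distances, count = [], 0
    for token in tokens:
        if token in MARKS:
            if count > 0:          # skip marks that directly follow a mark
                distances.append(count)
            count = 0
        else:
            count += 1
    return distances

def discrete_weibull_hazard(k, p, beta):
    """Hazard h(k) = P(X = k | X >= k) for survival S(k) = (1 - p)**(k**beta).

    This parametrization is an assumption of the sketch, not a quotation
    from the paper.
    """
    return 1.0 - (1.0 - p) ** (k ** beta - (k - 1) ** beta)

if __name__ == "__main__":
    sample = "Well, you know or don't you kennet? Yes."
    print(punctuation_distances(sample))   # [1, 6, 1]
    for beta in (1.5, 1.0, 0.7):
        hazards = [discrete_weibull_hazard(k, 0.2, beta) for k in range(1, 6)]
        print(beta, [round(h, 3) for h in hazards])
```

In this reading, the hazard is the probability that a punctuation mark follows the k-th word given that none has appeared since the previous mark, so a decreasing hazard means that the longer a stretch has gone without punctuation, the less likely it is to end at the next word.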