Saved in:
Bibliographic Details
Main Authors: Liu, Yifan, Tilahun, Gelila, Gao, Xinxiang, Wen, Qianfeng, Gervers, Michael
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2410.09283
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866929538549678080
author Liu, Yifan
Tilahun, Gelila
Gao, Xinxiang
Wen, Qianfeng
Gervers, Michael
author_facet Liu, Yifan
Tilahun, Gelila
Gao, Xinxiang
Wen, Qianfeng
Gervers, Michael
contents The Norman Conquest of 1066 C.E. brought profound transformations to England's administrative, societal, and linguistic practices. The DEEDS (Documents of Early England Data Set) database offers a unique opportunity to explore these changes by examining shifts in word meanings within a vast collection of Medieval Latin charters. While computational linguistics typically relies on vector representations of words like static and contextual embeddings to analyze semantic changes, existing embeddings for scarce and historical Medieval Latin are limited and may not be well-suited for this task. This paper presents the first computational analysis of semantic change pre- and post-Norman Conquest and the first systematic comparison of static and contextual embeddings in a scarce historical data set. Our findings confirm that, consistent with existing studies, contextual embeddings outperform static word embeddings in capturing semantic change within a scarce historical corpus.
format Preprint
id arxiv_https___arxiv_org_abs_2410_09283
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Comparative Analysis of Static and Contextual Embeddings for Analyzing Semantic Changes in Medieval Latin Charters
Liu, Yifan
Tilahun, Gelila
Gao, Xinxiang
Wen, Qianfeng
Gervers, Michael
Computation and Language
The Norman Conquest of 1066 C.E. brought profound transformations to England's administrative, societal, and linguistic practices. The DEEDS (Documents of Early England Data Set) database offers a unique opportunity to explore these changes by examining shifts in word meanings within a vast collection of Medieval Latin charters. While computational linguistics typically relies on vector representations of words like static and contextual embeddings to analyze semantic changes, existing embeddings for scarce and historical Medieval Latin are limited and may not be well-suited for this task. This paper presents the first computational analysis of semantic change pre- and post-Norman Conquest and the first systematic comparison of static and contextual embeddings in a scarce historical data set. Our findings confirm that, consistent with existing studies, contextual embeddings outperform static word embeddings in capturing semantic change within a scarce historical corpus.
title Comparative Analysis of Static and Contextual Embeddings for Analyzing Semantic Changes in Medieval Latin Charters
topic Computation and Language
url https://arxiv.org/abs/2410.09283