Saved in:
Bibliographic Details
Main Author: Wong, Sidney
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2604.15744
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913040816930816
author Wong, Sidney
author_facet Wong, Sidney
contents This thesis investigates geographic dialect alignment in place-informed social media communities, focussing on New Zealand-related Reddit communities. By integrating qualitative analyses of user perceptions with computational methods, the study examines how language use reflects place identity and patterns of language variation and change based on user-informed lexical, morphosyntactic, and semantic variables. The findings show that users generally associate language with place, and place-related communities form a contiguous speech community, though alignment between geographic dialect communities and place-related communities remains complex. Advanced language modelling, including static and diachronic Word2Vec language embeddings, revealed semantic variation across place-based communities and meaningful semantic shifts within New Zealand English. The research involved the creation of a corpus containing 4.26 billion unprocessed words, which offers a valuable resource for future study. Overall, the results highlight the potential of social media as a natural laboratory for sociolinguistic inquiry.
format Preprint
id arxiv_https___arxiv_org_abs_2604_15744
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Language, Place, and Social Media: Geographic Dialect Alignment in New Zealand
Wong, Sidney
Computation and Language
This thesis investigates geographic dialect alignment in place-informed social media communities, focussing on New Zealand-related Reddit communities. By integrating qualitative analyses of user perceptions with computational methods, the study examines how language use reflects place identity and patterns of language variation and change based on user-informed lexical, morphosyntactic, and semantic variables. The findings show that users generally associate language with place, and place-related communities form a contiguous speech community, though alignment between geographic dialect communities and place-related communities remains complex. Advanced language modelling, including static and diachronic Word2Vec language embeddings, revealed semantic variation across place-based communities and meaningful semantic shifts within New Zealand English. The research involved the creation of a corpus containing 4.26 billion unprocessed words, which offers a valuable resource for future study. Overall, the results highlight the potential of social media as a natural laboratory for sociolinguistic inquiry.
title Language, Place, and Social Media: Geographic Dialect Alignment in New Zealand
topic Computation and Language
url https://arxiv.org/abs/2604.15744