:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Orekhov, Boris
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2407.08099
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Does Burrows' Delta really confirm that Rowling and Galbraith are the same author?
by: Orekhov, Boris
Published: (2024)

You shall know a piece by the company it keeps. Chess plays as a data for word2vec models
by: Orekhov, Boris
Published: (2024)

Is text normalization relevant for classifying medieval charters?
by: Atzenhofer-Baumgartner, Florian, et al.
Published: (2024)

Metronome: tracing variation in poetic meters via local sequence alignment
by: Nagy, Ben, et al.
Published: (2024)

Markov reads Pushkin, again: A statistical journey into the poetic world of Evgenij Onegin
by: Sabatini, Angelo Maria
Published: (2026)

Why mask diffusion does not work
by: Sun, Haocheng, et al.
Published: (2025)

How do we measure privacy in text? A survey of text anonymization metrics
by: Ren, Yaxuan, et al.
Published: (2025)

Advancing Chinese biomedical text mining with community challenges
by: Zong, Hui, et al.
Published: (2024)

How and where does CLIP process negation?
by: Quantmeyer, Vincent, et al.
Published: (2024)

Translating scientific Latin texts with artificial intelligence: the works of Euler and contemporaries
by: Bistafa, Sylvio R.
Published: (2023)

How does a Language-Specific Tokenizer affect LLMs?
by: Seo, Jean, et al.
Published: (2025)

Thematic Analysis with Large Language Models: does it work with languages other than English? A targeted test in Italian
by: De Paoli, Stefano
Published: (2024)

Full-text Error Correction for Chinese Speech Recognition with Large Language Model
by: Tang, Zhiyuan, et al.
Published: (2024)

How Sampling Affects the Detectability of Machine-written texts: A Comprehensive Study
by: Dubois, Matthieu, et al.
Published: (2025)

Synthetically generated text for supervised text analysis
by: Halterman, Andrew
Published: (2023)

How does a Multilingual LM Handle Multiple Languages?
by: Kakarla, Santhosh, et al.
Published: (2025)

Democratizing the medieval English legal tradition
by: Zhang, Michael, et al.
Published: (2026)

How Chinese are Chinese Language Models? The Puzzling Lack of Language Policy in China's LLMs
by: Wen-Yi, Andrea W, et al.
Published: (2024)

Domain Regeneration: How well do LLMs match syntactic properties of text domains?
by: Ju, Da, et al.
Published: (2025)

Diagnosing our datasets: How does my language model learn clinical information?
by: Jia, Furong, et al.
Published: (2025)

How reparametrization trick broke differentially-private text representation learning
by: Habernal, Ivan
Published: (2022)

How does Misinformation Affect Large Language Model Behaviors and Preferences?
by: Peng, Miao, et al.
Published: (2025)

How does fine-tuning improve sensorimotor representations in large language models?
by: Wu, Minghua, et al.
Published: (2026)

A Chat About Boring Problems: Studying GPT-based text normalization
by: Zhang, Yang, et al.
Published: (2023)

¡¿Qué, qué?!Transculturación and Tato Laviera's Spanglish poetics
by: Stephanie Álvarez Martínez
Published: (2006)

How Much Do LLMs Know About Chinese Zero Pronouns?
by: Li, Yifei, et al.
Published: (2026)

How does Multi-Task Training Affect Transformer In-Context Capabilities? Investigations with Function Classes
by: Bhasin, Harmon, et al.
Published: (2024)

How does Chain of Thought Think? Mechanistic Interpretability of Chain-of-Thought Reasoning with Sparse Autoencoding
by: Chen, Xi, et al.
Published: (2025)

Can reasoning models comprehend mathematical problems in Chinese ancient texts? An empirical study based on data from Suanjing Shishu
by: Liu, Chang, et al.
Published: (2025)

История стиховедения и формализм
by: Orekhov, Boris
Published: (2024)

Identifying social isolation themes in NVDRS text narratives using topic modeling and text-classification methods
by: Walker, Drew, et al.
Published: (2025)

Identifying attributions of causality in political text
by: Garcia-Corral, Paulina
Published: (2025)

Qwen it detect machine-generated text?
by: Marchitan, Teodor-George, et al.
Published: (2025)

What does it mean to understand language?
by: Casto, Colton, et al.
Published: (2025)

LUQ: Long-text Uncertainty Quantification for LLMs
by: Zhang, Caiqi, et al.
Published: (2024)

Leveraging the power of transformers for guilt detection in text
by: Meque, Abdul Gafar Manuel, et al.
Published: (2024)

Few-shot text-based emotion detection
by: Marchitan, Teodor-George, et al.
Published: (2025)

Transferable text data distillation by trajectory matching
by: Yao, Rong, et al.
Published: (2025)

Where does an LLM begin computing an instruction?
by: Pola, Aditya, et al.
Published: (2025)

A multi-level multi-label text classification dataset of 19th century Ottoman and Russian literary and critical texts
by: Gokceoglu, Gokcen, et al.
Published: (2024)