| Main Author: | Orekhov, Boris |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Online Access: | https://arxiv.org/abs/2407.08099 |
Similar Items
Does Burrows' Delta really confirm that Rowling and Galbraith are the same author?
by: Orekhov, Boris
Published: (2024)
You shall know a piece by the company it keeps. Chess plays as a data for word2vec models
by: Orekhov, Boris
Published: (2024)
Is text normalization relevant for classifying medieval charters?
by: Atzenhofer-Baumgartner, Florian, et al.
Published: (2024)
Metronome: tracing variation in poetic meters via local sequence alignment
by: Nagy, Ben, et al.
Published: (2024)
Markov reads Pushkin, again: A statistical journey into the poetic world of Evgenij Onegin
by: Sabatini, Angelo Maria
Published: (2026)
Why mask diffusion does not work
by: Sun, Haocheng, et al.
Published: (2025)
How do we measure privacy in text? A survey of text anonymization metrics
by: Ren, Yaxuan, et al.
Published: (2025)
Advancing Chinese biomedical text mining with community challenges
by: Zong, Hui, et al.
Published: (2024)
How and where does CLIP process negation?
by: Quantmeyer, Vincent, et al.
Published: (2024)
Translating scientific Latin texts with artificial intelligence: the works of Euler and contemporaries
by: Bistafa, Sylvio R.
Published: (2023)
How does a Language-Specific Tokenizer affect LLMs?
by: Seo, Jean, et al.
Published: (2025)
Thematic Analysis with Large Language Models: does it work with languages other than English? A targeted test in Italian
by: De Paoli, Stefano
Published: (2024)
Full-text Error Correction for Chinese Speech Recognition with Large Language Model
by: Tang, Zhiyuan, et al.
Published: (2024)
How Sampling Affects the Detectability of Machine-written texts: A Comprehensive Study
by: Dubois, Matthieu, et al.
Published: (2025)
Synthetically generated text for supervised text analysis
by: Halterman, Andrew
Published: (2023)
How does a Multilingual LM Handle Multiple Languages?
by: Kakarla, Santhosh, et al.
Published: (2025)
Democratizing the medieval English legal tradition
by: Zhang, Michael, et al.
Published: (2026)
How Chinese are Chinese Language Models? The Puzzling Lack of Language Policy in China's LLMs
by: Wen-Yi, Andrea W, et al.
Published: (2024)
Domain Regeneration: How well do LLMs match syntactic properties of text domains?
by: Ju, Da, et al.
Published: (2025)
Diagnosing our datasets: How does my language model learn clinical information?
by: Jia, Furong, et al.
Published: (2025)
How reparametrization trick broke differentially-private text representation learning
by: Habernal, Ivan
Published: (2022)
How does Misinformation Affect Large Language Model Behaviors and Preferences?
by: Peng, Miao, et al.
Published: (2025)
How does fine-tuning improve sensorimotor representations in large language models?
by: Wu, Minghua, et al.
Published: (2026)
A Chat About Boring Problems: Studying GPT-based text normalization
by: Zhang, Yang, et al.
Published: (2023)
¡¿Qué, qué?! Transculturación and Tato Laviera's Spanglish poetics
by: Stephanie Álvarez Martínez
Published: (2006)
How Much Do LLMs Know About Chinese Zero Pronouns?
by: Li, Yifei, et al.
Published: (2026)
How does Multi-Task Training Affect Transformer In-Context Capabilities? Investigations with Function Classes
by: Bhasin, Harmon, et al.
Published: (2024)
How does Chain of Thought Think? Mechanistic Interpretability of Chain-of-Thought Reasoning with Sparse Autoencoding
by: Chen, Xi, et al.
Published: (2025)
Can reasoning models comprehend mathematical problems in Chinese ancient texts? An empirical study based on data from Suanjing Shishu
by: Liu, Chang, et al.
Published: (2025)
История стиховедения и формализм [The History of Verse Studies and Formalism]
by: Orekhov, Boris
Published: (2024)
Identifying social isolation themes in NVDRS text narratives using topic modeling and text-classification methods
by: Walker, Drew, et al.
Published: (2025)
Identifying attributions of causality in political text
by: Garcia-Corral, Paulina
Published: (2025)
Qwen it detect machine-generated text?
by: Marchitan, Teodor-George, et al.
Published: (2025)
What does it mean to understand language?
by: Casto, Colton, et al.
Published: (2025)
LUQ: Long-text Uncertainty Quantification for LLMs
by: Zhang, Caiqi, et al.
Published: (2024)
Leveraging the power of transformers for guilt detection in text
by: Meque, Abdul Gafar Manuel, et al.
Published: (2024)
Few-shot text-based emotion detection
by: Marchitan, Teodor-George, et al.
Published: (2025)
Transferable text data distillation by trajectory matching
by: Yao, Rong, et al.
Published: (2025)
Where does an LLM begin computing an instruction?
by: Pola, Aditya, et al.
Published: (2025)
A multi-level multi-label text classification dataset of 19th century Ottoman and Russian literary and critical texts
by: Gokceoglu, Gokcen, et al.
Published: (2024)