:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Berman, David S., Stapleton, Alexander G.
Format:	Preprint
Published:	2026
Subjects:	Computation and Language Machine Learning
Online Access:	https://arxiv.org/abs/2601.03368
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Infusing clinical knowledge into tokenisers for language models
by: Hasan, Abul, et al.
Published: (2024)

Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
by: Raposo, David, et al.
Published: (2024)

Zipf Distributions from Two-Stage Symbolic Processes: Stability Under Stochastic Lexical Filtering
by: Berman, Vladimir
Published: (2025)

InsurTech innovation using natural language processing
by: Dong, Panyi, et al.
Published: (2025)

Applications of natural language processing in aviation safety: A review and qualitative analysis
by: Nanyonga, Aziida, et al.
Published: (2025)

On the evolution of research in hypersonics: application of natural language processing and machine learning
by: Ebadi, Ashkan, et al.
Published: (2022)

Random Text, Zipf's Law, Critical Length,and Implications for Large Language Models
by: Berman, Vladimir
Published: (2025)

Comparison of different Unique hard attention transformer models by the formal languages they can recognize
by: Ryvkin, Leonid
Published: (2025)

MedicalBERT: enhancing biomedical natural language processing using pretrained BERT-based model
by: Reddy, K. Sahit, et al.
Published: (2025)

Benchmarking large language models for biomedical natural language processing applications and recommendations
by: Chen, Qingyu, et al.
Published: (2023)

EEG-CLIP : Learning EEG representations from natural language descriptions
by: Ndir, Tidiane Camaret, et al.
Published: (2025)

A comparison of pipelines for the translation of a low resource language based on transformers
by: Bonfanti, Chiara, et al.
Published: (2025)

Detecting out-of-distribution text using topological features of transformer-based language models
by: Pollano, Andres, et al.
Published: (2023)

Physical models realizing the transformer architecture of large language models
by: Chen, Zeqian
Published: (2025)

SSCAE -- Semantic, Syntactic, and Context-aware natural language Adversarial Examples generator
by: Asl, Javad Rafiei, et al.
Published: (2024)

Enhancing ASD detection accuracy: a combined approach of machine learning and deep learning models with natural language processing
by: Rubio-Martín, Sergio, et al.
Published: (2024)

Block removal for large language models through constrained binary optimization
by: Jansen, David, et al.
Published: (2026)

Predicting potentially abusive clauses in Chilean terms of services with natural language processing
by: Loeffler, Christoffer, et al.
Published: (2025)

Innovative tokenisation of structured data for LLM training
by: Karim, Kayvan, et al.
Published: (2025)

Zero-shot data citation function classification using transformer-based large language models (LLMs)
by: Byers, Neil, et al.
Published: (2025)

BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback
by: Pandey, Gaurav, et al.
Published: (2024)

Optimization Strategies for Enhancing Resource Efficiency in Transformers & Large Language Models
by: Wallace, Tom, et al.
Published: (2025)

Transformers need glasses! Information over-squashing in language tasks
by: Barbero, Federico, et al.
Published: (2024)

A mean teacher algorithm for unlearning of language models
by: Klochkov, Yegor
Published: (2025)

Aligning language models with human preferences
by: Korbak, Tomasz
Published: (2024)

Evaluating language models as risk scores
by: Cruz, André F., et al.
Published: (2024)

DevBench: A multimodal developmental benchmark for language learning
by: Tan, Alvin Wei Ming, et al.
Published: (2024)

Amortizing intractable inference in large language models
by: Hu, Edward J., et al.
Published: (2023)

Attribution analysis of legal language as used by LLM
by: Belew, Richard K.
Published: (2025)

Human-interpretable clustering of short-text using large language models
by: Miller, Justin K., et al.
Published: (2024)

Learning from flowsheets: A generative transformer model for autocompletion of flowsheets
by: Vogel, Gabriel, et al.
Published: (2022)

Dynamic layer selection in decoder-only transformers
by: Glavas, Theodore, et al.
Published: (2024)

The SMeL Test: A simple benchmark for media literacy in language models
by: Ahdritz, Gustaf, et al.
Published: (2025)

Perturbed examples reveal invariances shared by language models
by: Rawal, Ruchit, et al.
Published: (2023)

Do language models plan ahead for future tokens?
by: Wu, Wilson, et al.
Published: (2024)

Visualizing token importance for black-box language models
by: Rauba, Paulius, et al.
Published: (2025)

DataComp-LM: In search of the next generation of training sets for language models
by: Li, Jeffrey, et al.
Published: (2024)

Prompt reinforcing for long-term planning of large language models
by: Lin, Hsien-Chin, et al.
Published: (2025)

Machine-generated text detection prevents language model collapse
by: Drayson, George, et al.
Published: (2025)

Alignment faking in large language models
by: Greenblatt, Ryan, et al.
Published: (2024)