Saved in:
| Main Authors: | Berman, David S., Stapleton, Alexander G. |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.03368 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Infusing clinical knowledge into tokenisers for language models
by: Hasan, Abul, et al.
Published: (2024)
by: Hasan, Abul, et al.
Published: (2024)
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
by: Raposo, David, et al.
Published: (2024)
by: Raposo, David, et al.
Published: (2024)
Zipf Distributions from Two-Stage Symbolic Processes: Stability Under Stochastic Lexical Filtering
by: Berman, Vladimir
Published: (2025)
by: Berman, Vladimir
Published: (2025)
InsurTech innovation using natural language processing
by: Dong, Panyi, et al.
Published: (2025)
by: Dong, Panyi, et al.
Published: (2025)
Applications of natural language processing in aviation safety: A review and qualitative analysis
by: Nanyonga, Aziida, et al.
Published: (2025)
by: Nanyonga, Aziida, et al.
Published: (2025)
On the evolution of research in hypersonics: application of natural language processing and machine learning
by: Ebadi, Ashkan, et al.
Published: (2022)
by: Ebadi, Ashkan, et al.
Published: (2022)
Random Text, Zipf's Law, Critical Length,and Implications for Large Language Models
by: Berman, Vladimir
Published: (2025)
by: Berman, Vladimir
Published: (2025)
Comparison of different Unique hard attention transformer models by the formal languages they can recognize
by: Ryvkin, Leonid
Published: (2025)
by: Ryvkin, Leonid
Published: (2025)
MedicalBERT: enhancing biomedical natural language processing using pretrained BERT-based model
by: Reddy, K. Sahit, et al.
Published: (2025)
by: Reddy, K. Sahit, et al.
Published: (2025)
Benchmarking large language models for biomedical natural language processing applications and recommendations
by: Chen, Qingyu, et al.
Published: (2023)
by: Chen, Qingyu, et al.
Published: (2023)
EEG-CLIP : Learning EEG representations from natural language descriptions
by: Ndir, Tidiane Camaret, et al.
Published: (2025)
by: Ndir, Tidiane Camaret, et al.
Published: (2025)
A comparison of pipelines for the translation of a low resource language based on transformers
by: Bonfanti, Chiara, et al.
Published: (2025)
by: Bonfanti, Chiara, et al.
Published: (2025)
Detecting out-of-distribution text using topological features of transformer-based language models
by: Pollano, Andres, et al.
Published: (2023)
by: Pollano, Andres, et al.
Published: (2023)
Physical models realizing the transformer architecture of large language models
by: Chen, Zeqian
Published: (2025)
by: Chen, Zeqian
Published: (2025)
SSCAE -- Semantic, Syntactic, and Context-aware natural language Adversarial Examples generator
by: Asl, Javad Rafiei, et al.
Published: (2024)
by: Asl, Javad Rafiei, et al.
Published: (2024)
Enhancing ASD detection accuracy: a combined approach of machine learning and deep learning models with natural language processing
by: Rubio-Martín, Sergio, et al.
Published: (2024)
by: Rubio-Martín, Sergio, et al.
Published: (2024)
Block removal for large language models through constrained binary optimization
by: Jansen, David, et al.
Published: (2026)
by: Jansen, David, et al.
Published: (2026)
Predicting potentially abusive clauses in Chilean terms of services with natural language processing
by: Loeffler, Christoffer, et al.
Published: (2025)
by: Loeffler, Christoffer, et al.
Published: (2025)
Innovative tokenisation of structured data for LLM training
by: Karim, Kayvan, et al.
Published: (2025)
by: Karim, Kayvan, et al.
Published: (2025)
Zero-shot data citation function classification using transformer-based large language models (LLMs)
by: Byers, Neil, et al.
Published: (2025)
by: Byers, Neil, et al.
Published: (2025)
BRAIn: Bayesian Reward-conditioned Amortized Inference for natural language generation from feedback
by: Pandey, Gaurav, et al.
Published: (2024)
by: Pandey, Gaurav, et al.
Published: (2024)
Optimization Strategies for Enhancing Resource Efficiency in Transformers & Large Language Models
by: Wallace, Tom, et al.
Published: (2025)
by: Wallace, Tom, et al.
Published: (2025)
Transformers need glasses! Information over-squashing in language tasks
by: Barbero, Federico, et al.
Published: (2024)
by: Barbero, Federico, et al.
Published: (2024)
A mean teacher algorithm for unlearning of language models
by: Klochkov, Yegor
Published: (2025)
by: Klochkov, Yegor
Published: (2025)
Aligning language models with human preferences
by: Korbak, Tomasz
Published: (2024)
by: Korbak, Tomasz
Published: (2024)
Evaluating language models as risk scores
by: Cruz, André F., et al.
Published: (2024)
by: Cruz, André F., et al.
Published: (2024)
DevBench: A multimodal developmental benchmark for language learning
by: Tan, Alvin Wei Ming, et al.
Published: (2024)
by: Tan, Alvin Wei Ming, et al.
Published: (2024)
Amortizing intractable inference in large language models
by: Hu, Edward J., et al.
Published: (2023)
by: Hu, Edward J., et al.
Published: (2023)
Attribution analysis of legal language as used by LLM
by: Belew, Richard K.
Published: (2025)
by: Belew, Richard K.
Published: (2025)
Human-interpretable clustering of short-text using large language models
by: Miller, Justin K., et al.
Published: (2024)
by: Miller, Justin K., et al.
Published: (2024)
Learning from flowsheets: A generative transformer model for autocompletion of flowsheets
by: Vogel, Gabriel, et al.
Published: (2022)
by: Vogel, Gabriel, et al.
Published: (2022)
Dynamic layer selection in decoder-only transformers
by: Glavas, Theodore, et al.
Published: (2024)
by: Glavas, Theodore, et al.
Published: (2024)
The SMeL Test: A simple benchmark for media literacy in language models
by: Ahdritz, Gustaf, et al.
Published: (2025)
by: Ahdritz, Gustaf, et al.
Published: (2025)
Perturbed examples reveal invariances shared by language models
by: Rawal, Ruchit, et al.
Published: (2023)
by: Rawal, Ruchit, et al.
Published: (2023)
Do language models plan ahead for future tokens?
by: Wu, Wilson, et al.
Published: (2024)
by: Wu, Wilson, et al.
Published: (2024)
Visualizing token importance for black-box language models
by: Rauba, Paulius, et al.
Published: (2025)
by: Rauba, Paulius, et al.
Published: (2025)
DataComp-LM: In search of the next generation of training sets for language models
by: Li, Jeffrey, et al.
Published: (2024)
by: Li, Jeffrey, et al.
Published: (2024)
Prompt reinforcing for long-term planning of large language models
by: Lin, Hsien-Chin, et al.
Published: (2025)
by: Lin, Hsien-Chin, et al.
Published: (2025)
Machine-generated text detection prevents language model collapse
by: Drayson, George, et al.
Published: (2025)
by: Drayson, George, et al.
Published: (2025)
Alignment faking in large language models
by: Greenblatt, Ryan, et al.
Published: (2024)
by: Greenblatt, Ryan, et al.
Published: (2024)
Similar Items
-
Infusing clinical knowledge into tokenisers for language models
by: Hasan, Abul, et al.
Published: (2024) -
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models
by: Raposo, David, et al.
Published: (2024) -
Zipf Distributions from Two-Stage Symbolic Processes: Stability Under Stochastic Lexical Filtering
by: Berman, Vladimir
Published: (2025) -
InsurTech innovation using natural language processing
by: Dong, Panyi, et al.
Published: (2025) -
Applications of natural language processing in aviation safety: A review and qualitative analysis
by: Nanyonga, Aziida, et al.
Published: (2025)