Saved in:
| Main Authors: | Lochter, Johannes V., Silva, Renato M., Almeida, Tiago A. |
|---|---|
| Format: | Preprint |
| Published: |
2020
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2007.07318 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
What makes a word hard to learn? Modeling L1 influence on English vocabulary difficulty
by: Martins, Jonas Mayer, et al.
Published: (2026)
by: Martins, Jonas Mayer, et al.
Published: (2026)
OmniDraft: A Cross-vocabulary, Online Adaptive Drafter for On-device Speculative Decoding
by: Ramakrishnan, Ramchalam Kinattinkara, et al.
Published: (2025)
by: Ramakrishnan, Ramchalam Kinattinkara, et al.
Published: (2025)
Multi-word Tokenization for Sequence Compression
by: Gee, Leonidas, et al.
Published: (2024)
by: Gee, Leonidas, et al.
Published: (2024)
Transformers represent belief state geometry in their residual stream
by: Shai, Adam S., et al.
Published: (2024)
by: Shai, Adam S., et al.
Published: (2024)
Vocabulary shapes cross-lingual variation of word-order learnability in language models
by: Martins, Jonas Mayer, et al.
Published: (2026)
by: Martins, Jonas Mayer, et al.
Published: (2026)
Critical biblical studies via word frequency analysis: unveiling text authorship
by: Faigenbaum-Golovin, Shira, et al.
Published: (2024)
by: Faigenbaum-Golovin, Shira, et al.
Published: (2024)
Forcing Diffuse Distributions out of Language Models
by: Zhang, Yiming, et al.
Published: (2024)
by: Zhang, Yiming, et al.
Published: (2024)
AIDetx: a compression-based method for identification of machine-learning generated text
by: Almeida, Leonardo, et al.
Published: (2024)
by: Almeida, Leonardo, et al.
Published: (2024)
From communities to interpretable network and word embedding: an unified approach
by: Prouteau, Thibault, et al.
Published: (2024)
by: Prouteau, Thibault, et al.
Published: (2024)
Effects of term weighting approach with and without stop words removing on Arabic text classification
by: Alhenawi, Esra'a, et al.
Published: (2024)
by: Alhenawi, Esra'a, et al.
Published: (2024)
Detecting out-of-distribution text using topological features of transformer-based language models
by: Pollano, Andres, et al.
Published: (2023)
by: Pollano, Andres, et al.
Published: (2023)
Convergence and Divergence of Language Models under Different Random Seeds
by: Fehlauer, Finlay, et al.
Published: (2025)
by: Fehlauer, Finlay, et al.
Published: (2025)
A light-weight and efficient punctuation and word casing prediction model for on-device streaming ASR
by: You, Jian, et al.
Published: (2024)
by: You, Jian, et al.
Published: (2024)
Deep learning and abstractive summarisation for radiological reports: an empirical study for adapting the PEGASUS models' family with scarce data
by: Benzoni, Claudio, et al.
Published: (2025)
by: Benzoni, Claudio, et al.
Published: (2025)
Enhancing ASD detection accuracy: a combined approach of machine learning and deep learning models with natural language processing
by: Rubio-Martín, Sergio, et al.
Published: (2024)
by: Rubio-Martín, Sergio, et al.
Published: (2024)
How do language models learn facts? Dynamics, curricula and hallucinations
by: Zucchet, Nicolas, et al.
Published: (2025)
by: Zucchet, Nicolas, et al.
Published: (2025)
Forecasting Events in Soccer Matches Through Language
by: Mendes-Neves, Tiago, et al.
Published: (2024)
by: Mendes-Neves, Tiago, et al.
Published: (2024)
Deep literature reviews: an application of fine-tuned language models to migration research
by: Iacus, Stefano M., et al.
Published: (2025)
by: Iacus, Stefano M., et al.
Published: (2025)
Prompt reinforcing for long-term planning of large language models
by: Lin, Hsien-Chin, et al.
Published: (2025)
by: Lin, Hsien-Chin, et al.
Published: (2025)
The representation landscape of few-shot learning and fine-tuning in large language models
by: Doimo, Diego, et al.
Published: (2024)
by: Doimo, Diego, et al.
Published: (2024)
Topic Modeling with Fine-tuning LLMs and Bag of Sentences
by: Schneider, Johannes
Published: (2024)
by: Schneider, Johannes
Published: (2024)
Improving Next Tokens via Second-to-Last Predictions with Generate and Refine
by: Schneider, Johannes
Published: (2024)
by: Schneider, Johannes
Published: (2024)
Efficient and Flexible Topic Modeling using Pretrained Embeddings and Bag of Sentences
by: Schneider, Johannes
Published: (2023)
by: Schneider, Johannes
Published: (2023)
Tokenisation via Convex Relaxations
by: Tempus, Jan, et al.
Published: (2026)
by: Tempus, Jan, et al.
Published: (2026)
Empowering machine learning models with contextual knowledge for enhancing the detection of eating disorders in social media posts
by: Benítez-Andrades, José Alberto, et al.
Published: (2024)
by: Benítez-Andrades, José Alberto, et al.
Published: (2024)
Leveraging large language models for structured information extraction from pathology reports
by: Balasubramanian, Jeya Balaji, et al.
Published: (2025)
by: Balasubramanian, Jeya Balaji, et al.
Published: (2025)
Do Generalisation Results Generalise?
by: Boglioni, Matteo, et al.
Published: (2025)
by: Boglioni, Matteo, et al.
Published: (2025)
Playing with words: Comparing the vocabulary and lexical diversity of ChatGPT and humans
by: Reviriego, Pedro, et al.
Published: (2023)
by: Reviriego, Pedro, et al.
Published: (2023)
A meta-analysis on the performance of machine-learning based language models for sentiment analysis
by: Rohde, Elena, et al.
Published: (2025)
by: Rohde, Elena, et al.
Published: (2025)
Negation Neglect: When models fail to learn negations in training
by: Mayne, Harry, et al.
Published: (2026)
by: Mayne, Harry, et al.
Published: (2026)
Latent learning: episodic memory complements parametric learning by enabling flexible reuse of experiences
by: Lampinen, Andrew Kyle, et al.
Published: (2025)
by: Lampinen, Andrew Kyle, et al.
Published: (2025)
Predicting ATP binding sites in protein sequences using Deep Learning and Natural Language Processing
by: V, Shreyas, et al.
Published: (2024)
by: V, Shreyas, et al.
Published: (2024)
The broader spectrum of in-context learning
by: Lampinen, Andrew Kyle, et al.
Published: (2024)
by: Lampinen, Andrew Kyle, et al.
Published: (2024)
Where is the signal in tokenization space?
by: Geh, Renato Lui, et al.
Published: (2024)
by: Geh, Renato Lui, et al.
Published: (2024)
Large language models reorganize representational geometry during in-context learning
by: Xiong, Hua-Dong, et al.
Published: (2026)
by: Xiong, Hua-Dong, et al.
Published: (2026)
CausalLM is not optimal for in-context learning
by: Ding, Nan, et al.
Published: (2023)
by: Ding, Nan, et al.
Published: (2023)
More than words: Advancements and challenges in speech recognition for singing
by: Kruspe, Anna
Published: (2024)
by: Kruspe, Anna
Published: (2024)
JMI at SemEval 2024 Task 3: Two-step approach for multimodal ECAC using in-context learning with GPT and instruction-tuned Llama models
by: Arefa, et al.
Published: (2024)
by: Arefa, et al.
Published: (2024)
Deep sequence models tend to memorize geometrically; it is unclear why
by: Noroozizadeh, Shahriar, et al.
Published: (2025)
by: Noroozizadeh, Shahriar, et al.
Published: (2025)
Rethinking Reinforcement Fine-Tuning in LVLM: Convergence, Reward Decomposition, and Generalization
by: Adams, Carter, et al.
Published: (2026)
by: Adams, Carter, et al.
Published: (2026)
Similar Items
-
What makes a word hard to learn? Modeling L1 influence on English vocabulary difficulty
by: Martins, Jonas Mayer, et al.
Published: (2026) -
OmniDraft: A Cross-vocabulary, Online Adaptive Drafter for On-device Speculative Decoding
by: Ramakrishnan, Ramchalam Kinattinkara, et al.
Published: (2025) -
Multi-word Tokenization for Sequence Compression
by: Gee, Leonidas, et al.
Published: (2024) -
Transformers represent belief state geometry in their residual stream
by: Shai, Adam S., et al.
Published: (2024) -
Vocabulary shapes cross-lingual variation of word-order learnability in language models
by: Martins, Jonas Mayer, et al.
Published: (2026)