Library Catalog

Bibliographic Details
Main Authors:	Lochter, Johannes V., Silva, Renato M., Almeida, Tiago A.
Format:	Preprint
Published:	2020
Subjects:	Computation and Language; Machine Learning
Online Access:	https://arxiv.org/abs/2007.07318

Similar Items

What makes a word hard to learn? Modeling L1 influence on English vocabulary difficulty
by: Martins, Jonas Mayer, et al.
Published: (2026)

OmniDraft: A Cross-vocabulary, Online Adaptive Drafter for On-device Speculative Decoding
by: Ramakrishnan, Ramchalam Kinattinkara, et al.
Published: (2025)

Multi-word Tokenization for Sequence Compression
by: Gee, Leonidas, et al.
Published: (2024)

Transformers represent belief state geometry in their residual stream
by: Shai, Adam S., et al.
Published: (2024)

Vocabulary shapes cross-lingual variation of word-order learnability in language models
by: Martins, Jonas Mayer, et al.
Published: (2026)

Critical biblical studies via word frequency analysis: unveiling text authorship
by: Faigenbaum-Golovin, Shira, et al.
Published: (2024)

Forcing Diffuse Distributions out of Language Models
by: Zhang, Yiming, et al.
Published: (2024)

AIDetx: a compression-based method for identification of machine-learning generated text
by: Almeida, Leonardo, et al.
Published: (2024)

From communities to interpretable network and word embedding: an unified approach
by: Prouteau, Thibault, et al.
Published: (2024)

Effects of term weighting approach with and without stop words removing on Arabic text classification
by: Alhenawi, Esra'a, et al.
Published: (2024)

Detecting out-of-distribution text using topological features of transformer-based language models
by: Pollano, Andres, et al.
Published: (2023)

Convergence and Divergence of Language Models under Different Random Seeds
by: Fehlauer, Finlay, et al.
Published: (2025)

A light-weight and efficient punctuation and word casing prediction model for on-device streaming ASR
by: You, Jian, et al.
Published: (2024)

Deep learning and abstractive summarisation for radiological reports: an empirical study for adapting the PEGASUS models' family with scarce data
by: Benzoni, Claudio, et al.
Published: (2025)

Enhancing ASD detection accuracy: a combined approach of machine learning and deep learning models with natural language processing
by: Rubio-Martín, Sergio, et al.
Published: (2024)

How do language models learn facts? Dynamics, curricula and hallucinations
by: Zucchet, Nicolas, et al.
Published: (2025)

Forecasting Events in Soccer Matches Through Language
by: Mendes-Neves, Tiago, et al.
Published: (2024)

Deep literature reviews: an application of fine-tuned language models to migration research
by: Iacus, Stefano M., et al.
Published: (2025)

Prompt reinforcing for long-term planning of large language models
by: Lin, Hsien-Chin, et al.
Published: (2025)

The representation landscape of few-shot learning and fine-tuning in large language models
by: Doimo, Diego, et al.
Published: (2024)

Topic Modeling with Fine-tuning LLMs and Bag of Sentences
by: Schneider, Johannes
Published: (2024)

Improving Next Tokens via Second-to-Last Predictions with Generate and Refine
by: Schneider, Johannes
Published: (2024)

Efficient and Flexible Topic Modeling using Pretrained Embeddings and Bag of Sentences
by: Schneider, Johannes
Published: (2023)

Tokenisation via Convex Relaxations
by: Tempus, Jan, et al.
Published: (2026)

Empowering machine learning models with contextual knowledge for enhancing the detection of eating disorders in social media posts
by: Benítez-Andrades, José Alberto, et al.
Published: (2024)

Leveraging large language models for structured information extraction from pathology reports
by: Balasubramanian, Jeya Balaji, et al.
Published: (2025)

Do Generalisation Results Generalise?
by: Boglioni, Matteo, et al.
Published: (2025)

Playing with words: Comparing the vocabulary and lexical diversity of ChatGPT and humans
by: Reviriego, Pedro, et al.
Published: (2023)

A meta-analysis on the performance of machine-learning based language models for sentiment analysis
by: Rohde, Elena, et al.
Published: (2025)

Negation Neglect: When models fail to learn negations in training
by: Mayne, Harry, et al.
Published: (2026)

Latent learning: episodic memory complements parametric learning by enabling flexible reuse of experiences
by: Lampinen, Andrew Kyle, et al.
Published: (2025)

Predicting ATP binding sites in protein sequences using Deep Learning and Natural Language Processing
by: V, Shreyas, et al.
Published: (2024)

The broader spectrum of in-context learning
by: Lampinen, Andrew Kyle, et al.
Published: (2024)

Where is the signal in tokenization space?
by: Geh, Renato Lui, et al.
Published: (2024)

Large language models reorganize representational geometry during in-context learning
by: Xiong, Hua-Dong, et al.
Published: (2026)

CausalLM is not optimal for in-context learning
by: Ding, Nan, et al.
Published: (2023)

More than words: Advancements and challenges in speech recognition for singing
by: Kruspe, Anna
Published: (2024)

JMI at SemEval 2024 Task 3: Two-step approach for multimodal ECAC using in-context learning with GPT and instruction-tuned Llama models
by: Arefa, et al.
Published: (2024)

Deep sequence models tend to memorize geometrically; it is unclear why
by: Noroozizadeh, Shahriar, et al.
Published: (2025)

Rethinking Reinforcement Fine-Tuning in LVLM: Convergence, Reward Decomposition, and Generalization
by: Adams, Carter, et al.
Published: (2026)