| Main Authors: | Elsner, Micha; Liu, David |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.09778 |
Similar Items
Multilingual acoustic word embeddings for zero-resource languages
by: Jacobs, Christiaan
Published: (2024)
Visually grounded few-shot word learning in low-resource settings
by: Nortje, Leanne, et al.
Published: (2023)
Shortcomings of LLMs for Low-Resource Translation: Retrieval and Understanding are Both the Problem
by: Court, Sara, et al.
Published: (2024)
Can we teach language models to gloss endangered languages?
by: Ginn, Michael, et al.
Published: (2024)
Representing data in words: A context engineering approach
by: Caut, Amandine M., et al.
Published: (2025)
Meaningless is better: hashing bias-inducing words in LLM prompts improves performance in logical reasoning and statistical learning
by: Chadimová, Milena, et al.
Published: (2024)
Does language matter for spoken word classification? A multilingual generative meta-learning approach
by: Ziki, Batsirayi Mupamhi, et al.
Published: (2026)
Target word activity detector: An approach to obtain ASR word boundaries without lexicon
by: Sivasankaran, Sunit, et al.
Published: (2024)
What is a word?
by: Murphy, Elliot
Published: (2024)
LongTail-Swap: benchmarking language models' abilities on rare words
by: Algayres, Robin, et al.
Published: (2025)
Can large language models understand uncommon meanings of common words?
by: Wu, Jinyang, et al.
Published: (2024)
Does mBERT understand Romansh? Evaluating word embeddings using word alignment
by: Dolev, Eyal Liron
Published: (2023)
Understanding the effects of word-level linguistic annotations in under-resourced neural machine translation
by: Sánchez-Cartagena, Víctor M., et al.
Published: (2024)
Word length predicts word order: "Min-max"-ing drives language evolution
by: Ring, Hiram
Published: (2025)
Slice closures of indexed languages and word equations with counting constraints
by: Ciobanu, Laura, et al.
Published: (2024)
Watson-Crick conjugates of words and languages
by: Mahalingam, Kalpana, et al.
Published: (2022)
Using large language models to estimate features of multi-word expressions: Concreteness, valence, arousal
by: Martínez, Gonzalo, et al.
Published: (2024)
Automata on $S$-adic words
by: Berthé, Valérie, et al.
Published: (2025)
Vocabulary shapes cross-lingual variation of word-order learnability in language models
by: Martins, Jonas Mayer, et al.
Published: (2026)
From communities to interpretable network and word embedding: an unified approach
by: Prouteau, Thibault, et al.
Published: (2024)
Language models and brains align due to more than next-word prediction and word-level information
by: Merlin, Gabriele, et al.
Published: (2022)
Why do objects have many names? A study on word informativeness in language use and lexical systems
by: Gualdoni, Eleonora, et al.
Published: (2024)
Evolutionary ecology of words
by: Suzuki, Reiji, et al.
Published: (2025)
Effect of dimensionality change on the bias of word embeddings
by: Rai, Rohit Raj, et al.
Published: (2023)
A new kid on the block: Distributional semantics predicts the word-specific tone signatures of monosyllabic words in conversational Taiwan Mandarin
by: Jin, Xiaoyun, et al.
Published: (2025)
Multi-word Tokenization for Sequence Compression
by: Gee, Leonidas, et al.
Published: (2024)
Less than one percent of words would be affected by gender-inclusive language in German press texts
by: Müller-Spitzer, Carolin, et al.
Published: (2024)
A symbolic Perl algorithm for the unification of Nahuatl word spellings
by: Guzmán-Landa, Juan-José, et al.
Published: (2025)
Comparing LLM prompting with Cross-lingual transfer performance on Indigenous and Low-resource Brazilian Languages
by: Adelani, David Ifeoluwa, et al.
Published: (2024)
Subword models struggle with word learning, but surprisal hides it
by: Bunzeck, Bastian, et al.
Published: (2025)
When does word order matter and when doesn't it?
by: Chen, Xuanda, et al.
Published: (2024)
Automatic Real-word Error Correction in Persian Text
by: Dashti, Seyed Mohammad Sadegh, et al.
Published: (2024)
Speech perception: a model of word recognition
by: Luck, Jean-Marc, et al.
Published: (2024)
All that is English may be Hindi: Enhancing language identification through automatic ranking of likeliness of word borrowing in social media
by: Patro, Jasabanta, et al.
Published: (2017)
InkubaLM: A small language model for low-resource African languages
by: Tonja, Atnafu Lambebo, et al.
Published: (2024)
Demystifying optimized prompts in language models
by: Melamed, Rimon, et al.
Published: (2025)
Clustering of return words in languages of interval exchanges
by: Dolce, Francesco, et al.
Published: (2025)
The truth is no diaper: Human and AI-generated associations to emotional words
by: Vintar, Špela, et al.
Published: (2025)
An experimental and computational study of an Estonian single-person word naming
by: Lõo, Kaidi, et al.
Published: (2025)
Extracting domain-specific terms using contextual word embeddings
by: Repar, Andraž, et al.
Published: (2025)