Saved in:
| Main Authors: | Shi, Shaozhen, Matusevych, Yevgen, Nissim, Malvina |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2410.22081 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
BabyLlama-2: Ensemble-Distilled Models Consistently Outperform Teachers With Limited Data
by: Tastet, Jean-Loup, et al.
Published: (2024)
by: Tastet, Jean-Loup, et al.
Published: (2024)
Generating Completions for Broca's Aphasic Sentences Using Large Language Models
by: van Vaals, Sijbren, et al.
Published: (2024)
by: van Vaals, Sijbren, et al.
Published: (2024)
mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models
by: Lai, Huiyuan, et al.
Published: (2024)
by: Lai, Huiyuan, et al.
Published: (2024)
IT5: Text-to-text Pretraining for Italian Language Understanding and Generation
by: Sarti, Gabriele, et al.
Published: (2022)
by: Sarti, Gabriele, et al.
Published: (2022)
Puzzled By ChatGPT? No more! A Jigsaw Puzzle to Promote AI Literacy and Awareness
by: Padovani, Francesca, et al.
Published: (2026)
by: Padovani, Francesca, et al.
Published: (2026)
Is Child-Directed Language Optimized for Word Learning? A Computational Study of Verb Meaning Acquisition
by: Padovani, Francesca, et al.
Published: (2026)
by: Padovani, Francesca, et al.
Published: (2026)
Child-Directed Language Does Not Consistently Boost Syntax Learning in Language Models
by: Padovani, Francesca, et al.
Published: (2025)
by: Padovani, Francesca, et al.
Published: (2025)
TACLer: Tailored Curriculum Reinforcement Learning for Efficient Reasoning
by: Lai, Huiyuan, et al.
Published: (2026)
by: Lai, Huiyuan, et al.
Published: (2026)
Visually Grounded Speech Models have a Mutual Exclusivity Bias
by: Nortje, Leanne, et al.
Published: (2024)
by: Nortje, Leanne, et al.
Published: (2024)
The mutual exclusivity bias of bilingual visually grounded speech models
by: Oneata, Dan, et al.
Published: (2025)
by: Oneata, Dan, et al.
Published: (2025)
BabyLM Turns 3: Call for papers for the 2025 BabyLM workshop
by: Charpentier, Lucas, et al.
Published: (2025)
by: Charpentier, Lucas, et al.
Published: (2025)
Multidimensional Consistency Improves Reasoning in Language Models
by: Lai, Huiyuan, et al.
Published: (2025)
by: Lai, Huiyuan, et al.
Published: (2025)
When Harry Meets Superman: The Role of The Interlocutor in Persona-Based Dialogue Generation
by: Occhipinti, Daniela, et al.
Published: (2025)
by: Occhipinti, Daniela, et al.
Published: (2025)
BabyLM Turns 4 and Goes Multilingual: Call for Papers for the 2026 BabyLM Workshop
by: Choshen, Leshem, et al.
Published: (2026)
by: Choshen, Leshem, et al.
Published: (2026)
A gentle push funziona benissimo: making instructed models in Italian via contrastive activation steering
by: Scalena, Daniel, et al.
Published: (2024)
by: Scalena, Daniel, et al.
Published: (2024)
Practising responsibility: Ethics in NLP as a hands-on course
by: Nissim, Malvina, et al.
Published: (2025)
by: Nissim, Malvina, et al.
Published: (2025)
Small Language Models Also Work With Small Vocabularies: Probing the Linguistic Abilities of Grapheme- and Phoneme-Based Baby Llamas
by: Bunzeck, Bastian, et al.
Published: (2024)
by: Bunzeck, Bastian, et al.
Published: (2024)
Can Model Uncertainty Function as a Proxy for Multiple-Choice Question Item Difficulty?
by: Zotos, Leonidas, et al.
Published: (2024)
by: Zotos, Leonidas, et al.
Published: (2024)
Are You Doubtful? Oh, It Might Be Difficult Then! Exploring the Use of Model Uncertainty for Question Difficulty Estimation
by: Zotos, Leonidas, et al.
Published: (2024)
by: Zotos, Leonidas, et al.
Published: (2024)
The Role of the Availability Heuristic in Multiple-Choice Answering Behaviour
by: Zotos, Leonidas, et al.
Published: (2026)
by: Zotos, Leonidas, et al.
Published: (2026)
Multi-property Steering of Large Language Models with Dynamic Activation Composition
by: Scalena, Daniel, et al.
Published: (2024)
by: Scalena, Daniel, et al.
Published: (2024)
Are BabyLMs Second Language Learners?
by: Edman, Lukas, et al.
Published: (2024)
by: Edman, Lukas, et al.
Published: (2024)
When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets?
by: Iyer, Srikrishna
Published: (2024)
by: Iyer, Srikrishna
Published: (2024)
BAMBI: Developing Baby Language Models for Italian
by: Suozzi, Alice, et al.
Published: (2025)
by: Suozzi, Alice, et al.
Published: (2025)
Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses
by: Sarti, Gabriele, et al.
Published: (2024)
by: Sarti, Gabriele, et al.
Published: (2024)
Unsupervised Word-level Quality Estimation for Machine Translation Through the Lens of Annotators (Dis)agreement
by: Sarti, Gabriele, et al.
Published: (2025)
by: Sarti, Gabriele, et al.
Published: (2025)
BabyVision: Visual Reasoning Beyond Language
by: Chen, Liang, et al.
Published: (2026)
by: Chen, Liang, et al.
Published: (2026)
CAIT: A Syntactic Parsing Toolkit for Child-Adult InTeractions
by: Padovani, Francesca, et al.
Published: (2026)
by: Padovani, Francesca, et al.
Published: (2026)
BabyReasoningBench: Generating Developmentally-Inspired Reasoning Tasks for Evaluating Baby Language Models
by: Dhole, Kaustubh D.
Published: (2026)
by: Dhole, Kaustubh D.
Published: (2026)
Child-directed speech facilitates production, not comprehension, in BabyLMs
by: Bunzeck, Bastian, et al.
Published: (2026)
by: Bunzeck, Bastian, et al.
Published: (2026)
Quantifying the Plausibility of Context Reliance in Neural Machine Translation
by: Sarti, Gabriele, et al.
Published: (2023)
by: Sarti, Gabriele, et al.
Published: (2023)
Mini Minds: Exploring Bebeshka and Zlata Baby Models
by: Proskurina, Irina, et al.
Published: (2023)
by: Proskurina, Irina, et al.
Published: (2023)
BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models
by: Haller, Patrick, et al.
Published: (2024)
by: Haller, Patrick, et al.
Published: (2024)
BAMBINO-LM: (Bilingual-)Human-Inspired Continual Pretraining of BabyLM
by: Shen, Zhewen, et al.
Published: (2024)
by: Shen, Zhewen, et al.
Published: (2024)
Steering Large Language Models for Machine Translation Personalization
by: Scalena, Daniel, et al.
Published: (2025)
by: Scalena, Daniel, et al.
Published: (2025)
ARGUS: Seeing the Influence of Narrative Features on Persuasion in Argumentative Texts
by: Nabhani, Sara, et al.
Published: (2026)
by: Nabhani, Sara, et al.
Published: (2026)
BabyLM's First Constructions: Causal probing provides a signal of learning
by: Rozner, Joshua, et al.
Published: (2025)
by: Rozner, Joshua, et al.
Published: (2025)
CLASS-IT: Conversational and Lecture-Aligned Small-Scale Instruction Tuning for BabyLMs
by: Capone, Luca, et al.
Published: (2025)
by: Capone, Luca, et al.
Published: (2025)
BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data
by: Jumelet, Jaap, et al.
Published: (2025)
by: Jumelet, Jaap, et al.
Published: (2025)
Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
by: Warstadt, Alex, et al.
Published: (2025)
by: Warstadt, Alex, et al.
Published: (2025)
Similar Items
-
BabyLlama-2: Ensemble-Distilled Models Consistently Outperform Teachers With Limited Data
by: Tastet, Jean-Loup, et al.
Published: (2024) -
Generating Completions for Broca's Aphasic Sentences Using Large Language Models
by: van Vaals, Sijbren, et al.
Published: (2024) -
mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models
by: Lai, Huiyuan, et al.
Published: (2024) -
IT5: Text-to-text Pretraining for Italian Language Understanding and Generation
by: Sarti, Gabriele, et al.
Published: (2022) -
Puzzled By ChatGPT? No more! A Jigsaw Puzzle to Promote AI Literacy and Awareness
by: Padovani, Francesca, et al.
Published: (2026)