:: Library Catalog

Bibliographic Details
Main Authors:	Shi, Shaozhen; Matusevych, Yevgen; Nissim, Malvina
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2410.22081

Similar Items

BabyLlama-2: Ensemble-Distilled Models Consistently Outperform Teachers With Limited Data
by: Tastet, Jean-Loup, et al.
Published: (2024)

Generating Completions for Broca's Aphasic Sentences Using Large Language Models
by: van Vaals, Sijbren, et al.
Published: (2024)

mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models
by: Lai, Huiyuan, et al.
Published: (2024)

IT5: Text-to-text Pretraining for Italian Language Understanding and Generation
by: Sarti, Gabriele, et al.
Published: (2022)

Puzzled By ChatGPT? No more! A Jigsaw Puzzle to Promote AI Literacy and Awareness
by: Padovani, Francesca, et al.
Published: (2026)

Is Child-Directed Language Optimized for Word Learning? A Computational Study of Verb Meaning Acquisition
by: Padovani, Francesca, et al.
Published: (2026)

Child-Directed Language Does Not Consistently Boost Syntax Learning in Language Models
by: Padovani, Francesca, et al.
Published: (2025)

TACLer: Tailored Curriculum Reinforcement Learning for Efficient Reasoning
by: Lai, Huiyuan, et al.
Published: (2026)

Visually Grounded Speech Models have a Mutual Exclusivity Bias
by: Nortje, Leanne, et al.
Published: (2024)

The mutual exclusivity bias of bilingual visually grounded speech models
by: Oneata, Dan, et al.
Published: (2025)

BabyLM Turns 3: Call for papers for the 2025 BabyLM workshop
by: Charpentier, Lucas, et al.
Published: (2025)

Multidimensional Consistency Improves Reasoning in Language Models
by: Lai, Huiyuan, et al.
Published: (2025)

When Harry Meets Superman: The Role of The Interlocutor in Persona-Based Dialogue Generation
by: Occhipinti, Daniela, et al.
Published: (2025)

BabyLM Turns 4 and Goes Multilingual: Call for Papers for the 2026 BabyLM Workshop
by: Choshen, Leshem, et al.
Published: (2026)

A gentle push funziona benissimo: making instructed models in Italian via contrastive activation steering
by: Scalena, Daniel, et al.
Published: (2024)

Practising responsibility: Ethics in NLP as a hands-on course
by: Nissim, Malvina, et al.
Published: (2025)

Small Language Models Also Work With Small Vocabularies: Probing the Linguistic Abilities of Grapheme- and Phoneme-Based Baby Llamas
by: Bunzeck, Bastian, et al.
Published: (2024)

Can Model Uncertainty Function as a Proxy for Multiple-Choice Question Item Difficulty?
by: Zotos, Leonidas, et al.
Published: (2024)

Are You Doubtful? Oh, It Might Be Difficult Then! Exploring the Use of Model Uncertainty for Question Difficulty Estimation
by: Zotos, Leonidas, et al.
Published: (2024)

The Role of the Availability Heuristic in Multiple-Choice Answering Behaviour
by: Zotos, Leonidas, et al.
Published: (2026)

Multi-property Steering of Large Language Models with Dynamic Activation Composition
by: Scalena, Daniel, et al.
Published: (2024)

Are BabyLMs Second Language Learners?
by: Edman, Lukas, et al.
Published: (2024)

When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets?
by: Iyer, Srikrishna
Published: (2024)

BAMBI: Developing Baby Language Models for Italian
by: Suozzi, Alice, et al.
Published: (2025)

Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses
by: Sarti, Gabriele, et al.
Published: (2024)

Unsupervised Word-level Quality Estimation for Machine Translation Through the Lens of Annotators (Dis)agreement
by: Sarti, Gabriele, et al.
Published: (2025)

BabyVision: Visual Reasoning Beyond Language
by: Chen, Liang, et al.
Published: (2026)

CAIT: A Syntactic Parsing Toolkit for Child-Adult InTeractions
by: Padovani, Francesca, et al.
Published: (2026)

BabyReasoningBench: Generating Developmentally-Inspired Reasoning Tasks for Evaluating Baby Language Models
by: Dhole, Kaustubh D.
Published: (2026)

Child-directed speech facilitates production, not comprehension, in BabyLMs
by: Bunzeck, Bastian, et al.
Published: (2026)

Quantifying the Plausibility of Context Reliance in Neural Machine Translation
by: Sarti, Gabriele, et al.
Published: (2023)

Mini Minds: Exploring Bebeshka and Zlata Baby Models
by: Proskurina, Irina, et al.
Published: (2023)

BabyHGRN: Exploring RNNs for Sample-Efficient Training of Language Models
by: Haller, Patrick, et al.
Published: (2024)

BAMBINO-LM: (Bilingual-)Human-Inspired Continual Pretraining of BabyLM
by: Shen, Zhewen, et al.
Published: (2024)

Steering Large Language Models for Machine Translation Personalization
by: Scalena, Daniel, et al.
Published: (2025)

ARGUS: Seeing the Influence of Narrative Features on Persuasion in Argumentative Texts
by: Nabhani, Sara, et al.
Published: (2026)

BabyLM's First Constructions: Causal probing provides a signal of learning
by: Rozner, Joshua, et al.
Published: (2025)

CLASS-IT: Conversational and Lecture-Aligned Small-Scale Instruction Tuning for BabyLMs
by: Capone, Luca, et al.
Published: (2025)

BabyBabelLM: A Multilingual Benchmark of Developmentally Plausible Training Data
by: Jumelet, Jaap, et al.
Published: (2025)

Findings of the BabyLM Challenge: Sample-Efficient Pretraining on Developmentally Plausible Corpora
by: Warstadt, Alex, et al.
Published: (2025)