Saved in:
Bibliographic Details
Main Authors: Güven, Arzu Burcu, Rogers, Anna, van der Goot, Rob
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2511.08199
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866918196443873280
author Güven, Arzu Burcu
Rogers, Anna
van der Goot, Rob
author_facet Güven, Arzu Burcu
Rogers, Anna
van der Goot, Rob
contents We examine the syntactic properties of BabyLM corpus, and age-groups within CHILDES. While we find that CHILDES does not exhibit strong syntactic differentiation by age, we show that the syntactic knowledge about the training data can be helpful in interpreting model performance on linguistic tasks. For curriculum learning, we explore developmental and several alternative cognitively inspired curriculum approaches. We find that some curricula help with reading tasks, but the main performance improvement come from using the subset of syntactically categorizable data, rather than the full noisy corpus.
format Preprint
id arxiv_https___arxiv_org_abs_2511_08199
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Do Syntactic Categories Help in Developmentally Motivated Curriculum Learning for Language Models?
Güven, Arzu Burcu
Rogers, Anna
van der Goot, Rob
Computation and Language
We examine the syntactic properties of BabyLM corpus, and age-groups within CHILDES. While we find that CHILDES does not exhibit strong syntactic differentiation by age, we show that the syntactic knowledge about the training data can be helpful in interpreting model performance on linguistic tasks. For curriculum learning, we explore developmental and several alternative cognitively inspired curriculum approaches. We find that some curricula help with reading tasks, but the main performance improvement come from using the subset of syntactically categorizable data, rather than the full noisy corpus.
title Do Syntactic Categories Help in Developmentally Motivated Curriculum Learning for Language Models?
topic Computation and Language
url https://arxiv.org/abs/2511.08199