Saved in:
Bibliographic Details
Main Authors: Park, Jinwook, Kim, Kangil
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2509.20734
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • Unsupervised neural grammar induction aims to learn interpretable hierarchical structures from language data. However, existing models face an expressiveness bottleneck, often resulting in unnecessarily large yet underperforming grammars. We identify a core issue, $\textit{probability distribution collapse}$, as the underlying cause of this limitation. We analyze when and how the collapse emerges across key components of neural parameterization and introduce a targeted solution, $\textit{collapse-relaxing neural parameterization}$, to mitigate it. Our approach substantially improves parsing performance while enabling the use of significantly more compact grammars across a wide range of languages, as demonstrated through extensive empirical analysis.