Staff View: When Irregularity Helps: A Subclass Analysis of Inductive Bias in Neural Morphology :: Library Catalog

Bibliographic Details
Main Author:	Zhang, Wen
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2605.20558

_version_	1866910247273103360
author	Zhang, Wen
author_facet	Zhang, Wen
contents	Neural morphological generation systems often achieve high aggregate accuracy on benchmark datasets, yet such performance can conceal systematic errors concentrated in rare morphological subclasses. We examine Japanese past-tense verb inflection and show that a very small, structurally specific irregular subtype (<1% of data) accounts for a disproportionate share of model errors. Controlled ablation experiments demonstrate that removing this subtype yields larger improvements in generalization than removing all irregular verbs, indicating that not all irregularity contributes equally to model instability. These findings suggest that error concentration is driven by the interaction between extreme low-frequency morphological patterns and specific morphophonological processes, particularly gemination. We argue that morphological evaluation should incorporate finer-grained subclass analysis beyond standard conjugation categories.
format	Preprint
id	arxiv_https___arxiv_org_abs_2605_20558
institution	arXiv
publishDate	2026
record_format	arxiv
title	When Irregularity Helps: A Subclass Analysis of Inductive Bias in Neural Morphology
topic	Computation and Language
url	https://arxiv.org/abs/2605.20558

Similar Items