Saved in:
Bibliographic Details
Main Author: Zhang, Wen
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2605.20558
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866910247273103360
author Zhang, Wen
author_facet Zhang, Wen
contents Neural morphological generation systems often achieve high aggregate accuracy on benchmark datasets, yet such performance can conceal systematic errors concentrated in rare morphological subclasses. We examine Japanese past-tense verb inflection and show that a very small, structurally specific irregular subtype (<1% of data) accounts for a disproportionate share of model errors. Controlled ablation experiments demonstrate that removing this subtype yields larger improvements in generalization than removing all irregular verbs, indicating that not all irregularity contributes equally to model instability. These findings suggest that error concentration is driven by the interaction between extreme low-frequency morphological patterns and specific morphophonological processes, particularly gemination. We argue that morphological evaluation should incorporate finer-grained subclass analysis beyond standard conjugation categories.
format Preprint
id arxiv_https___arxiv_org_abs_2605_20558
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle When Irregularity Helps: A Subclass Analysis of Inductive Bias in Neural Morphology
Zhang, Wen
Computation and Language
Neural morphological generation systems often achieve high aggregate accuracy on benchmark datasets, yet such performance can conceal systematic errors concentrated in rare morphological subclasses. We examine Japanese past-tense verb inflection and show that a very small, structurally specific irregular subtype (<1% of data) accounts for a disproportionate share of model errors. Controlled ablation experiments demonstrate that removing this subtype yields larger improvements in generalization than removing all irregular verbs, indicating that not all irregularity contributes equally to model instability. These findings suggest that error concentration is driven by the interaction between extreme low-frequency morphological patterns and specific morphophonological processes, particularly gemination. We argue that morphological evaluation should incorporate finer-grained subclass analysis beyond standard conjugation categories.
title When Irregularity Helps: A Subclass Analysis of Inductive Bias in Neural Morphology
topic Computation and Language
url https://arxiv.org/abs/2605.20558