Saved in:
Bibliographic Details
Main Authors: Tang, Xuri, Liu, Daohuan
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2601.04197
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911359957991424
author Tang, Xuri
Liu, Daohuan
author_facet Tang, Xuri
Liu, Daohuan
contents This paper proposes a fully unsupervised approach to the construction of verb collostruction database for Chinese language, aimed at complementing LLMs by providing explicit and interpretable rules for application scenarios where explanation and interpretability are indispensable. The paper formally defines a verb collostruction as a projective, rooted, ordered, and directed acyclic graph and employs a series of clustering algorithms to generate collostructions for a given verb from a list of sentences retrieved from large-scale corpus. Statistical analysis demonstrates that the generated collostructions possess the design features of functional independence and graded typicality. Evaluation with verb grammatical error correction shows that the error correction algorithm based on maximum matching with collostructions achieves better performance than LLMs.
format Preprint
id arxiv_https___arxiv_org_abs_2601_04197
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle Automatic Construction of Chinese Verb Collostruction Database
Tang, Xuri
Liu, Daohuan
Computation and Language
This paper proposes a fully unsupervised approach to the construction of verb collostruction database for Chinese language, aimed at complementing LLMs by providing explicit and interpretable rules for application scenarios where explanation and interpretability are indispensable. The paper formally defines a verb collostruction as a projective, rooted, ordered, and directed acyclic graph and employs a series of clustering algorithms to generate collostructions for a given verb from a list of sentences retrieved from large-scale corpus. Statistical analysis demonstrates that the generated collostructions possess the design features of functional independence and graded typicality. Evaluation with verb grammatical error correction shows that the error correction algorithm based on maximum matching with collostructions achieves better performance than LLMs.
title Automatic Construction of Chinese Verb Collostruction Database
topic Computation and Language
url https://arxiv.org/abs/2601.04197