Saved in:
Bibliographic Details
Main Authors: Xu, Nan, Zhang, Hongming, Chen, Jianshu
Format: Preprint
Published: 2023
Subjects:
Online Access:https://arxiv.org/abs/2305.13521
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866914654868996096
author Xu, Nan
Zhang, Hongming
Chen, Jianshu
author_facet Xu, Nan
Zhang, Hongming
Chen, Jianshu
contents Existing event-centric NLP models often only apply to the pre-defined ontology, which significantly restricts their generalization capabilities. This paper presents CEO, a novel Corpus-based Event Ontology induction model to relax the restriction imposed by pre-defined event ontologies. Without direct supervision, CEO leverages distant supervision from available summary datasets to detect corpus-wise salient events and exploits external event knowledge to force events within a short distance to have close embeddings. Experiments on three popular event datasets show that the schema induced by CEO has better coverage and higher accuracy than previous methods. Moreover, CEO is the first event ontology induction model that can induce a hierarchical event ontology with meaningful names on eleven open-domain corpora, making the induced schema more trustworthy and easier to be further curated.
format Preprint
id arxiv_https___arxiv_org_abs_2305_13521
institution arXiv
publishDate 2023
record_format arxiv
spellingShingle CEO: Corpus-based Open-Domain Event Ontology Induction
Xu, Nan
Zhang, Hongming
Chen, Jianshu
Computation and Language
Existing event-centric NLP models often only apply to the pre-defined ontology, which significantly restricts their generalization capabilities. This paper presents CEO, a novel Corpus-based Event Ontology induction model to relax the restriction imposed by pre-defined event ontologies. Without direct supervision, CEO leverages distant supervision from available summary datasets to detect corpus-wise salient events and exploits external event knowledge to force events within a short distance to have close embeddings. Experiments on three popular event datasets show that the schema induced by CEO has better coverage and higher accuracy than previous methods. Moreover, CEO is the first event ontology induction model that can induce a hierarchical event ontology with meaningful names on eleven open-domain corpora, making the induced schema more trustworthy and easier to be further curated.
title CEO: Corpus-based Open-Domain Event Ontology Induction
topic Computation and Language
url https://arxiv.org/abs/2305.13521