Saved in:
Bibliographic Details
Main Authors: Lin, Xiangyu, Jia, Weijia, Gong, Zhiguo
Format: Preprint
Published: 2024
Subjects:
Online Access:https://arxiv.org/abs/2407.01026
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866909235461226496
author Lin, Xiangyu
Jia, Weijia
Gong, Zhiguo
author_facet Lin, Xiangyu
Jia, Weijia
Gong, Zhiguo
contents Despite its popularity in sentence-level relation extraction, distantly supervised data is rarely utilized by existing work in document-level relation extraction due to its noisy nature and low information density. Among its current applications, distantly supervised data is mostly used as a whole for pertaining, which is of low time efficiency. To fill in the gap of efficient and robust utilization of distantly supervised training data, we propose Efficient Multi-Supervision for document-level relation extraction, in which we first select a subset of informative documents from the massive dataset by combining distant supervision with expert supervision, then train the model with Multi-Supervision Ranking Loss that integrates the knowledge from multiple sources of supervision to alleviate the effects of noise. The experiments demonstrate the effectiveness of our method in improving the model performance with higher time efficiency than existing baselines.
format Preprint
id arxiv_https___arxiv_org_abs_2407_01026
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Augmenting Document-level Relation Extraction with Efficient Multi-Supervision
Lin, Xiangyu
Jia, Weijia
Gong, Zhiguo
Computation and Language
Artificial Intelligence
Despite its popularity in sentence-level relation extraction, distantly supervised data is rarely utilized by existing work in document-level relation extraction due to its noisy nature and low information density. Among its current applications, distantly supervised data is mostly used as a whole for pertaining, which is of low time efficiency. To fill in the gap of efficient and robust utilization of distantly supervised training data, we propose Efficient Multi-Supervision for document-level relation extraction, in which we first select a subset of informative documents from the massive dataset by combining distant supervision with expert supervision, then train the model with Multi-Supervision Ranking Loss that integrates the knowledge from multiple sources of supervision to alleviate the effects of noise. The experiments demonstrate the effectiveness of our method in improving the model performance with higher time efficiency than existing baselines.
title Augmenting Document-level Relation Extraction with Efficient Multi-Supervision
topic Computation and Language
Artificial Intelligence
url https://arxiv.org/abs/2407.01026