Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Anuyah, Sydney, Vanschaik, Jack, Jain, Palak, Lehman, Sawyer, Chakraborty, Sunandan
Format:	Preprint
Published:	2025
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2503.06076
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866917948418949120
author	Anuyah, Sydney Vanschaik, Jack Jain, Palak Lehman, Sawyer Chakraborty, Sunandan
author_facet	Anuyah, Sydney Vanschaik, Jack Jain, Palak Lehman, Sawyer Chakraborty, Sunandan
contents	We conduct an empirical analysis of neural network architectures and data transfer strategies for causal relation extraction. By conducting experiments with various contextual embedding layers and architectural components, we show that a relatively straightforward BioBERT-BiGRU relation extraction model generalizes better than other architectures across varying web-based sources and annotation strategies. Furthermore, we introduce a metric for evaluating transfer performance, $F1_{phrase}$ that emphasizes noun phrase localization rather than directly matching target tags. Using this metric, we can conduct data transfer experiments, ultimately revealing that augmentation with data with varying domains and annotation styles can improve performance. Data augmentation is especially beneficial when an adequate proportion of implicitly and explicitly causal sentences are included.
format	Preprint
id	arxiv_https___arxiv_org_abs_2503_06076
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	An Empirical Study of Causal Relation Extraction Transfer: Design and Data Anuyah, Sydney Vanschaik, Jack Jain, Palak Lehman, Sawyer Chakraborty, Sunandan Computation and Language We conduct an empirical analysis of neural network architectures and data transfer strategies for causal relation extraction. By conducting experiments with various contextual embedding layers and architectural components, we show that a relatively straightforward BioBERT-BiGRU relation extraction model generalizes better than other architectures across varying web-based sources and annotation strategies. Furthermore, we introduce a metric for evaluating transfer performance, $F1_{phrase}$ that emphasizes noun phrase localization rather than directly matching target tags. Using this metric, we can conduct data transfer experiments, ultimately revealing that augmentation with data with varying domains and annotation styles can improve performance. Data augmentation is especially beneficial when an adequate proportion of implicitly and explicitly causal sentences are included.
title	An Empirical Study of Causal Relation Extraction Transfer: Design and Data
topic	Computation and Language
url	https://arxiv.org/abs/2503.06076

Similar Items