Saved in:
Bibliographic Details
Main Authors: Anuyah, Sydney, Vanschaik, Jack, Jain, Palak, Lehman, Sawyer, Chakraborty, Sunandan
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2503.06076
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866917948418949120
author Anuyah, Sydney
Vanschaik, Jack
Jain, Palak
Lehman, Sawyer
Chakraborty, Sunandan
author_facet Anuyah, Sydney
Vanschaik, Jack
Jain, Palak
Lehman, Sawyer
Chakraborty, Sunandan
contents We conduct an empirical analysis of neural network architectures and data transfer strategies for causal relation extraction. By conducting experiments with various contextual embedding layers and architectural components, we show that a relatively straightforward BioBERT-BiGRU relation extraction model generalizes better than other architectures across varying web-based sources and annotation strategies. Furthermore, we introduce a metric for evaluating transfer performance, $F1_{phrase}$ that emphasizes noun phrase localization rather than directly matching target tags. Using this metric, we can conduct data transfer experiments, ultimately revealing that augmentation with data with varying domains and annotation styles can improve performance. Data augmentation is especially beneficial when an adequate proportion of implicitly and explicitly causal sentences are included.
format Preprint
id arxiv_https___arxiv_org_abs_2503_06076
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle An Empirical Study of Causal Relation Extraction Transfer: Design and Data
Anuyah, Sydney
Vanschaik, Jack
Jain, Palak
Lehman, Sawyer
Chakraborty, Sunandan
Computation and Language
We conduct an empirical analysis of neural network architectures and data transfer strategies for causal relation extraction. By conducting experiments with various contextual embedding layers and architectural components, we show that a relatively straightforward BioBERT-BiGRU relation extraction model generalizes better than other architectures across varying web-based sources and annotation strategies. Furthermore, we introduce a metric for evaluating transfer performance, $F1_{phrase}$ that emphasizes noun phrase localization rather than directly matching target tags. Using this metric, we can conduct data transfer experiments, ultimately revealing that augmentation with data with varying domains and annotation styles can improve performance. Data augmentation is especially beneficial when an adequate proportion of implicitly and explicitly causal sentences are included.
title An Empirical Study of Causal Relation Extraction Transfer: Design and Data
topic Computation and Language
url https://arxiv.org/abs/2503.06076