Saved in:
Bibliographic Details
Main Authors: Baker, Henrietta, Hallowell, Matthew R., Tixier, Antoine J. -P.
Format: Preprint
Published: 2019
Subjects:
Online Access:https://arxiv.org/abs/1907.11769
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866913146713669632
author Baker, Henrietta
Hallowell, Matthew R.
Tixier, Antoine J. -P.
author_facet Baker, Henrietta
Hallowell, Matthew R.
Tixier, Antoine J. -P.
contents In light of the increasing availability of digitally recorded safety reports in the construction industry, it is important to develop methods to exploit these data to improve our understanding of safety incidents and ability to learn from them. In this study, we compare several approaches to automatically learn injury precursors from raw construction accident reports. More precisely, we experiment with two state-of-the-art deep learning architectures for Natural Language Processing (NLP), Convolutional Neural Networks (CNN) and Hierarchical Attention Networks (HAN), and with the established Term Frequency - Inverse Document Frequency representation (TF-IDF) + Support Vector Machine (SVM) approach. For each model, we provide a method to identify (after training) the textual patterns that are, on average, the most predictive of each safety outcome. We show that among those pieces of text, valid injury precursors can be found. The proposed methods can also be used by the user to visualize and understand the models' predictions.
format Preprint
id arxiv_https___arxiv_org_abs_1907_11769
institution arXiv
publishDate 2019
record_format arxiv
spellingShingle Automatically Learning Construction Injury Precursors from Text
Baker, Henrietta
Hallowell, Matthew R.
Tixier, Antoine J. -P.
Computation and Language
In light of the increasing availability of digitally recorded safety reports in the construction industry, it is important to develop methods to exploit these data to improve our understanding of safety incidents and ability to learn from them. In this study, we compare several approaches to automatically learn injury precursors from raw construction accident reports. More precisely, we experiment with two state-of-the-art deep learning architectures for Natural Language Processing (NLP), Convolutional Neural Networks (CNN) and Hierarchical Attention Networks (HAN), and with the established Term Frequency - Inverse Document Frequency representation (TF-IDF) + Support Vector Machine (SVM) approach. For each model, we provide a method to identify (after training) the textual patterns that are, on average, the most predictive of each safety outcome. We show that among those pieces of text, valid injury precursors can be found. The proposed methods can also be used by the user to visualize and understand the models' predictions.
title Automatically Learning Construction Injury Precursors from Text
topic Computation and Language
url https://arxiv.org/abs/1907.11769