Saved in:
Bibliographic Details
Main Authors: Baker, Henrietta, Hallowell, Matthew R., Tixier, Antoine J. -P.
Format: Preprint
Published: 2019
Subjects:
Online Access:https://arxiv.org/abs/1908.05972
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866916029177790464
author Baker, Henrietta
Hallowell, Matthew R.
Tixier, Antoine J. -P.
author_facet Baker, Henrietta
Hallowell, Matthew R.
Tixier, Antoine J. -P.
contents This paper significantly improves on, and finishes to validate, an approach proposed in previous research in which safety outcomes were predicted from attributes with machine learning. Like in the original study, we use Natural Language Processing (NLP) to extract fundamental attributes from raw incident reports and machine learning models are trained to predict safety outcomes. The outcomes predicted here are injury severity, injury type, body part impacted, and incident type. However, unlike in the original study, safety outcomes were not extracted via NLP but were provided by independent human annotations, eliminating any potential source of artificial correlation between predictors and predictands. Results show that attributes are still highly predictive, confirming the validity of the original approach. Other improvements brought by the current study include the use of (1) a much larger dataset featuring more than 90,000 reports, (2) two new models, XGBoost and linear SVM (Support Vector Machines), (3) model stacking, (4) a more straightforward experimental setup with more appropriate performance metrics, and (5) an analysis of per-category attribute importance scores. Finally, the injury severity outcome is well predicted, which was not the case in the original study. This is a significant advancement.
format Preprint
id arxiv_https___arxiv_org_abs_1908_05972
institution arXiv
publishDate 2019
record_format arxiv
spellingShingle AI-based Prediction of Independent Construction Safety Outcomes from Universal Attributes
Baker, Henrietta
Hallowell, Matthew R.
Tixier, Antoine J. -P.
Machine Learning
This paper significantly improves on, and finishes to validate, an approach proposed in previous research in which safety outcomes were predicted from attributes with machine learning. Like in the original study, we use Natural Language Processing (NLP) to extract fundamental attributes from raw incident reports and machine learning models are trained to predict safety outcomes. The outcomes predicted here are injury severity, injury type, body part impacted, and incident type. However, unlike in the original study, safety outcomes were not extracted via NLP but were provided by independent human annotations, eliminating any potential source of artificial correlation between predictors and predictands. Results show that attributes are still highly predictive, confirming the validity of the original approach. Other improvements brought by the current study include the use of (1) a much larger dataset featuring more than 90,000 reports, (2) two new models, XGBoost and linear SVM (Support Vector Machines), (3) model stacking, (4) a more straightforward experimental setup with more appropriate performance metrics, and (5) an analysis of per-category attribute importance scores. Finally, the injury severity outcome is well predicted, which was not the case in the original study. This is a significant advancement.
title AI-based Prediction of Independent Construction Safety Outcomes from Universal Attributes
topic Machine Learning
url https://arxiv.org/abs/1908.05972