Date of Award
2023
Degree Name
Computer Science
College
College of Engineering and Computer Sciences
Type of Degree
M.S.
Document Type
Thesis
First Advisor
Dr. James D. McIntosh, Committee Chairperson
Second Advisor
Dr. Tanvir Irfan Chowdhury
Third Advisor
Dr. Ammar Alzarrad
Fourth Advisor
Dr. Haroon Malik
Abstract
Mining safety and health in the US can be better understood through the application of machine learning techniques to data collected by the Mine Safety and Health Administration (MSHA). By identifying hazardous conditions that could lead to accidents before they occur, valuable insights can be gained by MSHA, mining operators, and miners. In this study, we propose using a Random Forest machine learning model to predict whether a given mining violation will lead to an accident, and if so, whether it will be fatal or non-fatal. To achieve this, the model is trained on MSHA violation data and the sum of scheduled accident charges within 35 days of the violation. We experiment with different predictive models using varying data columns, training set sizes, prediction classes, and hyperparameters to achieve a reliable prediction. One of the challenges in generating these models is accurately predicting the sparse class of accidents, as opposed to the abundant class of no accidents. To address this, we propose utilizing sample minimizing to balance the false negative and false positive rate and create a more accurate predictive model. Our results demonstrate, with a high degree of confidence, the potential for machine learning to improve mine safety and health by identifying hazardous conditions and mitigating the risk of accidents.
Subject(s)
Mine safety – Research – United States.
Machine learning.
Mine accidents.
Machine learning – Technique.
Coal mines and mining – United States – Safety measures.
Coal mines and mining – Health aspects – United States.
Coal miners – Health aspects – United States.
Recommended Citation
Milam, Olivia, "Identifying hazardous patterns in MSHA data using random forests" (2023). Theses, Dissertations and Capstones. 1813.
https://mds.marshall.edu/etd/1813
Included in
Computer Engineering Commons, Risk Analysis Commons, Social and Behavioral Sciences Commons