Date of Award


Degree Name

Computer Science


College of Engineering and Computer Sciences

Type of Degree


Document Type


First Advisor

Dr. James D. McIntosh, Committee Chairperson

Second Advisor

Dr. Tanvir Irfan Chowdhury

Third Advisor

Dr. Ammar Alzarrad

Fourth Advisor

Dr. Haroon Malik


Mining safety and health in the US can be better understood through the application of machine learning techniques to data collected by the Mine Safety and Health Administration (MSHA). By identifying hazardous conditions that could lead to accidents before they occur, valuable insights can be gained by MSHA, mining operators, and miners. In this study, we propose using a Random Forest machine learning model to predict whether a given mining violation will lead to an accident, and if so, whether it will be fatal or non-fatal. To achieve this, the model is trained on MSHA violation data and the sum of scheduled accident charges within 35 days of the violation. We experiment with different predictive models using varying data columns, training set sizes, prediction classes, and hyperparameters to achieve a reliable prediction. One of the challenges in generating these models is accurately predicting the sparse class of accidents, as opposed to the abundant class of no accidents. To address this, we propose utilizing sample minimizing to balance the false negative and false positive rate and create a more accurate predictive model. Our results demonstrate, with a high degree of confidence, the potential for machine learning to improve mine safety and health by identifying hazardous conditions and mitigating the risk of accidents.


Mine safety – Research – United States.

Machine learning.

Mine accidents.

Machine learning – Technique.

Coal mines and mining – United States – Safety measures.

Coal mines and mining – Health aspects – United States.

Coal miners – Health aspects – United States.