WAINWRIGHT, Richard (2023). Supporting medical decision-making using machine learning. Doctoral, Sheffield Hallam University. [Thesis]
Documents
32312:620891
PDF
Wainwright_2023_PhD_SupportingMedicalDecision.pdf - Accepted Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.
Wainwright_2023_PhD_SupportingMedicalDecision.pdf - Accepted Version
Available under License Creative Commons Attribution Non-commercial No Derivatives.
Download (24MB) | Preview
Abstract
As the strain on health care continues to grow worldwide, the need for
reliable decision-making has never been more apparent. The computerisation
of electronic health records has provided a wealth of data that can be applied
to various medical use cases. Machine Learning algorithms are exploited to
try and assist with making effective decisions. The resulting contributions
within this work demonstrate that it is possible to lean on advancements in
computer science to develop support tools for medical practitioners which
assist in their decision-making processes.
This thesis contributes four core advances to the research domain: Firstly
the enhancement of current mortality prediction systems in intensive care
units was considered. Comparing multiple Machine Learning classifiers with
optimised pipelines produced results that were both comparable and more
effective at determining patient mortality than the existing APACHE II model.
The most encouraging classifier was Decision Trees whilst being trained using:
K-fold cross validation, Grid search hyper-parameter tuning and SMOTE
achieving an average AUROC score of 0.93 and accuracy of 0.92. Unlike
other mortality prediction systems which are often trained on small cohorts
of data, a method of retraining and optimising for different patient cohorts is
introduced. Retraining based on a patients age or admission in to the ICU is
also considered as a novel approach of keeping support tools up to date.
An ensemble imputation method has been developed that can be used to
generate the missing data in a real life dataset. This has produced accuracy
and recall results comparable to current state of the art techniques when applied to the Cleveland hospital dataset.
In this work, strategies to rebalance datasets are investigated to predict
early onset Sepsis. One promising approach examined in this thesis is the use
of the RUSboost algorithm. This enabled the optimisation of a classifier that
has a high fidelity without overfitting.
More Information
Statistics
Downloads
Downloads per month over past year
Metrics
Altmetric Badge
Dimensions Badge
Share
Actions (login required)
View Item |