COMP90089 Machine Learning Applications in Health Final Project.
Group #13, Semester 2, 2022.
Group members
Extracted datasets have been excluded from the repository as per PhysioNet's data use agreement.
They can be re-constructed by running the jupyter notebooks with credentialled access to MIMIC.
Notebooks:
- Feature extraction SQL
- Data exploration
- Data processing
- Dimensionality Reduction PCA
- Clustering K-Means
- Clustering DBSCAN
- Feature importance: largest cluster
- Feature importance: second largest cluster
template
: template code to use for queriesmimic
: MIMIC exploration scriptsdata
: data sources used in the projectsql
: sql scripts used to query MIMICdata_exploration
: exploratory data analysis and data processingcluster
: scripts associated with dimensionality reduction and clusteringfeature_importance_analysis
: scripts associated with feature importance analysisimg
: extracted images and plots from notebookssrc
: python helper functions
MIMIC docs
- Github MIT-LCP/mimic-iv-website (more comprehensive)
- MIMIC-IV docs website
conda_env_gbq.yml
: To access and query MIMIC on Google BigQuery locally.
conda_env_tensor_m1.yml
: To enable tensorflow with GPU on M1 macs.
conda env create -f <file>.yml
Local access to MIMIC and BigQuery via the following extension:
https://marketplace.visualstudio.com/items?itemName=GoogleCloudTools.cloudcode
Usage of data within this repository must comply with PhysioNet's data usage policy.