This is the project script repository for UIUC course CS598 Deap Learning for Health Data-Reproducing the Paper "Learning Latent Space Representations to Predict Patient Outcomes: Model Development and Validation. The original github repository is https://github.com/subendhu19/CLOUT
After communication with the author of the paper, structured datasets from MIMIC-III (version 1.4) were preprocessed using the script Data_processing_modified_from_source_code.ipynbmodified from the author provided processing script
process_mimic.py`.
Scripts developed during the report drafting phase of the project were all put in the folder "Old_codes". Scripts updated and developed for the final report include the follows. All scripts for models include sections of data loading, model construction, model training and prediction. Some models also incude the risk factor interpretation section.
- Logistic Regression + concatednated features:
Logistic Regression Modified from Source Code.ipynb
- RETAIN:
Baseline_RETAIN_ICD9.ipynb
(Using only Diagnosis ICD9 codes);Baseline_RETAIN_All_Features.ipynb
(Using concatenated ICD+Med+Lab). - CLOUT with Auto-Encoder (AE):
AE_modified_from_source_code.ipynb
(Generating hidden states that are used to weight the concatenated features for CLOUT),CLOUT_and_AE_modifed_from_source_code.ipynb
. - LSTM:
LSTM_ICD9.ipynb
Auto-Encoder (AE) and Correlational Auto-Encoder (CAE) were constructed by first and second scripts listed below. The embedding weights learned from AE or CAE were used to build the representations of clinical features that fed the LSTM model as encounter vector alone or concatenated with embeddings of clinical features.
- AE (Auto-Encoder that generates the AE embeddings):
AE_modified_from_source_code.ipynb
- CAE (Correlated Auto-Encoder that generates the latent space embeddings):
CAE_modifed_from_source_code.ipynb
- CLOUT + AE only:
CLOUT_and_AE_modifed_from_source_code.ipynb
- CLOUT + AE with concatenated features:
CLOUT_and_AE_concatenation_modified_from_source_code.ipynb
- CLOUT + concatenated features:
CLOUT_Concat_modifed_from_surce_code.ipynb
- CLOUT + latent space only:
CLOUT_and_Latent_Space_only_modified_from_source_code.ipynb
- CLOUT + latent space with concatenated features"
CLOUT_Concat_Latent_modified_from_source_code.ipynb
. This is the final model.