You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository has been archived by the owner on Sep 29, 2023. It is now read-only.
It would be helpful to provide documentation into how CLMBR performs data splitting and how that relates to the parameters train_end_date and val_end_date and banned_patient_file in clmbr_create_info. There should further be a discussion of best practices, and a discussion of any trade-offs, for selecting the clmbr_create_info parameters for different downstream study designs. There would ideally be examples of how to select these parameters for time splitting and patient splitting designs for different assumptions on allowed (date/time and patient) overlap between various partitions within and across pretraining and cohort-relevant partitions of the data.
The text was updated successfully, but these errors were encountered:
Sign up for freeto subscribe to this conversation on GitHub.
Already have an account?
Sign in.
It would be helpful to provide documentation into how CLMBR performs data splitting and how that relates to the parameters
train_end_date
andval_end_date
andbanned_patient_file
inclmbr_create_info
. There should further be a discussion of best practices, and a discussion of any trade-offs, for selecting theclmbr_create_info
parameters for different downstream study designs. There would ideally be examples of how to select these parameters for time splitting and patient splitting designs for different assumptions on allowed (date/time and patient) overlap between various partitions within and across pretraining and cohort-relevant partitions of the data.The text was updated successfully, but these errors were encountered: