HIPC Dashboard pipeline v1.4.0
Changes in version 1.4.0 (Data)
- Add additional gene expression signatures from Arunachalam et al., 2020
Changes in version 1.4.0 (Pipeline)
- Add splitting on tissue_type_term_id (split 4)
-- Split tissue_type_term_ids get individual columns in Dashboard submission files
- Add splitting on comparison field entries (split 5)
- Adapt code to new directory structure in GitHub repository
- Change "recreated_templates" to "standardized_curation_templates" in output file names
- Add writing of standardized, fully denormalized versions of data for easy reuse
- Rename submission id variables in code and standardized files for clarity
-- subm_obs_id -> sig_subm_id, uniq_obs_id -> sig_row_id
- Revise messages in code for clarity
- Write tab-delimited and CSV versions of standardized submission files to separate, specifiable directories
- Fix problem with logging using log_no_valid_symbol_vs_pmid()
- "pmid:" tags now removed immediately
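
As an illustration of the "pmid:" tag removal noted above, here is a minimal sketch in Python. It is not the pipeline's own code. The function name strip_pmid_tag, the case-insensitive matching, and the sample value are assumptions made for this example only.

```python
import re

# Hypothetical helper: drop a leading "pmid:" tag from a field value.
# Case-insensitive matching and whitespace tolerance are assumptions.
_PMID_TAG = re.compile(r"^\s*pmid:\s*", flags=re.IGNORECASE)


def strip_pmid_tag(value: str) -> str:
    """Return the value without a leading "pmid:" tag, trimmed of whitespace."""
    return _PMID_TAG.sub("", value).strip()


# Made-up placeholder identifier, not a real publication.
cleaned = strip_pmid_tag("pmid:12345678")
print(cleaned)
```

Removing the tag once, as soon as a value is read, means later steps can compare and log plain identifiers without each having to handle the prefix.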