Skip to content

HIPC Dashboard pipeline v1.4.0

Compare
Choose a tag to compare
@kcs3 kcs3 released this 08 Mar 22:46
· 124 commits to master since this release

Changes in version 1.4.0 (Data)

  • Add additional gene expression signatures from Arunachalam et al., 2020

Changes in version 1.4.0 (Pipeline)

  • Add splitting on tissue_type_term_id (split 4).
    -- Split tissue_type_term_ids get individual columns in Dashboard submission files
  • Add splitting on comparison field entries (split 5)
  • Adapt code to new directory structure in github repository
  • Change "recreated_templates" to "standardized_curation_templates" in output file names
  • Add writing of standardized, fully denormalized versions of data for easy reuse
  • Rename submission id variables in code and standardized files for clarity
    -- subm_obs_id -> sig_subm_id, uniq_obs_id -> sig_row_id
  • Revise messages in code for clarity
  • Separate writing tab-delimited and csv versions of standardized submission files to different, specifiable directories
  • Fix problem with logging using log_no_valid_symbol_vs_pmid()
  • "pmid:" tags now removed immediately