data management

Update with new edition of data package is the most conspicious activity lately in this project. This document describes the steps involved to handle such an update.

check out the latest submission data from the 'pipeline' repository
update the matching gene background data
run the script called raw_data_process.py (under directory tools) to process the new data. This script both creates the proper data format required by the loading process and re-generates three relevant config files. This may not be obvious: the loading config files depend on the source data, so this step must be done before rebuilding the application.

like this python3 tools/raw_data_process.py
rebulid the application

like this dev/build.sh
load the data

like this dev/reload.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data_management.md

data_management.md

data management

Files

data_management.md

Latest commit

History

data_management.md

File metadata and controls

data management