You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What needs to be done:
Before running any computational steps on remote nodes, a cleaning of possible human contaminants should be done.
Suggestions on how to get it done:
We could and a pre-stage which uses a local executor (calling a local comp node), which runs cleanup of all fastq files. The question is on which software to do the cleanup. I don't have any benchmarks really, but the two that springs to mind is BBduk form the BBMap package, or kraken2. Would be great to get some input form others here. Any suggestions @talnor@sofstam@JD2112@bokelund ?
The documentation should probably also refer to some publication showing the effectiveness of such a cleanup
What are the arguments for getting it done:
This way we can ensure we've taken precautions to not ending any sensitive genetic information to remote compute nodes.
Task is considered finished when:
All fastq files sent to remote nodes are free from human data.
The text was updated successfully, but these errors were encountered:
What needs to be done:
Before running any computational steps on remote nodes, a cleaning of possible human contaminants should be done.
Suggestions on how to get it done:
We could and a pre-stage which uses a local executor (calling a local comp node), which runs cleanup of all fastq files. The question is on which software to do the cleanup. I don't have any benchmarks really, but the two that springs to mind is BBduk form the BBMap package, or kraken2. Would be great to get some input form others here. Any suggestions @talnor @sofstam @JD2112 @bokelund ?
The documentation should probably also refer to some publication showing the effectiveness of such a cleanup
What are the arguments for getting it done:
This way we can ensure we've taken precautions to not ending any sensitive genetic information to remote compute nodes.
Task is considered finished when:
All fastq files sent to remote nodes are free from human data.
The text was updated successfully, but these errors were encountered: