Nanopolish call-methylation Processing Speed #1113

ghost · 2023-09-24T17:34:55Z

Hello, I'm using nanopolish call-methylation on an Ontario long read data with an average read depth of 35. I'm using 40 processors with 4 GB of memory per processor. I'm using the standard parameters (-t, -r, -b, -g). The job has been running for 12 days, and it has only completed processing chromosomes 1-4 and 10-22 so far. I'm wondering if there are ways to speed up the process, or if this long processing time is expected for this tool?

hasindu2008 · 2023-09-24T23:36:09Z

Likely to be the fast5 IO bottleneck
You may try https://github.com/hasindu2008/f5c/ with the --iop option to spawn parallel processes for IO. F5c should give same output as nanopolish.

To go even more faster, the best solution is to convert your fast5 to blow5 using slow5tools and then run nanopolish or f5c on it. Instructions are at https://hasindu2008.github.io/slow5tools/workflows.html

.

ghost · 2023-09-25T03:16:35Z

Thank you for the quick and helpful response. I will try your suggestions. I am wondering if it is possible to use intervals (one Chr per job) when running the Nanopolish call-methylation and the calculate_methylation_frequency.py, and at the end concatenate the methylation_frequency.tsv files from each chromosome into a single file?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Nanopolish call-methylation Processing Speed #1113

Nanopolish call-methylation Processing Speed #1113

ghost commented Sep 24, 2023

hasindu2008 commented Sep 24, 2023

ghost commented Sep 25, 2023

Nanopolish call-methylation Processing Speed #1113

Nanopolish call-methylation Processing Speed #1113

Comments

ghost commented Sep 24, 2023

hasindu2008 commented Sep 24, 2023

ghost commented Sep 25, 2023