-
Notifications
You must be signed in to change notification settings - Fork 179
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Assemble a diploid assembly with ONT long reads #1985
Comments
|
Thank you very much for your reply. Our target genome is very diverged (I don't know the exact value between two haplotypes though, it is probably 2-3% or larger). Anyway, thank you very much for your input. |
You have a draft assembly though to align to right? I would suggest trying that to get an estimate of error. If you have short-read data you can also estimate the haplotype diversity using GenomeScope. If you want to separate haplotypes, I wouldn't use any type of corrected read. Correction will not preserve haplotype phasing within reads and so you'll end up with mixed haplotype reads which means haplotype separation from the assembly will be impossible. |
I'm closing since you have a way forward but feel free to comment on this issue regarding how/if the run worked. |
Yes, I have a mosaic assembly produced by flye using the same ONT data. It was polished by two rounds internal polishment of flye and one additional polishment using POLCA with illumina short reads. I think I can use this assembly to estimate the error rate of my ONT long reads. |
Hi,
Thank you very much for developing Canu and also thank you very much for supporting the scientific community.
I have some ONT long reads (42x) basedcalled using guppy v4 for a diploid plant genome, which is highly heterozygous. Recently, I tried to reconstruct both haplotypes for the gnome using a read-based phasing approach. However, with a mosaic haploid assembly, I cannot perfectly phase/split reads to two haplotype groups due to the alignment bias. Now, I am thinking about directly assembling a diploid assembly using canu without collapsing haplotypes, however, I am kind of confused by the batoption.
I saw from issue #1715, you recommended -trim-assemble batOptions="-eg 0.12 -sb 0.01 -dg 12 -db 12 -dr 6 -ca 1000 -cp 10" "correctedErrorRate=0.12" -pacbio-hifi , I am wondering is this suitable for my case? My raw ONT long reads have mean read accuracy around Q10 (error rate is around 10%) , and all reads have accuracy > Q7 (error rate 19%).
My questions are:
Could you please give a little hint? Thank you very much.
Best wishes,
Yutang
The text was updated successfully, but these errors were encountered: