feature request: filter reads with kmers from bam file? #39

kevfengler227 · 2023-12-11T20:40:23Z

Would it be possible to add a feature to run meryl-lookup exclude/include on a BAM instead of a fasta and output BAM? This would be very useful for filtering reads from PacBio or ONT data in their original BAM format without going through FASTA intermediary. Or at least just output a list of reads from the fasta instead of generating the filtered fasta?

Thanks,
KF

kevfengler227 · 2023-12-12T16:36:41Z

To that end, does meryl-lookup find homo-polymer compressed kmers in the reads when the database is made with compressed kmers?

kevfengler227 · 2023-12-13T15:43:47Z

It appears that is does not. for removal of long reads this may be very beneficial.

brianwalenz · 2023-12-13T16:50:08Z

Both are excellent suggestions, and the tools are in dire need of a refresh. We'll (hopefully) get it done late winter/early spring.

BAM support shouldn't be too hard.

Compressed kmer support needed a bit more engineering effort than I wanted to put into the current version, but will definitely be in the next version.

kevfengler227 · 2023-12-13T17:05:11Z

Thanks! Even in it's current form, meryl is a godsend for identifying unique kmers from a target sequence and removing reads that contain those kmers. But these two enhancements would be awesome.

kevfengler227 · 2024-06-12T18:57:42Z

I would still be very much interested in these two enhancements. Looking forward to next version.

arangrhie added the enhancement New feature or request label Dec 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feature request: filter reads with kmers from bam file? #39

feature request: filter reads with kmers from bam file? #39

kevfengler227 commented Dec 11, 2023

kevfengler227 commented Dec 12, 2023 •

edited

Loading

kevfengler227 commented Dec 13, 2023

brianwalenz commented Dec 13, 2023

kevfengler227 commented Dec 13, 2023

kevfengler227 commented Jun 12, 2024

feature request: filter reads with kmers from bam file? #39

feature request: filter reads with kmers from bam file? #39

Comments

kevfengler227 commented Dec 11, 2023

kevfengler227 commented Dec 12, 2023 • edited Loading

kevfengler227 commented Dec 13, 2023

brianwalenz commented Dec 13, 2023

kevfengler227 commented Dec 13, 2023

kevfengler227 commented Jun 12, 2024

kevfengler227 commented Dec 12, 2023 •

edited

Loading