Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feature request: filter reads with kmers from bam file? #39

Open
kevfengler227 opened this issue Dec 11, 2023 · 5 comments
Open

feature request: filter reads with kmers from bam file? #39

kevfengler227 opened this issue Dec 11, 2023 · 5 comments
Labels
enhancement New feature or request

Comments

@kevfengler227
Copy link

Would it be possible to add a feature to run meryl-lookup exclude/include on a BAM instead of a fasta and output BAM? This would be very useful for filtering reads from PacBio or ONT data in their original BAM format without going through FASTA intermediary. Or at least just output a list of reads from the fasta instead of generating the filtered fasta?

Thanks,
KF

@arangrhie arangrhie added the enhancement New feature or request label Dec 11, 2023
@kevfengler227
Copy link
Author

kevfengler227 commented Dec 12, 2023

To that end, does meryl-lookup find homo-polymer compressed kmers in the reads when the database is made with compressed kmers?

@kevfengler227
Copy link
Author

It appears that is does not. for removal of long reads this may be very beneficial.

@brianwalenz
Copy link
Member

Both are excellent suggestions, and the tools are in dire need of a refresh. We'll (hopefully) get it done late winter/early spring.

BAM support shouldn't be too hard.

Compressed kmer support needed a bit more engineering effort than I wanted to put into the current version, but will definitely be in the next version.

@kevfengler227
Copy link
Author

Thanks! Even in it's current form, meryl is a godsend for identifying unique kmers from a target sequence and removing reads that contain those kmers. But these two enhancements would be awesome.

@kevfengler227
Copy link
Author

I would still be very much interested in these two enhancements. Looking forward to next version.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants