over time, develop filtering heuristics to systematically remove false positives that repeatedly crop up #38

Open
taylorreiter opened this issue Jun 21, 2023 · 0 comments

Comments

@taylorreiter
Member

In the pub, we say, "Over time, we hope to curate a list of genes that the preHGT pipeline frequently detects as false positives and to develop a strategy to filter them out."

Originally I had thought of filtering by annotation name. @jonathaneisen suggested that we could instead create a BLAST database of known false positives and filter by sequence similarity. I think this is a much better approach than going by name. I wanted to record it here and continue brainstorming about potential strategies.
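To make the sequence-similarity idea concrete, here is a minimal sketch of what the filtering step might look like. It is not part of the preHGT pipeline. It assumes the candidates have already been searched against a curated false-positive database and the hits written in BLAST tabular format (`-outfmt 6`). The function names, the file names in the comments, and the identity, length, and e-value cutoffs are all hypothetical placeholders that would need tuning.

```python
import csv
import io

# Assumed upstream steps (file names are hypothetical, and protein
# sequences are an assumption):
#   makeblastdb -in false_positives.faa -dbtype prot -out fp_db
#   blastp -query candidates.faa -db fp_db -outfmt 6 -out hits.tsv

# Default column order of BLAST tabular output (-outfmt 6).
BLAST6_COLUMNS = [
    "qseqid", "sseqid", "pident", "length", "mismatch", "gapopen",
    "qstart", "qend", "sstart", "send", "evalue", "bitscore",
]


def false_positive_ids(blast6_handle, min_pident=90.0, min_length=100,
                       max_evalue=1e-10):
    """Return query IDs with at least one strong hit to the false-positive DB.

    The thresholds are placeholder values, not validated cutoffs.
    """
    flagged = set()
    reader = csv.DictReader(blast6_handle, fieldnames=BLAST6_COLUMNS,
                            delimiter="\t")
    for row in reader:
        if (float(row["pident"]) >= min_pident
                and int(row["length"]) >= min_length
                and float(row["evalue"]) <= max_evalue):
            flagged.add(row["qseqid"])
    return flagged


def filter_candidates(candidate_ids, flagged):
    """Keep only candidates that were not flagged as likely false positives."""
    return [c for c in candidate_ids if c not in flagged]


# Toy hits table: cand1 matches a known false positive strongly, cand2 weakly.
example_hits = (
    "cand1\tfp_geneA\t98.5\t250\t3\t0\t1\t250\t1\t250\t1e-150\t480\n"
    "cand2\tfp_geneB\t45.0\t80\t40\t2\t5\t85\t10\t90\t0.001\t35.2\n"
)
flagged = false_positive_ids(io.StringIO(example_hits))
kept = filter_candidates(["cand1", "cand2", "cand3"], flagged)
print(kept)
```

One advantage over name-based filtering is that this still catches a recurring false positive when its annotation is missing or inconsistent, since the match depends only on the sequence.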
