-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Change the input files to only take one transcriptome assembly file instead of two #47
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just a couple minor comments/edits for clarity.
demo/README.md
Outdated
We pulled the "short contigs" file from an internal S3 bucket. | ||
It contains contigs that were filtered from the Amblyomma transcriptome prior to txome merging. | ||
We also pulled short contigs (less than 75 bp) from an internal S3 bucket and added these contigs to the `contigs.fa` file (50 contigs). | ||
These are contigs that were filtered from the *Amblyomma* transcriptome prior to transcriptome merging (mid assembly pipeline). |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Consider clarifying what "mid assembly pipeline" means (afaict, it's also not clear from this readme what pipeline this is referring to)
demo/config.yml
Outdated
# ORFs should be predicted from the same transcriptome assembly as the "contigs" input file. | ||
# ORFs should have the same name (before the first period in the name) as the contigs in the | ||
# "contigs" input file. TransDecoder provides files in the proper format. | ||
# Used for cleavage peptide prediction and annotation of nonribosomal peptide synthetases, and to | ||
# remove coding transcripts from the transcriptome assembly before sORF prediction. | ||
# - orfs_nucleotides: predicted ORFs as nucleotide sequences. Should contain the same ORFs as | ||
# "orfs_amino_acids" but in nucleotide format. TransDecoder also provides this file in the proper | ||
# format. If this file contains short ORFs (< 300 nucleotides), they will not be reported as sORFs | ||
# as they are already annotated in the input. | ||
# - plmutils_model_dir: path to the directory for the plmutils model that will predict whether sORFs | ||
# are coding or non-coding. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is unrelated to this PR, but fwiw I feel you could delete all of these comments in this demo config (so that they only appear in one place, in the main top-level config.yml
). This would avoid duplicating them (and avoid the need to keep them in sync when they are changed)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
oh i love this, I'll do that, thank you!
Co-authored-by: Keith Cheveralls <[email protected]> Signed-off-by: Taylor Reiter <[email protected]>
PR checklist
Fixes #123, Resolves #456
).conda
environments.PR description