Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Change the input files to only take one transcriptome assembly file instead of two #47

Merged
merged 5 commits into from
Jun 5, 2024

Conversation

taylorreiter
Copy link
Member

@taylorreiter taylorreiter commented Jun 5, 2024

PR checklist

  • Tag the issue(s) or milestones this PR fixes (e.g. Fixes #123, Resolves #456).
  • Describe the changes you've made.
  • Describe any tests you have conducted to confirm that your changes behave as expected.
  • If you've added new software dependencies, make sure that those dependencies are included in the appropriate conda environments.
  • If you've added new functionality, make sure that the documentation is updated accordingly.
  • If you encountered bugs or features that you won't address, but should be addressed eventually, create new issues for them.

PR description

@taylorreiter taylorreiter marked this pull request as draft June 5, 2024 15:22
Copy link
Member

@keithchev keithchev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just a couple minor comments/edits for clarity.

README.md Outdated Show resolved Hide resolved
config.yml Outdated Show resolved Hide resolved
demo/README.md Outdated
We pulled the "short contigs" file from an internal S3 bucket.
It contains contigs that were filtered from the Amblyomma transcriptome prior to txome merging.
We also pulled short contigs (less than 75 bp) from an internal S3 bucket and added these contigs to the `contigs.fa` file (50 contigs).
These are contigs that were filtered from the *Amblyomma* transcriptome prior to transcriptome merging (mid assembly pipeline).
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Consider clarifying what "mid assembly pipeline" means (afaict, it's also not clear from this readme what pipeline this is referring to)

demo/config.yml Outdated
Comment on lines 20 to 30
# ORFs should be predicted from the same transcriptome assembly as the "contigs" input file.
# ORFs should have the same name (before the first period in the name) as the contigs in the
# "contigs" input file. TransDecoder provides files in the proper format.
# Used for cleavage peptide prediction and annotation of nonribosomal peptide synthetases, and to
# remove coding transcripts from the transcriptome assembly before sORF prediction.
# - orfs_nucleotides: predicted ORFs as nucleotide sequences. Should contain the same ORFs as
# "orfs_amino_acids" but in nucleotide format. TransDecoder also provides this file in the proper
# format. If this file contains short ORFs (< 300 nucleotides), they will not be reported as sORFs
# as they are already annotated in the input.
# - plmutils_model_dir: path to the directory for the plmutils model that will predict whether sORFs
# are coding or non-coding.
Copy link
Member

@keithchev keithchev Jun 5, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is unrelated to this PR, but fwiw I feel you could delete all of these comments in this demo config (so that they only appear in one place, in the main top-level config.yml). This would avoid duplicating them (and avoid the need to keep them in sync when they are changed)

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

oh i love this, I'll do that, thank you!

taylorreiter and others added 2 commits June 5, 2024 18:27
@taylorreiter taylorreiter merged commit 1d425ed into main Jun 5, 2024
2 checks passed
@taylorreiter taylorreiter deleted the ter/change-input-files branch June 5, 2024 22:33
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants