-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge pull request #47 from Arcadia-Science/ter/change-input-files
Change the input files to only take one transcriptome assembly file instead of two
- Loading branch information
Showing
7 changed files
with
172 additions
and
238 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,39 +1,6 @@ | ||
################################################################################ | ||
## Input file descriptions | ||
################################################################################ | ||
|
||
# Config parameters in this file are used as defaults by the pipeline (Snakefile). | ||
# To override the defaults, create a copy of this file and pass your new file to snakemake using the --configfile flag. | ||
# Any parameters in the new config file will overwrite the defaults listed here. | ||
|
||
########### | ||
# File IO # | ||
########### | ||
|
||
# Specify the input and output directories | ||
input_dir: "inputs/" | ||
output_dir: "outputs/demo" | ||
|
||
# All input files are produced by reads2transcriptome. | ||
# While this is not a strict requirement, using these output files gives us the ability to look at very short contiguous sequences. | ||
# TER TODO: update names of output files based on Nextflow of reads2transcriptome. | ||
# - orfs_amino_acids: predicted ORFs translated into amino acids. | ||
# Output by transdecoder. | ||
# Used for cleavage peptide prediction and annotation of nonribosomal peptide synthetases. | ||
# - orfs_nucleotides: predicted ORFs as nucleotide sequences. Output by transdecoder. | ||
# Used to compare peptide nucleotide sequences (clustering, dn/ds estimation, etc.). | ||
# - contigs_shorter_than_r2t_minimum_length: contigs that are shorter than X nucleotides (by default 75bp). | ||
# The reads2transcriptome pipeline assembles RNA-seq reads into contigs (transcripts) using multiple assemblers and then merges those assemblies together. | ||
# Before merging, very short contigs are removed (<75bp). | ||
# However, reads2transcriptome outputs a FASTA file containing these transcripts, which is used as input to peptigate here. | ||
# These contigs may contain sORFs and so are included as an input to the peptigate pipeline. | ||
# - contigs_longer_than_r2t_minimum_length: contigs longer than X nucleotides (default is 75bp). | ||
# If the user did not use the reads2transcriptome file to generate their input files, this would be a transcriptome assembly FASTA file in nucleotide format containing transcripts. | ||
# Note that the first step of sORF prediction combines the contigs_shorter_than_r2t_minimum_length and contigs_longer_than_r2t_minimum_length files, so there is no need to perform any pre-processing by length. | ||
# - plmutils_model_dir: path to the directory for the plmutils model that will predict whether sORFs are coding or non-coding. | ||
|
||
contigs: "demo/contigs.fa" | ||
orfs_amino_acids: "demo/orfs_amino_acids.faa" | ||
orfs_nucleotides: "demo/orfs_nucleotides.fa" | ||
contigs_shorter_than_r2t_minimum_length: "demo/contigs_shorter_than_r2t_minimum_length.fa" | ||
contigs_longer_than_r2t_minimum_length: "demo/contigs_longer_than_r2t_minimum_length.fa" | ||
plmutils_model_dir: "inputs/models/plmutils/" |
Oops, something went wrong.