Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Error in renaming and classification #50

Open
drwazeem opened this issue Dec 12, 2020 · 0 comments
Open

Error in renaming and classification #50

drwazeem opened this issue Dec 12, 2020 · 0 comments

Comments

@drwazeem
Copy link

I'm facing the problem (likely related to grep (Discussed here probably: #41 (comment) ) as an empty file is generated) since a few days. Could you please have a look and check if everything is okay. The commands and the output are given below:

Error executing process > 'Summary_renaming_and_classification (1)'

Caused by:
 Process `Summary_renaming_and_classification (1)` terminated with an error exit status (1)

Command executed:

 set -o pipefail
 gffcompare -G -o filter                     -r know_lnc.gtf                     -p 3 novel.lncRNA.stringent.gtf
 awk '$3 =="u"||$3=="x"{print $5}' filter.novel.lncRNA.stringent.gtf.tmap |sort|uniq|                     perl /media/wazeem/82346450346448ED/lncpipe/LncPipe/bin/extract_gtf_by_name.pl novel.lncRNA.stringent.gtf - > novel.lncRNA.stringent.filter.gtf
 
 #rename lncRNAs according to neighbouring protein coding genes
 awk '$3 =="gene"{print }' known_coding.gtf | perl -F'\t' -lane '$F[8]=~/gene_id "(.*?)";/ && print join qq{\t},@F[0,3,4],$1,@F[5,6,1,2,7,8,9]' - |             sort-bed - > gencode.protein_coding.gene.bed
 gtf2bed < novel.lncRNA.stringent.filter.gtf |sort-bed - > novel.lncRNA.stringent.filter.bed
 gtf2bed < know_lnc.gtf |sort-bed - > known.lncRNA.bed
 perl /media/wazeem/82346450346448ED/lncpipe/LncPipe/bin/rename_lncRNA_2.pl non_human_mod.gtf
 # mv lncRNA.final.v2.gtf all_lncRNA_for_classifier.gtf
 grep -v 'gene_id "NA-' lncRNA.final.v2.gtf > all_lncRNA_for_classifier.gtf
 perl /media/wazeem/82346450346448ED/lncpipe/LncPipe/bin/rename_proteincoding.pl known_coding.gtf> protein_coding.final.gtf
 cat all_lncRNA_for_classifier.gtf protein_coding.final.gtf > final_all.gtf
 gffread final_all.gtf -g GRCm38.primary_assembly.chr_only.fa -w final_all.fa -W
 gffread all_lncRNA_for_classifier.gtf -g GRCm38.primary_assembly.chr_only.fa -w lncRNA.fa -W
 gffread protein_coding.final.gtf -g GRCm38.primary_assembly.chr_only.fa -w protein_coding.fa -W
 #classification 
 perl /media/wazeem/82346450346448ED/lncpipe/LncPipe/bin/lincRNA_classification.pl all_lncRNA_for_classifier.gtf known_coding.gtf lncRNA_classification.txt

Command exit status:
 1

Command output:
 (empty)

Command error:
   18856 reference transcripts loaded.
   74 duplicate reference transcripts discarded.
   655 query transfrags loaded.

Work dir:
 /media/wazeem/82346450346448ED/lncpipe/LncPipe/work/22/39af242ea431e3e89321040f827330

Tip: you can try to figure out what's wrong by changing to the process work dir and showing the script file named `.command.sh` 
`
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant