You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
EMBOSS TranSeq has a bug/feature which removes anything before a colon : from sequence ids (i.e. the first "word" after > in each FASTA header line).
This results in empty cluster_cds files and failed workflow.
So if input sequences contains : in sequence identifiers, they need to be removed or another copy of CDS dataset needs to be prepared to avoid this error.
Thanks
Anil
The text was updated successfully, but these errors were encountered:
> Before colon: after colon
accaGTTACCCTCATCATCTTAGCTGATAGCCAGCCAGCCACCACAGGCAtgagtca
> Before_colon:after_colon
gtttgccatcttttgctgctctagggaatccagcagctgtcaccatgtaaacaagcccagg
Output of transeq -sequence 'test.fasta' -outseq 'protein.fasta' -frame 1 -table 0 -trim no -osformat2 fasta -auto:
>Before_1 colon: after colon
...
>after_colon_1
...
EMBOSS TranSeq has a bug/feature which removes anything before a colon
:
from sequence ids (i.e. the first "word" after>
in each FASTA header line).This results in empty cluster_cds files and failed workflow.
So if input sequences contains
:
in sequence identifiers, they need to be removed or another copy of CDS dataset needs to be prepared to avoid this error.Thanks
Anil
The text was updated successfully, but these errors were encountered: