Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Translation when circular genome and ORF in the cut of the assembly #81

Open
BenjaminGuinet opened this issue Sep 13, 2023 · 1 comment

Comments

@BenjaminGuinet
Copy link

BenjaminGuinet commented Sep 13, 2023

Hello I have a gff3 file such as :

1   Geneious        CDS     101     10548   .       +       0       Name=1_ORF104;product=hypothetical protein
1   Geneious        gene    101     10548   .       +       0       Name=1_ORF104;product=hypothetical protein

This is part of a circular viral genome, how can I code the gff3 file to get the translation from 100548..101121 and 101121,1..101 using --translate in EMBLmyGFF3

instead of the translation of the protein from 101 to 10548

such as :

complement(join(100548..101121,1..101))

Thanks for your help

@Juke34
Copy link
Collaborator

Juke34 commented Sep 18, 2023

You must use the qualifier ribosomal_slippage in the CDS feature: https://www.ebi.ac.uk/ena/WebFeat/
So it will be a ribosomal_slippage attribute of the CDS feature in the GFF.
But I think the mechanism to automatically reflect this information in the EMBL file by EMBLmyGFF3 is not implemented (it will be among the qualifiers but the join operator will not be inserted).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants