-
Notifications
You must be signed in to change notification settings - Fork 56
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Short Introns flagged with <pseudo> not accepted by ENA anymore #151
Comments
Hi thank you for using our tools and for your feedback with many precious information.
|
Hi,
Probably you misunderstood " I will now filter out all pseudo flagged entries and keep you updated. |
Sorry I read too fast the Sam's reply. So no threshold, but they don't allow it anymore. |
Hi,
Do the numbers actually have any meaning? In the end Due to this mess, I counted the number of genes incorrectly. I should be 196. Finally, I excluded the entries now with: Probably it would be the easiest to include an option in The flat file created with |
attribute syntax: Are you sure you did not mix up with Otherwise I guess the features with the Note containing |
Thanks for the reply!
Does the value of the pseudo key have any meaning?
This is identical to
Therefore, the awk command should have worked. But to be sure I will use the output from agat. |
Good, I was not sure that
No it is just a counter. |
Now it is possible to use: |
Dear @Juke34 and all,
first of all: thank you very much for this tool kit. This is very helpful!
To submit a de novo assembly including annotation to ENA I ran certain filtering steps with agat 0.5.1 before running EMBLmyGFF3.
In detail executed
agat_sp_keep_longest_isoform.pl -f <gff> -o <agat1.gff>
agat_convert_sp_gxf2gxf.pl -g <agat1.gff> -o <agat2.gff>
agat_sp_fix_features_locations_duplicated.pl -f <agat2.gff> -o <agat3.gff>
agat_sp_flag_short_introns.pl --gff <agat3.gff> --out <agat4.gff>
Subsequently, I ran
EMBLmyGFF3
, validated and submitted withwebin-cli-3.5.0.jar
ENA accepted the submission and released the fasta sequences of the genome assembly but not the annotation. I wrote back an forth with the help desk (Sam Holt) but he couldn't find any issues on their end. Until recently - half a year after submission - he came back to me with this answer:
Of cause, I replied:
Then Sam replied to this again:
As @Juke34 wrote here:
NBISweden/EMBLmyGFF3#31
agat_sp_flag_short_introns.pl
should do the job.My questions are now:
pseudo=8
.agat_sp_flag_short_introns.pl
, correct?I see that this is not really an issue with agat itself but I just wanted to share my experience here.
Thanks and best regards,
Tilman
The text was updated successfully, but these errors were encountered: