Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

unique constraint error #151

Open
rob123king opened this issue Dec 14, 2021 · 3 comments
Open

unique constraint error #151

rob123king opened this issue Dec 14, 2021 · 3 comments

Comments

@rob123king
Copy link

Having issues in galaxy doing then prep step. The annotation is unique thought so not sure why it is failing here. Is this error suggesting there are two transcripts with the same name as I don't see that in my gff?

 raise Exception(f"Error while inserting {to_insert} into transcript table: {e}")
Exception: Error while inserting ('CHILSU_01T008110-RA', None, 'cds', None, False, 'CHILSU_01G008110') into transcript table: UNIQUE constraint failed: transcript.protein_id
@rob123king
Copy link
Author

Tried it for another species and get the same problem at around 8K transcripts.

@nsoranzo
Copy link
Member

@rob123king Thanks for reporting the issue!
In your case it seems there is a duplicated transcript protein id: cds.
For a GFF3 file, the protein id of a transcript is extracted from the "CDS" lines which have this transcript as Parent. The script checks that all these CDS lines have the same ID (or Name if ID is missing) and then uses this as protein id.

Happy to have a look at your GFF3 file, if you like.

@rob123king
Copy link
Author

rob123king commented Dec 15, 2021 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants