Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

discrepancies with the 1000 genomes project #354

Open
MauricioAlvcam opened this issue Aug 6, 2019 · 0 comments
Open

discrepancies with the 1000 genomes project #354

MauricioAlvcam opened this issue Aug 6, 2019 · 0 comments

Comments

@MauricioAlvcam
Copy link

I am contacting you because I have found discrepancies with the 1000 genomes project. I have been using the database to see the variation in human populations. But I have found sites where not exist variation in the 1000 genomes project and these differ with gnomAD.

For example, rs113875178 where the alleles are A/C (A is the reference). This site is not found in the VCF file, so I am assuming that 100% of the population doesn´t have variation (it has not been called in the 1000 genomes project and the reference allele are present at that position). But in gnomAD the ALT genotype is the majority (C).

What happen with these kinds of snp? Did the 1000 genomes project not be considered? I don´t know if my interpretation is wrong, errors by reassignment, I'm missing some data that I'm not observing, reads with errors or missing information on the page. Do you used the reads of 1000 genomes or the vcf file?.

I hope you could help me with my doubts.
Thank you for your help

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant