Getting data into the proper format #1

Open
virpatel524 opened this issue Mar 19, 2017 · 1 comment

@virpatel524

Hi,

Your approach was recommended to me for determining COI from my malaria data. Unfortunately, I'm fairly new to VCF analysis, and I'm having trouble working out how to get my VCF files into your matrix format. First, how would you recommend converting my VCF files? Is there a script that does this easily? Second, what is the distinction between the major and minor alleles and the reference and alternate alleles? I know they are not always the same in a VCF file, so I'm curious how you would account for this.

Thank you very much for your help!

@hhchang
Collaborator

hhchang commented Aug 25, 2017

Thank you so much for your interest! I am not familiar with VCF files. A quick search turned up existing code that can convert a VCF file to a tab-delimited file (such as this one: https://github.com/EichlerLab/vcf-flow/blob/master/vcf_to_tab.py). You can give it a try and let me know if it works!

-Hsiao-Han
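
For anyone landing here with the same two questions, below is a rough sketch of one way the conversion could look in plain Python. It is not the maintainers' script and has not been checked against this repository's expected input. On the second question: in a VCF, REF is the allele found in the reference genome and ALT is any other observed allele, whereas major and minor alleles are defined by frequency within your own sample set, so the major allele can be either REF or ALT at a given site. The sketch therefore counts alleles per site and recodes each call relative to the most common one. The output coding (1 = homozygous major, 0.5 = heterozygous, 0 = homozygous minor, -1 = missing) and the samples-as-rows layout are assumptions to verify against the project's README, and the function names (`parse_vcf_genotypes`, `code_site`, `vcf_to_matrix`) are hypothetical. It also assumes biallelic sites with a `GT` entry in the FORMAT column.

```python
from collections import Counter

MISSING = -1  # assumed code for a missing genotype call


def parse_vcf_genotypes(lines):
    """Return (sample_names, sites); each site is (site_id, [call, ...])."""
    samples, sites = [], []
    for line in lines:
        line = line.rstrip("\n")
        if not line or line.startswith("##"):
            continue  # skip blank lines and meta-information lines
        fields = line.split("\t")
        if fields[0] == "#CHROM":
            samples = fields[9:]  # sample names follow the 9 fixed columns
            continue
        gt_index = fields[8].split(":").index("GT")
        calls = []
        for sample_field in fields[9:]:
            gt = sample_field.split(":")[gt_index]
            alleles = gt.replace("|", "/").split("/")
            # A "." anywhere in the call is treated as missing.
            calls.append(None if "." in alleles else [int(a) for a in alleles])
        sites.append((f"{fields[0]}:{fields[1]}", calls))
    return samples, sites


def code_site(calls):
    """Recode one site's calls relative to the major (most frequent) allele."""
    counts = Counter(a for call in calls if call for a in call)
    if not counts:
        return [MISSING] * len(calls)
    # Ties go to the lowest allele index, i.e. REF (index 0) when it is tied.
    major = max(sorted(counts), key=counts.get)
    coded = []
    for call in calls:
        if call is None:
            coded.append(MISSING)
        elif len(set(call)) > 1:
            coded.append(0.5)  # heterozygous call
        elif call[0] == major:
            coded.append(1)    # homozygous for the major allele
        else:
            coded.append(0)    # homozygous for a non-major allele
    return coded


def vcf_to_matrix(lines):
    """Build a samples-by-sites table: header row plus one row per sample."""
    samples, sites = parse_vcf_genotypes(lines)
    coded = [code_site(calls) for _, calls in sites]
    header = ["sample"] + [site_id for site_id, _ in sites]
    rows = [[name] + [col[i] for col in coded] for i, name in enumerate(samples)]
    return header, rows


# Tiny made-up VCF: at chr1:100 the ALT allele is the major allele.
FIXED = ["#CHROM", "POS", "ID", "REF", "ALT", "QUAL", "FILTER", "INFO", "FORMAT"]
EXAMPLE_VCF = "\n".join([
    "##fileformat=VCFv4.2",
    "\t".join(FIXED + ["S1", "S2", "S3", "S4"]),
    "\t".join(["chr1", "100", ".", "A", "G", ".", "PASS", ".", "GT",
               "1/1", "0/1", "1/1", "0/0"]),
    "\t".join(["chr1", "200", ".", "C", "T", ".", "PASS", ".", "GT",
               "0/0", "./.", "0/1", "0/0"]),
])

header, rows = vcf_to_matrix(EXAMPLE_VCF.splitlines())
print("\t".join(header))
for row in rows:
    print("\t".join(str(value) for value in row))
```

To run it on a real file, pass an open file handle instead of the example lines, for instance `vcf_to_matrix(open("my_samples.vcf"))`, where the filename is a placeholder. Multi-allelic sites and compressed (`.vcf.gz`) input are not handled and would need filtering or decompression first.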
