GWAS Analytics Scripts

This repository contains handy scripts and pregenerated data for analysing GWAS Catalog data. It is a quick-and-dirty analysis: it is untested, probably contains hard-coded local paths, and comes with no guarantee that any of it will work as-is. It may still be useful as a starting point.

Citation Graph Analysis

Make sure the file gwas-pubmed-ids.csv is in the current working directory, then run the calculate-citations.py script. It should print output like:

Read 2799 PubMed ids
    Collecting citations for 15761122...
    doing page 2

The script writes its results to a file called 'citation-graph.csv'.
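The column layout of 'citation-graph.csv' is not documented here, so it can help to look at the file before loading it anywhere. The following is a minimal sketch that makes no assumption about the columns; the helper name `preview_csv` is hypothetical and is not part of the scripts in this repository.

```python
import csv
from itertools import islice


def preview_csv(path, limit=5):
    """Return the first `limit` rows and the total row count of a CSV file."""
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        # Keep only the first few rows for display.
        head = list(islice(reader, limit))
        # Count the remaining rows without holding them in memory.
        total = len(head) + sum(1 for _ in reader)
    return head, total
```

Calling `preview_csv("citation-graph.csv")` after the script has finished returns a few sample rows plus the row count, which is enough to check whether the file has a header and how many columns it uses.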

Once generated, this file can be loaded into a Neo4j instance using Cypher so that the citation graph can be queried.
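As a rough illustration of the loading step, the sketch below builds a Cypher `LOAD CSV` statement as a Python string. It assumes, without confirmation from the source, that each row of 'citation-graph.csv' holds two PubMed ids (citing paper first, cited paper second) and has no header row. The `Publication` label, the `pubmedId` property, the `CITES` relationship and the function name are all hypothetical choices made for this example.

```python
def build_citation_load_query(csv_url="file:///citation-graph.csv"):
    """Build a Cypher statement that loads citation pairs into Neo4j.

    Assumption: each row is `citing_id,cited_id` with no header row.
    """
    return (
        f"LOAD CSV FROM '{csv_url}' AS row\n"
        # MERGE avoids creating duplicate nodes for a repeated PubMed id.
        "MERGE (citing:Publication {pubmedId: row[0]})\n"
        "MERGE (cited:Publication {pubmedId: row[1]})\n"
        "MERGE (citing)-[:CITES]->(cited)"
    )


# Example query against the assumed schema: the ten most cited publications.
MOST_CITED_QUERY = (
    "MATCH (p:Publication)<-[:CITES]-(c:Publication)\n"
    "RETURN p.pubmedId AS pubmedId, count(c) AS citations\n"
    "ORDER BY citations DESC LIMIT 10"
)
```

The generated statement can be pasted into the Neo4j browser or passed to a driver. If the real file has a header or a different column order, the `row[0]` and `row[1]` references need adjusting.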

Publication Trait Analysis

The file 'study-traits.csv' contains the results of a database query for the links between studies and their associated traits. It can also be loaded into Neo4j and queried as part of the graph.
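The study-to-trait links can also be explored directly in Python before loading them into a graph. The sketch below counts how many rows mention each trait. It assumes that the trait sits in the second column and that the file starts with a header row; both are guesses about 'study-traits.csv' rather than documented facts, and the function name is hypothetical.

```python
import csv
from collections import Counter


def count_trait_usage(path, trait_column=1, skip_header=True):
    """Count how many rows of a study/trait CSV mention each trait.

    Assumption: the trait is in column `trait_column` (0-based).
    """
    counts = Counter()
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.reader(handle)
        if skip_header:
            next(reader, None)  # Discard the assumed header row.
        for row in reader:
            # Ignore short or blank rows rather than failing on them.
            if len(row) > trait_column:
                counts[row[trait_column].strip()] += 1
    return counts
```

For example, `count_trait_usage("study-traits.csv").most_common(10)` would list the ten most frequently used traits under these assumptions.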


tburdett/gwas-analytics

Folders and files

Latest commit

History

Repository files navigation

GWAS Analytics Scripts

Citation Graph Analysis

Publication Trait Analysis

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages