This set of Python scripts downloads, parses, and aggregates datasets from the World Bank's Open Data initiative, and outputs the data as a Mirador dataset. It aggregates the following World Bank datasets:
- World Development Indicators (WDI): http://data.worldbank.org/data-catalog/world-development-indicators
- Health Nutrition and Population (HNP) Statistics: http://data.worldbank.org/data-catalog/health-nutrition-and-population-statistics
- Gender Statistics: http://data.worldbank.org/data-catalog/gender-statistics
- Education Statistics: http://data.worldbank.org/data-catalog/ed-stats
These datasets contain yearly data from 1960 to the present and are updated quarterly, except for the HNP statistics, which are updated biannually.
The scripts have the following dependencies:
- Python 2.7.3+ (not tested with 3+) and the following package:
1) Download and extract the zip files:
python download.py
2) Create the Mirador dataset. By default, only the WDI data is used:
python makedataset.py
The HNP, Gender, and Education statistics can be added with the -hnp, -gender, and -edu parameters. For example, to add HNP and Gender statistics to the base WDI data, run:
python makedataset.py -hnp -gender
The resulting Mirador dataset will be saved in the mirador folder.
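
To illustrate how the optional parameters combine with the default WDI data, here is a minimal sketch of flag handling using Python's standard argparse module. It is not the actual code of makedataset.py: the function name select_datasets, the OPTIONAL_FLAGS list, and the short dataset labels it returns are assumptions made for this example. Only the flag names (-hnp, -gender, -edu) and the WDI-by-default behavior come from the description above.

```python
import argparse

# Flag names as documented above. How makedataset.py handles them
# internally is an assumption; this list is purely illustrative.
OPTIONAL_FLAGS = ["hnp", "gender", "edu"]


def select_datasets(argv):
    """Return the dataset labels selected by a list of command-line arguments.

    WDI is always included; each optional flag adds one more dataset.
    """
    parser = argparse.ArgumentParser(
        description="Sketch of dataset selection (hypothetical, not makedataset.py)")
    for flag in OPTIONAL_FLAGS:
        # A single-dash option such as -hnp is stored under the name "hnp".
        parser.add_argument("-" + flag, action="store_true",
                            help="also include the %s statistics" % flag)
    args = parser.parse_args(argv)

    selected = ["wdi"]  # base data, used by default
    for flag in OPTIONAL_FLAGS:
        if getattr(args, flag):
            selected.append(flag)
    return selected


if __name__ == "__main__":
    import sys
    print(select_datasets(sys.argv[1:]))
```

Under these assumptions, passing -hnp -gender selects the WDI, HNP, and Gender data, while passing no parameters selects WDI only.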