December 21st 2022 version
Welcome to the data repository of the Toflit18 project! More details on the project can be found on its research blog: https://toflit18.hypotheses.org/ Details on the funding, partners, contributors, etc. can be found here: http://toflit18.medialab.sciences-po.fr/#/about The main research tool we provide to researchers is the datascape: http://toflit18.medialab.sciences-po.fr/#/home There is a companion GitHub repository dealing with the software aspects of the datascape (https://github.com/medialab/toflit18).
All the data are encoded in UTF-8, comma-separated. We recommend working with them using LibreOffice.
The data are released under datapackage format.
The data are released under an ODbl licence : http://opendatacommons.org/licenses/odbl/1.0/
The folder "source" includes sources used in the project as csv files. For more details on the different types of sources, see http://toflit18.medialab.sciences-po.fr/#/exploration/sources
- The folder "base" has all the data, classification and various files. These include:
- bdd_centrale.csv.zip which is a aggregation of all sources (except the "Out" ones"). This is the go-to file if you want the latest, raw version of the data.
- documentation about all variables in bdd_centrale.csv is available in the file "Variables explanation.csv"
- We provide you basically with all the necessary files to do a relational database. We work with it ourselves with the scripts included in the folder "scripts". Our workflow creates Stata and Neo4J output.
- documentation about the classifications is available in the file "classifications_index.csv". Do not hesitate to contribute new ones, starting from "marchandises_pour_nouvelle_classification.csv"
- bdd_courante.csv.zip is the "flat file" build around bdd_centrale.csv. This includes all the classifications, some computations for imputed value of flow and value_per_unit, best guess sources etc. This is the go-to file if you want the lastet, cleaned and enriched version of the data
For history related issues, ask Loïc Charles ([email protected]) or Guillaume Daudin ([email protected])
For basic guidance in using these ressources, ask Guillaume Daudin ([email protected])
For advanced technical issues, ask Paul Girard ([email protected])
This dataset is published on Zenodo as
You can cite it thusly:
Guillaume Daudin, Loïc Charles, Pierre Gervais, Paul Girard, & Guillaume Plique. (2022). TOFLIT18 dataset (1.0.0-zenodo) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.6573397