Skip to content

Latest commit

 

History

History
46 lines (29 loc) · 2.91 KB

README.md

File metadata and controls

46 lines (29 loc) · 2.91 KB

DOI

README.md

December 21st 2022 version

Welcome to the data repository of the Toflit18 project! More details on the project can be found on its research blog: https://toflit18.hypotheses.org/ Details on the funding, partners, contributors, etc. can be found here: http://toflit18.medialab.sciences-po.fr/#/about The main research tool we provide to researchers is the datascape: http://toflit18.medialab.sciences-po.fr/#/home There is a companion GitHub repository dealing with the software aspects of the datascape (https://github.com/medialab/toflit18).

All the data are encoded in UTF-8, comma-separated. We recommend working with them using LibreOffice.

The data are released under datapackage format.

The data are released under an ODbl licence : http://opendatacommons.org/licenses/odbl/1.0/

Sources

The folder "source" includes sources used in the project as csv files. For more details on the different types of sources, see http://toflit18.medialab.sciences-po.fr/#/exploration/sources

Raw aggregated data

  • The folder "base" has all the data, classification and various files. These include:
    • bdd_centrale.csv.zip which is a aggregation of all sources (except the "Out" ones"). This is the go-to file if you want the latest, raw version of the data.
    • documentation about all variables in bdd_centrale.csv is available in the file "Variables explanation.csv"

Relational database

  • We provide you basically with all the necessary files to do a relational database. We work with it ourselves with the scripts included in the folder "scripts". Our workflow creates Stata and Neo4J output.
  • documentation about the classifications is available in the file "classifications_index.csv". Do not hesitate to contribute new ones, starting from "marchandises_pour_nouvelle_classification.csv"
  • bdd_courante.csv.zip is the "flat file" build around bdd_centrale.csv. This includes all the classifications, some computations for imputed value of flow and value_per_unit, best guess sources etc. This is the go-to file if you want the lastet, cleaned and enriched version of the data

Contact us

For history related issues, ask Loïc Charles ([email protected]) or Guillaume Daudin ([email protected])

For basic guidance in using these ressources, ask Guillaume Daudin ([email protected])

For advanced technical issues, ask Paul Girard ([email protected])

How to cite the dataset

This dataset is published on Zenodo as DOI

You can cite it thusly:

Guillaume Daudin, Loïc Charles, Pierre Gervais, Paul Girard, & Guillaume Plique. (2022). TOFLIT18 dataset (1.0.0-zenodo) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.6573397