
Short descriptions for csv2rdf4lod's scripts

Timothy Lebo edited this page Feb 14, 2012 · 10 revisions
csv2rdf4lod-automation is licensed under the [Apache License, Version 2.0](https://github.com/timrdf/csv2rdf4lod-automation/wiki/License)

dg-create-dataset-dir.sh

  • Sets up the directory structure,
  • queries data.gov for formats,
  • retrieves all but rdf,
  • unzips files, and
  • sets up the convert script.
  • Uncommenting one line in bin/dg-create-dataset-dir.sh will also run the raw conversion (search for "NOTE" in the script and follow the instructions).

cr-create-convert-sh.sh

  • You will only need to run this manually if data.gov does not return *.csv files.

dg-get-mod-date.sh

  • You will rarely need this to convert csv data. Run it with a data.gov datasetID to get the web server's modification date.
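As a rough illustration of what "web server modification date" can mean, the sketch below pulls the Last-Modified header out of an HTTP response. It is not the code of dg-get-mod-date.sh: the function name `last_modified` is hypothetical, and how the real script maps a datasetID to a URL is not shown here.

```shell
#!/usr/bin/env bash
# Hypothetical helper, not part of csv2rdf4lod.
# Reads raw HTTP response headers on stdin and prints the value of the
# first Last-Modified header, if there is one.
last_modified() {
  tr -d '\r' | awk '
    tolower($0) ~ /^last-modified:/ {
      sub(/^[^:]*:[ \t]*/, "")   # drop the header name and leading blanks
      print
      exit
    }'
}
```

Assuming `$url` holds the address of a data file, it could be used as `curl -sI "$url" | last_modified`.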

bin/util/symbol-interpretation.awk

rr-create-void.sh

  • When in a version directory, use the "essential parameters" embedded in the directory structure to output the attributes for the Abstract Dataset and the Versioned Dataset.
  • Pass the dataset's dump file as a parameter to get an extra triple stating the dcterms:modified of the Versioned Dataset.
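The idea of reading "essential parameters" from the directory structure can be sketched as follows. This is only an illustration: it assumes a layout of the form source/SOURCE-ID/DATASET-ID/version/VERSION-ID, and both the function name `essential_params` and the example path are hypothetical rather than taken from rr-create-void.sh.

```shell
#!/usr/bin/env bash
# Hypothetical helper, not part of csv2rdf4lod.
# Assumes the version directory looks like:
#   .../source/SOURCE-ID/DATASET-ID/version/VERSION-ID
essential_params() {
  local dir="${1:-$PWD}"              # default to the current directory
  local version_id dataset_id source_id
  version_id=$(basename "$dir")
  # Two levels up skips the literal "version" directory.
  dataset_id=$(basename "$(dirname "$(dirname "$dir")")")
  source_id=$(basename "$(dirname "$(dirname "$(dirname "$dir")")")")
  printf 'source=%s dataset=%s version=%s\n' \
    "$source_id" "$dataset_id" "$version_id"
}
```

Called with no argument from inside a version directory, it reports the three identifiers that would name the Abstract Dataset and the Versioned Dataset.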

cr-publish-params-to-endpoint.sh

  • Finds all //manual/*.params.ttl files, aggregates them into a single file, and loads that file into a named graph on the SPARQL endpoint.
  • Defaults to the named graph http://purl.org/twc/vocab/conversion/ConversionProcess.
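The find-and-aggregate step can be approximated with a few lines of shell. This is a minimal sketch under stated assumptions, not the script's actual code: the function name `aggregate_params` and the default output file name are hypothetical, plain concatenation is assumed to be acceptable for the Turtle files involved, and the load into the endpoint is left out because it depends on the triple store in use.

```shell
#!/usr/bin/env bash
# Hypothetical helper, not part of csv2rdf4lod.
# Concatenates every */manual/*.params.ttl file under ROOT into OUT.
aggregate_params() {
  local root="${1:-.}"
  local out="${2:-all.params.ttl}"    # assumed output name
  # Append a newline after each file so that the last line of one file
  # never runs into the first line of the next.
  find "$root" -type f -path '*/manual/*.params.ttl' \
    -exec sh -c 'for f; do cat "$f"; echo; done' sh {} + > "$out"
}
```

The order in which `find` visits the files is unspecified, so the aggregate should be treated as an unordered collection of statements.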

cr-publish-void-to-endpoint.sh

cr-publish-sameas-to-endpoint.sh

update-csv2rdf4lod-bin.sh
