This is the code for the processing of the articles from the Swiss National Licences.
More information on the general project on http://www.nationallizenzen.ch.
See the hyperlinked description of the workflow
- Extract all zip files from Oxford
- example of an article with JATS metadata
- merge all articles from Oxford (900'000 files) in one big xml file per jounrnal
- example of all articles from a specific journal in one xml file
- do a quick analysis for Oxford, extract for example the DTD's
- elasticsearch mapping
- extract selected fields with metamorph
- index data in elasticsearch
- generate title lists from elasticsearch
To be able to use this, you need to install metafacture. We recommend installing it from https://github.com/linked-swissbib/mfWorkflows/