Prototype enrichment workflow for SAOS system
- Install Oracle Java 7+
- Install lein2
- Clone saos-wf-enrich GitHub repo
git clone --recursive https://github.com/CeON/saos-wf-enrich/
- Run top level
bld.sh
script - Copy
law_journal_dict.json.gz
toext
directory - Prepare
config
file based on templatecp config.example config
Edit
config
, providing put and get URLs + authorization credentials
Enricher comes with three scripts:
- run.sh
- The main entry point for running analysis and putting results back. For periodic updates of enrichment tags, add it to crontab.
- get.sh
- Download the data via REST (do not process them)
- put.sh
- Put the calculated data to the server (useful if you want to resend existing data)
The following files log information about enrichment
- get.log
- log for downloading data via REST
- bob.log
- log for processing data
- put/rest/out/put_json.log
- stores the response from the
server, obtained after doing REST PUT with the results. Correct
response should look like this:
{:code 200, :message OK, :content {"status":"OK", "message":"Enrichment tags successfully uploaded","details":""}}
Set the environment variable JVM_OPTS
to the following value
JVM_OPTS="-Djava.net.preferIPv6Addresses=true"
Distributed under the Eclipse Public License, the same as Clojure.