feed_ursus

Script to process CSVs into an Sinai-ready solr index.

Using feed_ursus.py

We recommend installing with poetry and pyenv. which can be installed with homebrew:

brew install pyenv
curl -sSL https://install.python-poetry.org | python3 -

You may need to add export PATH="/Users/andy/.local/bin:$PATH" to your shell profile.

If you installed poetry using homebrew (as this document formerly recommended), you might run into some dependency issues. If this happens try brew uninstall poetry and the official installer as shown above

To install dependencies in a virtual environment:

poetry install

Then, to run commands inside the new virtual environment, you can either enter poetry shell to enter the virtual environment, or you can prefix your commands with poetry run.

You can then use the script to convert a csv into a json document that follows the data model of an Ursus solr index:

poetry run feed_ursus.py [path/to/your.csv]

This repo includes a docker-compose.yml file that will run local instances of solr and ursus for use in testing this script. To use them (first install docker and docker compose):

docker-compose up --detach
docker-compose run web bundle exec rails db:setup

Give it a minute or so for solr to get up and running, then point feed_ursus.py directly at the new solr:

poetry run ./feed_ursus.py [path/to/your.csv] --solr_url http://localhost:6983/solr/californica

When the command finishes running, you can see your new site at http://localhost:6003

Running the test suite

First, install the dev dependencies and enter the virtualenv:

poetry install --dev
poetry shell

Then you can simply run:

pytest --mypy --pylint

This will run:

pylint, a linter, via pytest-pylint
mypy, a static type checker, via pytest-mypy
the test suite, written using pytest

Caveats

IIIF Manifests

When importing a work, the script will always assume that a IIIF manifest exists at https://iiif.library.ucla.edu/[ark]/manifest, where [ark] is the URL-encoded Archival Resource Key of the work. This link should work, as long as a manifest has been pushed to that location by importing the work into Fester or Californica. If you haven't done one of those, obviously, the link will fail and the image won't be visible, but metadata will import and be visible. A manifest can then be created and pushed to the expected location without re-running feed_ursus.py.

Name		Name	Last commit message	Last commit date
Latest commit History 135 Commits
.github/workflows		.github/workflows
docker-entrypoint-initdb.d		docker-entrypoint-initdb.d
fields		fields
solr		solr
test		test
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
date_parser.py		date_parser.py
docker-compose.yml		docker-compose.yml
dotenv.sample		dotenv.sample
feed_ursus.py		feed_ursus.py
mapper.py		mapper.py
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
year_parser.py		year_parser.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

feed_ursus

Using feed_ursus.py

Running the test suite

Caveats

IIIF Manifests

About

Releases

Packages

Contributors 6

Languages

License

UCLALibrary/feed_ursus

Folders and files

Latest commit

History

Repository files navigation

feed_ursus

Using feed_ursus.py

Running the test suite

Caveats

IIIF Manifests

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 6

Languages

Packages