Libraries for paged content in Islandora.
This module requires the following modules/libraries:
This module has the following optional requirements:
- Ghostscript (Debian/Ubuntu):
    sudo apt-get install ghostscript
- pdftotext (Debian/Ubuntu):
    sudo apt-get install poppler-utils
- pdfinfo (Debian/Ubuntu):
    sudo apt-get install poppler-utils
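After installing the packages above, it can be useful to confirm that the binaries are actually on the PATH. The following is a minimal sketch, not part of this module; the helper name missing_tools is hypothetical, and it only assumes a POSIX shell.

```shell
#!/bin/sh
# Print the name of each given command that is not found on PATH.
# (missing_tools is a hypothetical helper, not something the module ships.)
missing_tools() {
  for tool in "$@"; do
    command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
  done
}

# gs comes from the ghostscript package; pdftotext and pdfinfo from poppler-utils.
missing=$(missing_tools gs pdftotext pdfinfo)
if [ -n "$missing" ]; then
  echo "Not found on PATH:" $missing
else
  echo "All optional tools found."
fi
```

Run it as the same system user the web server uses, since that user's PATH is the one that matters when the module calls these tools.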
Install as usual for a Drupal module; see the Drupal module installation documentation for further information.
Set the path for gs (GhostScript), the externally accessible Djatoka URL, and the 'Solr page sequence number field' in Administration » Islandora » Solution pack configuration » Paged Content Module (admin/islandora/solution_pack_config/paged_content).
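To find the value to enter for the gs path, you can ask the shell where the binary resolves. This is a small sketch assuming a POSIX shell; tool_path is a hypothetical helper name, and the resulting path depends on your system.

```shell
#!/bin/sh
# Print where a command resolves on PATH; returns non-zero if it is not found.
# (tool_path is a hypothetical helper used only for this illustration.)
tool_path() {
  command -v "$1" 2>/dev/null
}

gs_path=$(tool_path gs) || gs_path=""
echo "gs path: ${gs_path:-not found}"
```

The printed path is what goes into the gs field on the configuration page.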
There is an option to set the page label to the page's sequence number. On ingest, each page's label will be set to its sequence number. When reordering pages, all of the page labels will be updated with the new sequence numbers.
You can also "Hide Page Objects From Search Results", so that only the parent object is returned. If you use this option, make sure that you check the "Aggregate OCR?" box when ingesting your paged content object. Otherwise, the parent object will not receive an OCR datastream, and will not be returned in search results.
Further documentation for this module is available at our wiki.
paged-content-consolidate-missing-ocr
This drush command finds all page objects whose parent does not have an OCR datastream, generates one by combining the OCR datastreams from the children, and adds that datastream to the parent.
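The idea of combining the children's OCR into one parent text can be illustrated on plain local files. This sketch is not the module's implementation: it assumes each page's OCR has been exported to a file named after its sequence number (1.txt, 2.txt, ...) in one directory, and consolidate_ocr is a hypothetical helper name.

```shell
#!/bin/sh
# Concatenate page OCR text files named <sequence>.txt in numeric page order.
# Assumption: one directory per parent object, one text file per page.
consolidate_ocr() {
  dir=$1
  ls "$dir" | grep -E '^[0-9]+\.txt$' | sort -n | while read -r page; do
    cat "$dir/$page"
  done
}

# Example (paths are placeholders):
#   consolidate_ocr ./book_pages > ./book_ocr.txt
```

Sorting numerically matters here: a plain alphabetical sort would place page 10 before page 2.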
Having problems or solved a problem? Check out the Islandora Google Groups for a solution.
Current maintainers:
If you would like to contribute to this module, please check out CONTRIBUTING.md. In addition, we have helpful Documentation for Developers, as well as a Developers section on the Islandora.ca site.