Shape Mining Pipeline

Description

The shape mining pipeline is designed to extract veselprofiles with corresponding metadata from book scans.

Extract figureID and pageID from book scan
Extract veselprofiles from book scan
Extract only characteristic shape of the veselprofiles
kNN approach to find most similar vesel shapes

Getting Started

There are two Docker Containers. One for a GPU machine and one for a CPU machine. Install and run docker container. Container hosts a jupyter server. Ther URL for accessing the server will be shown in the terminal.

chmod +x start_docker.sh
./start_docker.sh

or use docker-compose for your preferred config (CPU or GPU). Example for CPU

docker-compose -f Docker_CPU/docker-compose.yml up

Access Docker shell. The container has to run for that.

docker ps (to check COTAINER_ID)
docker exec -it CONTAINER_ID bash

Development with Visual Studio Code

Install Visual Studio Code with the following extensions:

Open devcontainer file and choose in entry "dockerComposeFile" the GPU or CPU container.
Create following directories if you use a Linux OS:

vscode_remote/extensions
vscode_remote/bashhistory
vscode_remote/insiders

In VSCodee press Shift+P and run "Remote-Containers:Rebuild and Reopen in Container" command.

Cotainer filesystem

Source code is located at /home/Code
Tensorflow objection detection API at /models/research/object_detection

Models

Models for mining shapes can be downloaded at Mining Pages

Run whole pipeline

Mount correct volumes to docker-compose file
Run file mining_pages.py

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.devcontainer		.devcontainer
.vscode		.vscode
Docker		Docker
Mining_Pages		Mining_Pages
mining_pages_utils		mining_pages_utils
.gitattributes		.gitattributes
.gitignore		.gitignore
README.md		README.md
mining_pages.ipynb		mining_pages.ipynb
start_docker.sh		start_docker.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Shape Mining Pipeline

Description

Getting Started

Development with Visual Studio Code

Cotainer filesystem

Models

Run whole pipeline

About

Releases

Packages

Languages

maxhaibt/mining-shapes

Folders and files

Latest commit

History

Repository files navigation

Shape Mining Pipeline

Description

Getting Started

Development with Visual Studio Code

Cotainer filesystem

Models

Run whole pipeline

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages