Skip to content

Latest commit

 

History

History
32 lines (20 loc) · 1.65 KB

how_to_download_models_en.md

File metadata and controls

32 lines (20 loc) · 1.65 KB

Model downloads are divided into initial downloads and updates to the model directory. Please refer to the corresponding documentation for instructions on how to proceed.

Initial download of model files

Download the Model from Hugging Face

Use a Python Script to Download Model Files from Hugging Face

pip install huggingface_hub
wget https://github.com/opendatalab/MinerU/raw/master/scripts/download_models_hf.py -O download_models_hf.py
python download_models_hf.py

The Python script will automatically download the model files and configure the model directory in the configuration file.

The configuration file can be found in the user directory, with the filename magic-pdf.json.

How to update models previously downloaded

1. Models downloaded via Git LFS

Important

Due to feedback from some users that downloading model files using git lfs was incomplete or resulted in corrupted model files, this method is no longer recommended.

For versions 0.9.x and later, due to the repository change and the addition of the layout sorting model in PDF-Extract-Kit 1.0, the models cannot be updated using the git pull command. Instead, a Python script must be used for one-click updates.

When magic-pdf <= 0.8.1, if you have previously downloaded the model files via git lfs, you can navigate to the previous download directory and update the models using the git pull command.

2. Models downloaded via Hugging Face or Model Scope

If you previously downloaded models via Hugging Face or Model Scope, you can rerun the Python script used for the initial download. This will automatically update the model directory to the latest version.