Skip to content

Latest commit

 

History

History
31 lines (22 loc) · 2.25 KB

DATASET.md

File metadata and controls

31 lines (22 loc) · 2.25 KB

Dataset Curation

  • In the dataset folder, there is a separate python script for each dataset such as Mannequin, HM3D and ....

  • In all scripts, we use the multiprocessing package in Python to make the process faster. We pass each scene for 3Dscenes datasets or each video in video datasets to a single process.

  • Each script gets the path to the raw data, mines potential image pairs, which is different for videos and 3D scenes (.glb files), and then stores the image pairs with sufficient overlap.

  • All you need to do is download your raw dataset (for example, co3d) and use the command below: (data_path points to the raw data directory, which has all scenes/videos)

python co3d_multipro.py --data_path /path/to/raw/files --store_path /path/to/store/directory --cpus num_cpus