SVIP-Smooth-DTW: Sequence VerIfication for Procedures in Videos with Smooth DTW

This repo is a experimental combination of SVIP:https://github.com/svip-lab/SVIP-Sequence-VerIfication-for-Procedures-in-Videos and VideoAlignment: https://github.com/hadjisma/VideoAlignment.

Main pipeline uses SVIP so the setup and scripts are copied from there. Smooth DTW loss defined in utils/smoothDTW.py. Training pipeline is modified to only use Smooth DTW loss. Some example figures in figs. dist_matrix_*.png corresponds to distance matrix from smooth DTW. dtw_matrix_*.png corresponds to DTW matrix computed through DP. frames_*.png corresponds to the frame input pairing including labels.

Getting Started

Prerequisites

python 3.6
pytorch 1.7.1
cuda 10.2

Installation

Clone the repo and install dependencies.

git clone https://github.com/svip-lab/SVIP-Sequence-VerIfication-for-Procedures-in-Videos.git
cd VIP-Sequence-VerIfication-for-Procedures-in-Videos
pip install requirements.txt

Download the pretrained model.

Link：here

Extraction code：2555

Datasets

Please refer to here for detailed instructions.

Training and Evaluation

We have provided the default configuration files for reproducing our results. Try these commands to play with this project.

For training:

CUDA_VISIBLE_DEVICES=0,1,2,3 python train.py --config configs/train_resnet_config.yml

For evaluation:

CUDA_VISIBLE_DEVICES=0 python eval.py --config configs/eval_resnet_config.yml --root_path [model&log folder] --dist [L2/NormL2] --log_name [xxx]

Note that we use L2 distance while evaluating on COIN-SV, otherwise NormL2.

Trained Models

We provide checkpoints for each dataset trained with this re-organized codebase.

Notice: The reproduced performances are occassionally higher or lower (within a reasonable range) than the results reported in the paper.

Dataset	Split	Papar	Reproduce	ckpt
COIN-SV	val	56.81, 0.4005	58.27, 0.4667	here
COIN-SV	test	51.13, 0.4098	51.55, 0.4658	here
Diving48-SV	val	91.91, 1.0642	91.69, 1.0928	here
Diving48-SV	test	83.11, 0.6009	84.28, 0.6193	here
CSV	test	83.02, 0.4193	82.88, 0.4474	here

Citation

If you find this repo helpful, please cite our paper:

@inproceedings{qian2022svip,
  title={SVIP: Sequence VerIfication for Procedures in Videos},
  author={Qian, Yicheng and Luo, Weixin and Lian, Dongze and Tang, Xu and Zhao, Peilin and Gao, Shenghua},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={19890--19902},
  year={2022}
}

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
__pycache__		__pycache__
configs		configs
data		data
datasets		datasets
figs		figs
imgs		imgs
models		models
temp_log		temp_log
utils		utils
.DS_Store		.DS_Store
LICENSE		LICENSE
README.md		README.md
eval.py		eval.py
predict.py		predict.py
predict.sh		predict.sh
predict_dtw.py		predict_dtw.py
requirements.txt		requirements.txt
smoothDTW_demo.py		smoothDTW_demo.py
train.py		train.py
train.sh		train.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SVIP-Smooth-DTW: Sequence VerIfication for Procedures in Videos with Smooth DTW

Getting Started

Prerequisites

Installation

Datasets

Training and Evaluation

Trained Models

Citation

About

Releases

Packages

Languages

License

fan23j/SVIP-Smooth-DTW

Folders and files

Latest commit

History

Repository files navigation

SVIP-Smooth-DTW: Sequence VerIfication for Procedures in Videos with Smooth DTW

Getting Started

Prerequisites

Installation

Datasets

Training and Evaluation

Trained Models

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages