This repository contains the benchmarked algorithms for the environments in SoftGym (paper). The benchmarked algorithms include:

- Cross-Entropy Method (CEM) [source]
- CURL/SAC [source] [paper]
  - We use the original implementation.
- DrQ [source] [paper]
  - We use the original implementation.
- PlaNet [source] [paper]
  - We use a customized PyTorch version.
- MVP [source] [paper]
  - We build on top of the original implementation.
Install SoftGym by following the instructions in the SoftGym repository. Then copy the softgym code to the SoftAgent root directory so that the file structure looks like this:

```
softagent
├── cem
├── ...
└── softgym
```
Update the conda environment with the additional packages required by SoftAgent:

```
conda env update --file environment.yml --prune
```
Activate the conda environment by running:

```
. ./prepare_1.0.sh
```
For running MVP, please refer to the original implementation for dependencies.
Generating initial states for the different SoftGym environments:

```
python experiments/generate_cached_states.py
```
Running CEM experiments:

```
python experiments/run_cem.py
```

Refer to `run_cem.py` for the different arguments.
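CEM itself is compact enough to sketch. Below is a minimal, self-contained illustration of the planning loop (not the code in `run_cem.py`; the function and parameter names here are ours, and the cost function is a toy):

```python
import numpy as np

def cem_plan(cost_fn, act_dim, horizon=10, pop_size=100, n_elite=10, n_iter=5):
    """Cross-Entropy Method: repeatedly sample action sequences from a
    Gaussian, keep the lowest-cost ("elite") samples, and refit the
    Gaussian to them."""
    mean = np.zeros((horizon, act_dim))
    std = np.ones((horizon, act_dim))
    for _ in range(n_iter):
        samples = mean + std * np.random.randn(pop_size, horizon, act_dim)
        costs = np.array([cost_fn(s) for s in samples])
        elites = samples[np.argsort(costs)[:n_elite]]
        mean, std = elites.mean(axis=0), elites.std(axis=0) + 1e-6
    return mean[0]  # MPC-style: execute only the first action, then replan

# Toy usage: drive a 1-D action sequence toward zero.
first_action = cem_plan(lambda seq: float((seq ** 2).sum()), act_dim=1)
```

In the benchmarks, the cost of a candidate action sequence comes from rolling it out in the simulator rather than from a closed-form function.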
Running CURL/SAC experiments:

```
python experiments/run_curl.py
```

Refer to `run_curl.py` for the different arguments.
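The core of CURL is a contrastive (InfoNCE) auxiliary loss between two random crops of the same observation, trained alongside SAC. A minimal sketch of that loss is below (illustrative only; the names and shapes are ours, not the repository's API):

```python
import torch
import torch.nn.functional as F

def curl_info_nce(z_anchor, z_positive, W):
    """CURL's contrastive loss. z_anchor and z_positive are (B, D)
    encodings of two augmented crops of the same B images; W is a
    (D, D) learned bilinear similarity. For each anchor, the matching
    row of z_positive is the positive and the rest are negatives.
    (In CURL, z_positive comes from a momentum-updated target encoder
    and is detached from the graph.)"""
    logits = z_anchor @ W @ z_positive.t()                     # (B, B)
    logits = logits - logits.max(dim=1, keepdim=True).values   # stability
    labels = torch.arange(logits.size(0))                      # diagonal = positives
    return F.cross_entropy(logits, labels)

# Toy usage with random features.
loss = curl_info_nce(torch.randn(32, 50), torch.randn(32, 50), torch.randn(50, 50))
```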
Running PlaNet experiments:

```
python experiments/run_planet.py
```

Refer to `run_planet.py` for the different arguments.
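PlaNet learns a latent dynamics model (an RSSM) and plans with CEM entirely in latent space, so candidate plans are scored without touching the environment. A minimal sketch of that scoring step follows; the `dynamics` and `reward_model` callables are hypothetical stand-ins for the learned model, not the repository's API:

```python
import torch

def imagined_return(state, actions, dynamics, reward_model):
    """Score one candidate action sequence by rolling the learned model
    forward in latent space and summing predicted rewards. This is the
    quantity a CEM planner would maximize."""
    total = torch.zeros(())
    for action in actions:
        state = dynamics(state, action)      # imagined latent transition
        total = total + reward_model(state)  # predicted reward, no env step
    return total

# Toy usage with random linear stand-ins for the learned model.
D, A, H = 8, 2, 10
Wd, Wa = torch.randn(D, D) * 0.1, torch.randn(A, D) * 0.1
ret = imagined_return(
    torch.zeros(D),
    torch.randn(H, A),
    dynamics=lambda s, a: torch.tanh(s @ Wd + a @ Wa),
    reward_model=lambda s: s.sum(),
)
```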
Running DrQ experiments:

```
python experiments/run_drq.py
```

Refer to `run_drq.py` for the different arguments.
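DrQ's central trick is to regularize image-based Q-learning by averaging over random shifts of the observations. A minimal sketch of the augmentation and the averaged TD target is below (illustrative; `value_fn` is a hypothetical stand-in for the target critic, and DrQ samples a shift per image rather than per batch):

```python
import torch
import torch.nn.functional as F

def random_shift(imgs, pad=4):
    """Random-shift augmentation: replication-pad the image, then crop
    back to the original size at a random offset."""
    _, _, h, w = imgs.shape
    padded = F.pad(imgs, (pad, pad, pad, pad), mode="replicate")
    top = int(torch.randint(0, 2 * pad + 1, (1,)))
    left = int(torch.randint(0, 2 * pad + 1, (1,)))
    return padded[:, :, top:top + h, left:left + w]

def drq_td_target(next_obs, value_fn, reward, discount, K=2):
    """Average the TD target over K augmented views of the next
    observation, which reduces the variance of the Q target."""
    values = torch.stack([value_fn(random_shift(next_obs)) for _ in range(K)])
    return reward + discount * values.mean(dim=0)

# Toy usage: mean pixel intensity as a stand-in value function.
target = drq_td_target(torch.randn(8, 9, 84, 84),
                       lambda o: o.mean(dim=(1, 2, 3)),
                       reward=torch.ones(8), discount=0.99)
```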
Train an MVP policy:

```
python experiments/run_mvp.py
```

Refer to `run_mvp.py` for the different arguments. Once the model is trained, use `rlpyt_cloth/max_q_eval_policy` to evaluate the policy, which selects the pick location with the maximum Q value.
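The idea behind the max-Q evaluation is simple: score every candidate pick location with the learned critic and greedily take the argmax. A minimal sketch (the `q_fn` callable and the pixel-grid candidates are hypothetical; the actual entry point is `max_q_eval_policy` in `rlpyt_cloth`):

```python
import numpy as np

def max_q_pick(candidate_picks, q_fn):
    """Greedy pick selection: evaluate the Q-function at every candidate
    pick location and return the highest-scoring one.
    candidate_picks: (N, 2) array of, e.g., pixel coordinates.
    q_fn: maps one pick location to a scalar Q value."""
    q_values = np.array([q_fn(p) for p in candidate_picks])
    return candidate_picks[int(np.argmax(q_values))]

# Toy usage: a Q-function that prefers picks near the image center.
grid = np.stack(np.meshgrid(np.arange(84), np.arange(84)), -1).reshape(-1, 2)
best_pick = max_q_pick(grid, lambda p: -np.linalg.norm(p - 42))
```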
Note: the default number of environment variations is set to 1. Set it to 1000 to reproduce the original experiments.
If you find this codebase useful in your research, please consider citing:
```
@inproceedings{corl2020softgym,
    title={SoftGym: Benchmarking Deep Reinforcement Learning for Deformable Object Manipulation},
    author={Lin, Xingyu and Wang, Yufei and Olkin, Jake and Held, David},
    booktitle={Conference on Robot Learning},
    year={2020}
}
```
- CURL implementation is from the official release: https://github.com/MishaLaskin/curl
- PlaNet implementation is modified from this repository: https://github.com/Kaixhin/PlaNet
- DrQ implementation is from the official repository: https://github.com/denisyarats/drq
- MVP implementation is from the official repository: https://github.com/wilson1yan/rlpyt
- SoftGym repository: https://github.com/Xingyu-Lin/softgym