Bayesian Surprise

Repo for environments, gym wrappers, and scripts for the SMiRL project.

Requirements:

For distributing experiments.

doodad: https://github.com/montrealrobotics/doodad

RL library

rlkit: https://github.com/Neo-X/rlkit/tree/surprise

Build Instructions

conda create --name smirl_code python=3.7 pip
conda activate smirl_code
pip install -r requirements.txt
pip install -e ./
cd ../
git clone git@github.com:montrealrobotics/doodad.git
cd doodad
pip install -e ./
cd ../smirl_code

Then you will need to copy the config.py file locally to launchers.config.py and update the paths in the file. Set BASE_CODE_DIR to the location where you have saved SMiRL_Code, and set LOCAL_LOG_DIR to the location on your computer where you would like the logging data to be saved. See the doodad repository for more details on this configuration.
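
As a rough illustration of the edit described above, the two paths might look like this in the local copy. This is a sketch, not the actual contents of config.py: only the names BASE_CODE_DIR and LOCAL_LOG_DIR come from this README. The directory values are placeholders, and check_paths is a hypothetical helper that assumes absolute paths are wanted.

```python
import os

# Placeholder locations (assumptions); replace them with your own paths.
BASE_CODE_DIR = os.path.expanduser("~/SMiRL_Code")  # where this repository is saved
LOCAL_LOG_DIR = os.path.expanduser("~/smirl_logs")  # where logging data should go


def check_paths(base_code_dir, local_log_dir):
    """Hypothetical helper: list problems with the two configured paths."""
    problems = []
    if not os.path.isabs(base_code_dir):
        problems.append("BASE_CODE_DIR should be an absolute path")
    if not os.path.isabs(local_log_dir):
        problems.append("LOCAL_LOG_DIR should be an absolute path")
    return problems


if __name__ == "__main__":
    # Print any problems found; prints nothing when both paths look fine.
    for problem in check_paths(BASE_CODE_DIR, LOCAL_LOG_DIR):
        print(problem)
```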

Commands:

Basic examples.

python3 scripts/dqn_smirl.py --config=configs/tetris_SMiRL.json --run_mode=local --exp_name=test_smirl

python3 scripts/dqn_smirl.py --config=configs/Carnival_Small_SMiRL.json --run_mode=local --exp_name=test_smirl --training_processor_type=gpu

With docker locally

python3 scripts/dqn_smirl.py --config=configs/tetris_SMiRL.json --exp_name=test --run_mode=local_docker

### Run VizDoom SMiRL Experiments

python3 scripts/dqn_smirl.py --config=configs/VizDoom_TakeCover_Small.json --exp_name=vizdoom_small_test --run_mode=ssh --random_seeds=1 --meta_sim_threads=4 --log_comet=true --training_processor_type=gpu --tuningConfig=configs/GPU_indexes.json

python3 scripts/dqn_smirl.py --config=configs/VizDoom_DefendTheLine_Small.json --exp_name=vizdoom_DTL_small_smirl --run_mode=ssh --random_seeds=1 --meta_sim_threads=4 --log_comet=true --training_processor_type=gpu --tuningConfig=configs/GPU_indexes.json

python3 scripts/dqn_smirl.py --config=configs/VizDoom_DefendTheLine_Small_Bonus.json --exp_name=vizdoom_DTL_small_smirl_bonus --run_mode=ssh --ssh_host=newton1 --random_seeds=1 --meta_sim_threads=4 --log_comet=true --training_processor_type=gpu --tuningConfig=configs/GPU_indexes.json

Run Atari Experiments

python3 scripts/dqn_smirl.py --config=configs/Carnival_Small_SMiRL.json --exp_name=Atari_Carnival__small_smirl --run_mode=ssh --random_seeds=1 --meta_sim_threads=4 --log_comet=true --training_processor_type=gpu --tuningConfig=configs/GPU_indexes.json

python3 scripts/dqn_smirl.py --config=configs/Carnival_Small_SMiRL_Bonus.json --exp_name=Atari_Carnival_small_smirl_bonus --run_mode=ssh --ssh_host=newton1 --random_seeds=1 --meta_sim_threads=4 --log_comet=true --training_processor_type=gpu --tuningConfig=configs/GPU_indexes.json

python3 scripts/dqn_smirl.py --config=configs/IceHockey_Small_SMiRL.json --exp_name=Atari_IceHockey_small_smirl --run_mode=ssh --random_seeds=1 --meta_sim_threads=4 --log_comet=true --training_processor_type=gpu --tuningConfig=configs/GPU_indexes.json

python3 scripts/dqn_smirl.py --config=configs/RiverRaid_Small_SMiRL.json --exp_name=Atari_RiverRaid_small_smirl --run_mode=ssh --ssh_host=newton1 --random_seeds=1 --meta_sim_threads=4 --log_comet=true --training_processor_type=gpu --tuningConfig=configs/GPU_indexes.json
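
The remote commands above differ only in the config file, the experiment name, and the optional SSH host. The sketch below assembles them from those three values, which may be handy when queuing several runs. It is a convenience helper and not part of the repository: build_command is a hypothetical name, and it uses only flags that appear in the commands above. It builds the command without running it.

```python
# Flags shared by the remote (ssh) commands listed in this README.
COMMON_FLAGS = [
    "--random_seeds=1",
    "--meta_sim_threads=4",
    "--log_comet=true",
    "--training_processor_type=gpu",
    "--tuningConfig=configs/GPU_indexes.json",
]


def build_command(config, exp_name, ssh_host=None):
    """Hypothetical helper: return the argument list for one remote run."""
    cmd = [
        "python3",
        "scripts/dqn_smirl.py",
        f"--config={config}",
        f"--exp_name={exp_name}",
        "--run_mode=ssh",
    ]
    if ssh_host is not None:
        # Only some of the commands above name an explicit host.
        cmd.append(f"--ssh_host={ssh_host}")
    return cmd + COMMON_FLAGS


if __name__ == "__main__":
    runs = [
        ("configs/IceHockey_Small_SMiRL.json", "Atari_IceHockey_small_smirl", None),
        ("configs/RiverRaid_Small_SMiRL.json", "Atari_RiverRaid_small_smirl", "newton1"),
    ]
    for config, exp_name, host in runs:
        # Print each command so it can be reviewed before being run by hand.
        print(" ".join(build_command(config, exp_name, host)))
```

The returned list can also be passed to subprocess.run from the repository root if you prefer to launch the runs from Python.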
