ACKTR

An actor-critic model in TensorFlow, using KFAC loss, as descriibed in: "Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation" by Wu et al. Tested on Atari games.

Presentation: https://drive.google.com/open?id=1tMWLk45CWVNBj8werpZe0QDEoMdCJ0MA6zPETjyzFds

Slides: https://docs.google.com/presentation/d/1nWnYXL_4z9sW_tO9mf_XMTHr4O0kfJczrXaqvEaX9Vc/edit?usp=sharing

Plots of realtive sample efficiencies as compared to baselines:

Name		Name	Last commit message	Last commit date
Latest commit History 183 Commits
save		save
.gitignore		.gitignore
ACKTR.pdf		ACKTR.pdf
LICENSE		LICENSE
README.md		README.md
Things that ACKTR is missing in paper.txt		Things that ACKTR is missing in paper.txt
acktr_model.py		acktr_model.py
atari_wrapper.py		atari_wrapper.py
baselines_utils.py		baselines_utils.py
breakout_results.jpg		breakout_results.jpg
constants.py		constants.py
monitor.py		monitor.py
plot.py		plot.py
pong_results.jpg		pong_results.jpg
random_agent.py		random_agent.py
run.py		run.py
subproc_vec_env.py		subproc_vec_env.py
transform_monitor.py		transform_monitor.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ACKTR

About

Releases

Packages

Contributors 4

Languages

License

dyelax/acktr

Folders and files

Latest commit

History

Repository files navigation

ACKTR

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 4

Languages

Packages