Skip to content

ILQR implementation (following Tassa et al - IROS 2012) for gym pendulum environment, using both known model and linear Gaussian dynamic model learning (Levine et al - JMLR 2016).

Notifications You must be signed in to change notification settings

mgb45/ilqr_pendulum

 
 

Repository files navigation

ilqr_pendulum

ILQR implementation (following Tassa et al - IROS 1012) for gym pendulum environment, using both known model and linear Gaussian dynamic model learning (Levine et al - JMLR 2016).

  • gaussian_model_learner.ipynb shows how to learn a conditionally linear Gaussian dynamics model by fitting a joint Gaussian to state action pairs.

  • ilqr.ipynb is an ILQR implementation for a kinematic motion model of a wheeled mobile robot.

  • ilqr-gym.ipynb is an ILQR implementation for an OpenAI gym inverted pendulum, given a known dynamics model.

  • ilqr_model_learner.ipynb is an ILQR implementation for an OpenAI gym inverted pendulum, which attempts to learn the dynamics using motor babbling to fit a set of linear Gaussian dynamics models, as in the guided policy search paper. This works pretty poorly, as local linear models make it hard to plan ahead for a longer horizon, and it can be hard to gather data in terminal pendulum positions.

Requires

About

ILQR implementation (following Tassa et al - IROS 2012) for gym pendulum environment, using both known model and linear Gaussian dynamic model learning (Levine et al - JMLR 2016).

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%