Blackjack

Linked Article on Medium : https://medium.com/@ashish.dhiman/optimal-blackjack-strategy-with-reinforcement-learning-867ae03584ec

Step 1 : To run the all the experiments can be run using the run files located in the modules specific to the MDPs

Blackjack

blackjack.run_BJ_policy_iteration.py
blackjack.run_BJ_value_iteration.py
blackjack.run_BJ_q_learning.py

All the above files run the code specific to the algorithm. The hyper-parameters can be changed within the module.

Step 2: Once the algorithms executed, the necessary stats and policy images will be stored in the vi_stats, pi_stats, ql_stats directories for the respective algorithms.

These can now be used to generate various plots using the two include jupyer notebooks as follows :

frozen_lake_analysis.ipynb
blackjack_analysis.ipynb

Other files

agent.py : Contains the QLearningAgent which can be used for both frozen lake and Blackjack common.py : Some common functions for frozen lake stats and policy generation gym_to_mdp.py : Contains code to convert the Gym transition matrices to MDP toolbox and vice versa policy_iteration.py : Wrapper around MDPToolbox policy iteration which also saves the stats value_iteration.py : Wrapper around MDPToolbox value iteration which also saves the stats q_learning.py : Q-Learning training loop where agent interacts with the environment blackjack.blackjack_util.py : Utility functions for building the Model for Blackjack and simulating the gameplay blackjack.visualize_policy.py : Visualize the policy for the Blackjack game frozen_lake.frozen_lake_game.py : Simulation of Frozen lake game and getting the environment with parametes frozen_lake.visualize_policy.py : Visualize the policy for the FrozenLake game

Folowing python libraries are used only :

matplotlib
gymnasium
mdptoolbox
seaborn
numpy
scipy
tqdm
pandas

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Blackjack

Step 1 : To run the all the experiments can be run using the run files located in the modules specific to the MDPs

Step 2: Once the algorithms executed, the necessary stats and policy images will be stored in the vi_stats, pi_stats, ql_stats directories for the respective algorithms.

Other files

Folowing python libraries are used only :

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
blackjack		blackjack
.gitignore		.gitignore
P_dict_simulated-test.pickle		P_dict_simulated-test.pickle
README.md		README.md
agent.py		agent.py
blackjack_analysis.ipynb		blackjack_analysis.ipynb
common.py		common.py
gym_to_mdp.py		gym_to_mdp.py
policy_iteration.py		policy_iteration.py
q_learning.py		q_learning.py
requirement.txt		requirement.txt
value_iteration.py		value_iteration.py

aashishd/rl_blackjack

Folders and files

Latest commit

History

Repository files navigation

Blackjack

Step 1 : To run the all the experiments can be run using the run files located in the modules specific to the MDPs

Step 2: Once the algorithms executed, the necessary stats and policy images will be stored in the vi_stats, pi_stats, ql_stats directories for the respective algorithms.

Other files

Folowing python libraries are used only :

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages