Skip to content

Exploring reinforcement learning concepts and implementation on maze problem. Discusses model state determination, state reduction techniques, impact of learning rate (α), and impact of discount factor (γ) on decision-making and convergence.

Notifications You must be signed in to change notification settings

proshir/Flagged-Maze-Reinforcement-Learning-Implementation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Flagged-Maze-Reinforcement-Learning

image

This project is presented in Jupyter Notebook format, providing visibility into the implementation of class definitions and algorithm training. You can observe the program's execution by using the rl.play(True, True) command within the pygame environment. This command executes the program, irrespective of the epsilon capability, facilitating a more focused search.

Model State Determination and Reduction

The number of model states is contingent upon the environment's size. We achieve state reduction by equating certain positions, streamlining the model's complexity.

Concepts and Components

States: These correspond to the agent's positions within the environment. Actions: Define agent movements, encompassing "up," "down," "left," and "right." Rewards: Define the system of penalties and incentives governing agent behavior. Goal State: Identified as "T," this marks the endpoint the agent must reach.

Learning Rate (α) Impact

The learning rate (α) significantly influences the algorithm's performance:

It affects the speed of convergence and oscillation. It strikes a balance between exploration and exploitation. It plays a pivotal role in stabilization and solution accuracy.

Discount Factor (γ) Impact

The discount factor (γ) holds a crucial role in reinforcement learning:

It delineates the importance of long-term versus short-term rewards. It guides the pursuit of optimal policies and underscores the significance of achieving the goal. It influences the convergence rate and temporal consistency of the learning process.

Result

image image

About

Exploring reinforcement learning concepts and implementation on maze problem. Discusses model state determination, state reduction techniques, impact of learning rate (α), and impact of discount factor (γ) on decision-making and convergence.

Resources

Stars

Watchers

Forks