Implementations of reinforcement learning algorithms, with a custom OpenCV-based environment for testing them.
- Value Iteration
- Policy Iteration
- Q-Learning
- Deep Q-Learning
- Vanilla Policy Gradient - REINFORCE
- A2C
- A3C
- PPO
- DDPG

| Method | Deterministic Frozen Lake | Stochastic Frozen Lake |
|---|---|---|
| Value Iteration | 7 | 8 |
| Policy Iteration | 7 | 3 |
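
For readers new to the tabular methods listed above, the following is a minimal sketch of value iteration. It is not this repository's implementation. The transition table layout (`P[s][a]` as a list of `(prob, next_state, reward, done)` tuples), the four-state chain `P_CHAIN`, and the parameter names `gamma`, `theta`, and `max_sweeps` are all assumptions made for illustration.

```python
def value_iteration(P, gamma=0.9, theta=1e-8, max_sweeps=10_000):
    """Tabular value iteration.

    P[s][a] is a list of (prob, next_state, reward, done) tuples.
    This layout is an assumption for the sketch, not the repo's API.
    """
    n_states = len(P)
    V = [0.0] * n_states

    def q_value(s, a):
        # Expected one-step return of taking action a in state s.
        return sum(
            prob * (reward + (0.0 if done else gamma * V[nxt]))
            for prob, nxt, reward, done in P[s][a]
        )

    sweeps = 0
    for sweeps in range(1, max_sweeps + 1):
        delta = 0.0
        for s in range(n_states):
            best = max(q_value(s, a) for a in range(len(P[s])))
            delta = max(delta, abs(best - V[s]))
            V[s] = best  # in-place update within the sweep
        if delta < theta:
            break

    # Greedy policy with respect to the converged values.
    policy = [
        max(range(len(P[s])), key=lambda a: q_value(s, a))
        for s in range(n_states)
    ]
    return V, policy, sweeps


# Hypothetical deterministic chain: states 0..3, actions 0 = left, 1 = right.
# Entering state 3 (the goal) yields reward 1 and ends the episode.
P_CHAIN = [
    [[(1.0, 0, 0.0, False)], [(1.0, 1, 0.0, False)]],
    [[(1.0, 0, 0.0, False)], [(1.0, 2, 0.0, False)]],
    [[(1.0, 1, 0.0, False)], [(1.0, 3, 1.0, True)]],
    [[(1.0, 3, 0.0, True)], [(1.0, 3, 0.0, True)]],
]

V, policy, sweeps = value_iteration(P_CHAIN)
print(V, policy, sweeps)
```

On this toy chain the greedy policy moves right in every non-terminal state. A stochastic environment would fit the same layout by listing several `(prob, next_state, reward, done)` tuples per action.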