Implementations of reinforcement learning (RL) algorithms in several environments.

Environments Coded:
- GridWorld Environment
- CartPole
- Mountain Car

Algorithms Coded:
- Cross-Entropy Method
- First-Choice Hill Climbing
- Temporal Difference Learning
- Sarsa Learning
- Q-Learning
- TD-Lambda
- Sarsa-Lambda
- Q-Lambda
- Actor-Critic
- REINFORCE
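
To illustrate how two of the listed algorithms differ, here is a minimal sketch of the tabular Sarsa and Q-Learning updates. It is not taken from this repository: the function names, the `Q` dictionary keyed by `(state, action)`, and the default `alpha` and `gamma` values are all assumptions made for the example.

```python
from collections import defaultdict


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.99, done=False):
    """On-policy update: bootstrap from the action actually chosen in s_next."""
    target = r if done else r + gamma * Q[(s_next, a_next)]
    Q[(s, a)] += alpha * (target - Q[(s, a)])


def q_learning_update(Q, s, a, r, s_next, actions, alpha=0.1, gamma=0.99, done=False):
    """Off-policy update: bootstrap from the best-valued action in s_next."""
    target = r if done else r + gamma * max(Q[(s_next, b)] for b in actions)
    Q[(s, a)] += alpha * (target - Q[(s, a)])


# Hypothetical usage: a value table that defaults to 0.0 for unseen pairs.
Q = defaultdict(float)
Q[(1, 0)] = 1.0
Q[(1, 1)] = 2.0
q_learning_update(Q, s=0, a=0, r=1.0, s_next=1, actions=[0, 1], alpha=0.5, gamma=0.5)
sarsa_update(Q, s=0, a=1, r=1.0, s_next=1, a_next=0, alpha=0.5, gamma=0.5)
```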

Action Selection Techniques:
- Epsilon Greedy
- Softmax
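
The two action selection techniques above can be sketched as follows. This is an illustrative example using only the Python standard library, not the code from this repository: the function names, the `temperature` parameter, and the random tie-breaking among greedy actions are assumptions.

```python
import math
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """With probability epsilon pick a uniformly random action, else a greedy one."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    best = max(q_values)
    # Break ties among equally good greedy actions at random.
    return rng.choice([a for a, q in enumerate(q_values) if q == best])


def softmax_probs(q_values, temperature=1.0):
    """Boltzmann distribution over actions; the max is subtracted for stability."""
    m = max(q_values)
    exps = [math.exp((q - m) / temperature) for q in q_values]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_action(q_values, temperature=1.0, rng=random):
    """Sample an action index in proportion to its softmax probability."""
    probs = softmax_probs(q_values, temperature)
    return rng.choices(range(len(q_values)), weights=probs, k=1)[0]
```

With `epsilon=0` the first function is purely greedy. In the softmax variant, a lower `temperature` concentrates probability on the highest-valued action, while a higher one approaches uniform sampling.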