Soft Actor-Critic (SAC) implementation with a state-of-the-art model-free extension (REDQ) and a state-of-the-art model-based extension (MBPO).
Algorithms implemented:
- Soft Actor-Critic (SAC)
- Model-Based Policy Optimization (MBPO)
- Randomized Ensembled Double Q-Learning (REDQ)
I was inspired by the following repositories: