Made for RL course UNIT8. Selfmade PPO 50k steps.
This is a trained model of a PPO agent playing LunarLander-v2.
-