Q-Learning Agent playing1 FrozenLake-v1

This is a trained model of a Q-Learning agent playing FrozenLake-v1 . With gamma=0 there is no information passed and so the only non zero value corresponds to the state just before the target.

array([[0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 0., 0.],
       [0., 0., 1., 0.],
       [0., 0., 0., 0.]])

Usage


model = load_from_hub(repo_id="casellimarco/q-FrozenLake-v1-4x4-noSlippery-gamma_0", filename="q-learning.pkl")

# Don't forget to check if you need to add additional attributes (is_slippery=False etc)
env = gym.make(model["env_id"])