Abstract: Deep reinforcement learning (DRL) algorithms interact with the environment and aim to learn without labeled data. In high-dimensional spaces, they evolve their policies to maximize the ...
# Note that latest version of mujoco-py, 1.5, does not play nicely with ubuntu 14.04 - # requires patchelf system package not available on 14.04 ...