The videos reported here show the progression of the learned policies at different episodes. We have selected three goals. The videos show the outcome of the experiments for each of the considered agents:
Notice that the goal is represented with a green arrow (the orientation of the robot is indicated by the red axis).
The videos reported here show the progression of the learned policies at different episodes. We have selected three goals. The videos show the outcome of the experiments for each of the considered agents:
Notice that the goal is represented with a green arrow (the orientation of the robot is indicated by the red axis).