Revisione 15:44, 6 Dic 2021

Supplementary material

The videos reported here show the progression of the learned policies at different episodes. We have selected three goals. The videos show the outcome of the experiments for each of the considered agents:

BL: Baseline (TD3)
BL+DM: Baseline + Difficulty Manager
BL+EP: Baseline + Episodic Noise
OURS: Baseline + Difficulty Manager + Episodic noise

Notice that the goal is represented with a green arrow (the orientation of the robot is indicated by the red axis).

@@ Riga 2: / Riga 2: @@
 The videos reported here show the progression of the learned policies at different episodes.
+We have selected three goals. The videos show the outcome of the experiments for each of the considered
+agents:
+* BL: Baseline (TD3)
+* BL+DM: Baseline + Difficulty Manager
+* BL+EP: Baseline + Episodic Noise
+* OURS: Baseline + Difficulty Manager + Episodic noise
+Notice that the goal is represented with a green arrow (the orientation of the robot is indicated by the red axis).
 ==Episode 135==

Differenze tra le versioni di "SERL"

Revisione 15:44, 6 Dic 2021

Supplementary material

Episode 135

Episode 605

Supplementary material[edit]

Episode 135[edit]

Episode 605[edit]