ddpg_pendulum
¶
This is an example to train a task with DDPG algorithm.
Here it creates a gym environment InvertedDoublePendulum. And uses a DDPG with 1M steps.
- Results:
AverageReturn: 250 RiseTime: epoch 499
ddpg_pendulum
¶This is an example to train a task with DDPG algorithm.
Here it creates a gym environment InvertedDoublePendulum. And uses a DDPG with 1M steps.
AverageReturn: 250 RiseTime: epoch 499