td3_pendulum
¶
This is an example to train a task with TD3 algorithm.
Here, we create a gym environment InvertedDoublePendulum and use a TD3 with 1M steps.
- Results:
AverageReturn: 250 RiseTime: epoch 499
td3_pendulum
¶This is an example to train a task with TD3 algorithm.
Here, we create a gym environment InvertedDoublePendulum and use a TD3 with 1M steps.
AverageReturn: 250 RiseTime: epoch 499