vpg_pendulum
¶
This is an example to train a task with VPG algorithm (PyTorch).
Here it runs InvertedDoublePendulum-v2 environment with 100 iterations.
- Results:
AverageReturn: 450 - 650
vpg_pendulum
¶This is an example to train a task with VPG algorithm (PyTorch).
Here it runs InvertedDoublePendulum-v2 environment with 100 iterations.
AverageReturn: 450 - 650