trpo_cartpole
¶
This is an example to train a task with TRPO algorithm.
Here it runs CartPole-v1 environment with 100 iterations.
- Results:
AverageReturn: 100 RiseTime: itr 13
trpo_cartpole
¶This is an example to train a task with TRPO algorithm.
Here it runs CartPole-v1 environment with 100 iterations.
AverageReturn: 100 RiseTime: itr 13