tutorial_vpg
This example shows how to add a simple VPG (Vanilla Policy Gradient) algorithm.
- class SimpleVPG(env_spec, policy, sampler)
Simple Vanilla Policy Gradient.
- Parameters
env_spec (EnvSpec) – Environment specification.
policy (garage.tf.policies.StochasticPolicy) – Policy.
sampler (garage.sampler.Sampler) – Sampler.
- init_opt()
Initialize optimizer and build computation graph.
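This page does not show the graph that init_opt() builds. As a rough, framework-free sketch of the quantity a vanilla policy gradient method typically optimizes, the snippet below computes discounted returns for one episode and the usual surrogate loss. The function names `discounted_returns` and `vpg_surrogate_loss` are hypothetical and are not part of garage; the real class may compute its loss differently.

```python
def discounted_returns(rewards, discount):
    """Return G_t = r_t + discount * G_{t+1} for every step of one episode."""
    returns = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards so each step reuses the return of the step after it.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + discount * running
        returns[t] = running
    return returns


def vpg_surrogate_loss(log_probs, returns):
    """Negative mean of log pi(a_t | s_t) * G_t over the episode.

    Minimizing this value with a gradient-based optimizer follows the
    policy gradient estimate (hypothetical helper, not garage code).
    """
    if len(log_probs) != len(returns):
        raise ValueError("log_probs and returns must have the same length")
    total = sum(lp * g for lp, g in zip(log_probs, returns))
    return -total / len(returns)
```

In a TensorFlow or PyTorch implementation the same expression would be built from tensors so that the optimizer can differentiate it with respect to the policy parameters.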
- tutorial_vpg(ctxt=None)
Train VPG with the PointEnv environment.
- Parameters
ctxt (ExperimentContext) – The experiment configuration used by Trainer to create the Snapshotter.
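To give a feel for the kind of training loop such an experiment runs, here is a minimal, self-contained sketch of vanilla policy gradient on a toy one-step problem. It does not use garage, PointEnv, Trainer, or Snapshotter: the environment (reward equal to the negative squared distance between the action and a target), the fixed-variance Gaussian policy, and the function name `train_toy_vpg` are all assumptions made for illustration.

```python
import random


def train_toy_vpg(target=1.0, sigma=0.5, lr=0.05, n_epochs=300,
                  batch_size=64, seed=0):
    """Learn the mean of a Gaussian policy with the score-function gradient.

    Toy stand-in for a real experiment: each "episode" is a single action,
    and the reward is -(action - target) ** 2.
    """
    rng = random.Random(seed)
    mean = 0.0  # the only policy parameter in this sketch
    for _ in range(n_epochs):
        grad = 0.0
        for _ in range(batch_size):
            action = rng.gauss(mean, sigma)
            reward = -(action - target) ** 2
            # d/d(mean) of log N(action; mean, sigma^2) is (action - mean) / sigma^2.
            score = (action - mean) / sigma ** 2
            grad += score * reward
        # Gradient ascent on the expected reward, averaged over the batch.
        mean += lr * grad / batch_size
    return mean


if __name__ == "__main__":
    print(train_toy_vpg())
```

With these settings the learned mean should settle close to the target. A full experiment would replace the inner loop with sampled multi-step episodes and a neural network policy.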