`tutorial_vpg`

This example shows how to add a simple VPG (Vanilla Policy Gradient) algorithm.

class SimpleVPG(env_spec, policy, sampler)

Simple Vanilla Policy Gradient.

Parameters

train(trainer)

Obtain samplers and start actual training for each epoch.
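The per-epoch loop that `train` describes can be illustrated with a minimal, standard-library sketch. Everything here is a hypothetical stand-in and not the library's API: `ToyTrainer`, `GaussianPolicy`, `ToyVPG`, and the method names `step_epochs` and `obtain_samples` are assumptions chosen for illustration, and the one-step task (reward peaks at action 1.0) is made up. The sketch only shows the general shape of vanilla policy gradient: collect samples each epoch, estimate the gradient of the expected return, and take an ascent step.

```python
import random


class ToyTrainer:
    """Hypothetical stand-in for a trainer; not the library's Trainer."""

    def __init__(self, n_epochs, batch_size, seed=0):
        self.n_epochs = n_epochs
        self.batch_size = batch_size
        self.rng = random.Random(seed)

    def step_epochs(self):
        # Yield epoch indices one at a time.
        yield from range(self.n_epochs)

    def obtain_samples(self, policy):
        # One-step episodes: the reward is highest when the action is 1.0.
        samples = []
        for _ in range(self.batch_size):
            action = policy.sample(self.rng)
            reward = -(action - 1.0) ** 2
            samples.append((action, reward))
        return samples


class GaussianPolicy:
    """1-D Gaussian policy with a learnable mean and a fixed std."""

    def __init__(self, mean=0.0, std=0.5):
        self.mean = mean
        self.std = std

    def sample(self, rng):
        return rng.gauss(self.mean, self.std)

    def grad_log_prob(self, action):
        # Derivative of log N(action | mean, std^2) with respect to mean.
        return (action - self.mean) / self.std ** 2


class ToyVPG:
    """Assumed minimal algorithm class exposing train(trainer)."""

    def __init__(self, policy, learning_rate=0.05):
        self.policy = policy
        self.learning_rate = learning_rate

    def train(self, trainer):
        for _ in trainer.step_epochs():
            samples = trainer.obtain_samples(self.policy)
            self._train_once(samples)
        return self.policy.mean

    def _train_once(self, samples):
        # Policy-gradient estimate: average of grad log pi(a) * return.
        grad = sum(self.policy.grad_log_prob(a) * r
                   for a, r in samples) / len(samples)
        # Gradient ascent on the expected return.
        self.policy.mean += self.learning_rate * grad


trainer = ToyTrainer(n_epochs=200, batch_size=100)
final_mean = ToyVPG(GaussianPolicy()).train(trainer)
print(round(final_mean, 2))
```

In this toy setting the policy mean should drift from 0.0 toward the rewarding action 1.0. A real implementation would typically use multi-step episodes, discounted returns, and an automatic-differentiation framework instead of a hand-derived gradient.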

tutorial_vpg(ctxt=None)

Train VPG with PointEnv environment.

Parameters: ctxt (ExperimentContext) – The experiment configuration used by Trainer to create the Snapshotter.
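To give a feel for the kind of task such an experiment trains on, here is a hedged sketch of a 2-D point-reaching environment. `ToyPointEnv` is a hypothetical class written for illustration and is not the library's `PointEnv`; the goal position, step clipping, reward (negative distance to the goal), and `reset`/`step` signatures are all assumptions.

```python
import math


class ToyPointEnv:
    """Hypothetical 2-D point task; not the library's PointEnv."""

    def __init__(self, goal=(1.0, 1.0), max_step=0.1, tolerance=0.05):
        self.goal = goal
        self.max_step = max_step
        self.tolerance = tolerance
        self.position = (0.0, 0.0)

    def reset(self):
        self.position = (0.0, 0.0)
        return self.position

    def step(self, action):
        # Clip each action component to [-max_step, max_step].
        dx, dy = (max(-self.max_step, min(self.max_step, a)) for a in action)
        self.position = (self.position[0] + dx, self.position[1] + dy)
        distance = math.dist(self.position, self.goal)
        reward = -distance  # closer to the goal means higher reward
        done = distance < self.tolerance
        return self.position, reward, done


env = ToyPointEnv()
env.reset()
done = False
steps = 0
while not done and steps < 100:
    # Hand-written controller standing in for a learned policy.
    action = (env.goal[0] - env.position[0], env.goal[1] - env.position[1])
    _, reward, done = env.step(action)
    steps += 1
print(done, steps)
```

The hand-written controller simply moves straight toward the goal; in an actual experiment a policy trained with VPG would take its place.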
