`garage.np.algos.nop`

NOP (no optimization performed) policy search algorithm.

class NOP

Bases: garage.np.algos.rl_algorithm.RLAlgorithm

NOP (no optimization performed) policy search algorithm.

optimize_policy(paths)

Optimize the policy using the samples. Because this is the NOP algorithm, no optimization is performed.

train(trainer)

Obtain samplers and start actual training for each epoch.

Parameters: trainer (Trainer) – Passed in to give the algorithm access to trainer.step_epochs(), which provides services such as snapshotting and sampler control.
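The shape of a no-op algorithm can be illustrated with a minimal, self-contained sketch. It does not import garage: `RLAlgorithmSketch`, `StubTrainer`, and `NOPSketch` are hypothetical stand-ins written for this example, and the loop over `step_epochs()` is an assumption about how an algorithm might consume the trainer, not a copy of the library's implementation.

```python
from abc import ABC, abstractmethod


class RLAlgorithmSketch(ABC):
    """Hypothetical stand-in for an RLAlgorithm-style base class."""

    @abstractmethod
    def train(self, trainer):
        """Run training, driven by the given trainer."""


class StubTrainer:
    """Hypothetical trainer exposing only a step_epochs() generator."""

    def __init__(self, n_epochs):
        self.n_epochs = n_epochs
        self.epochs_seen = []

    def step_epochs(self):
        # Yield one epoch index at a time. A real trainer would also
        # handle snapshotting and sampler control around each step.
        for epoch in range(self.n_epochs):
            self.epochs_seen.append(epoch)
            yield epoch


class NOPSketch(RLAlgorithmSketch):
    """No-op algorithm: accepts samples and a trainer, changes nothing."""

    def optimize_policy(self, paths):
        # Intentionally empty: no optimization is performed.
        return None

    def train(self, trainer):
        # Assumption: step through the epochs without sampling or updating.
        for _ in trainer.step_epochs():
            self.optimize_policy(paths=[])


trainer = StubTrainer(n_epochs=3)
algo = NOPSketch()
algo.train(trainer)
print(trainer.epochs_seen)  # [0, 1, 2]
```

An algorithm like this is mainly useful as a placeholder, for example to exercise a training pipeline end to end without changing the policy.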
