garage.np.algos.nop
¶
NOP (no optimization performed) policy search algorithm.
-
class
NOP
¶ Bases:
garage.np.algos.rl_algorithm.RLAlgorithm
NOP (no optimization performed) policy search algorithm.
-
init_opt
(self)¶ Initialize the optimization procedure.
-
optimize_policy
(self, paths)¶ Optimize the policy using the samples.
Parameters: paths (list[dict]) – A list of collected paths.
-
train
(self, runner)¶ Obtain samplers and start actual training for each epoch.
Parameters: runner (LocalRunner) – LocalRunner is passed to give algorithm the access to runner.step_epochs(), which provides services such as snapshotting and sampler control.
-