garage.np.algos.base module

Interface of RLAlgorithm.

class RLAlgorithm[source]

Bases: abc.ABC

Base class for all the algorithms.

Note

If sampler_cls isn’t specified to the LocalRunner, self.sampler_cls is required to provide default sampler for algorithm.

train(runner)[source]

Obtain samplers and start actual training for each epoch.

Parameters:runner (LocalRunner) – LocalRunner is passed to give algorithm the access to runner.step_epochs(), which provides services such as snapshotting and sampler control.
Returns:The average return in last epoch cycle or None.