garage.torch.policies package¶

PyTorch Policies.

class DeterministicMLPPolicy(env_spec, **kwargs)[source]¶

Implements a deterministic policy network.

The policy network selects action based on the state of the environment. It uses a PyTorch neural network module to fit the function of pi(s).

class GaussianMLPPolicy(env_spec, **kwargs)[source]¶

GaussianMLPPolicy.

A policy that contains a MLP to make prediction based on a gaussian distribution.

Parameters:	env_spec (garage.envs.env_spec.EnvSpec) – Environment specification. module – GaussianMLPModule to make prediction based on a gaussian distribution. –
Returns:

get_action(observation)[source]¶: Get a single action given an observation.

log_likelihood(observation, action)[source]¶: Get log likelihood given observations and action.

class Policy(env_spec)[source]¶

Bases: abc.ABC

Policy base class without Parameterzied.

Parameters:	env_spec (garage.envs.env_spec.EnvSpec) – Environment specification.

Submodules¶