garage.np.exploration_strategies.gaussian_strategy module

Gaussian exploration strategy.

class GaussianStrategy(env_spec, max_sigma=1.0, min_sigma=0.1, decay_period=1000000)[source]

Bases: garage.np.exploration_strategies.base.ExplorationStrategy

Add Gaussian noise to the action taken by the deterministic policy.

get_action(iteration, observation, policy, **kwargs)[source]

Get action from this policy for the input observation.

Parameters:
  • iteration (int) – Iteration.
  • observation (numpy.ndarray) – Observation from the environment.
  • policy (garage.tf.policies.base.Policy) – Policy network to predict action based on the observation.
Returns:

optimal action from this policy. agent_info(dict): Agent information.

Return type:

opt_action(numpy.ndarray)

get_actions(iteration, observations, policy, **kwargs)[source]

Get actions from this policy for the input observation.

Parameters:
  • iteration (int) – Iteration.
  • observatioan (list) – Observationa from the environment.
  • policy (garage.tf.policies.base.Policy) – Policy network to predict action based on the observation.
Returns:

optimal actions from this policy. agent_infos(dict): Agent information.

Return type:

opt_actions(numpy.ndarray)