`garage.np.policies.uniform_random_policy`¶

Uniform random exploration strategy.

class UniformRandomPolicy(env_spec)[source]¶

Action taken is uniformly random.

property name¶

Name of policy.

property env_spec¶

Policy environment specification.

property observation_space¶

Observation space.

property action_space¶

Action space.

reset(do_resets=None)[source]¶

Reset the state of the exploration.

Parameters: do_resets (List[bool] or numpy.ndarray or None) – Which vectorization states to reset.

get_action(observation)[source]¶

Get action from this policy for the input observation.

Parameters: observation (numpy.ndarray) – Observation from the environment.
Returns: Actions with noise. List[dict]: Arbitrary policy state information (agent_info).
Return type: np.ndarray

get_actions(observations)[source]¶

Get actions from this policy for the input observation.

Parameters: observations (list) – Observations from the environment.
Returns: Actions with noise. List[dict]: Arbitrary policy state information (agent_info).
Return type: np.ndarray

garage.np.policies.uniform_random_policy¶