`garage.np.policies.uniform_random_policy`

Uniform random exploration strategy.

class UniformRandomPolicy(env_spec)

Action taken is uniformly random.

reset(self, do_resets=None)

Reset the state of the exploration.

Parameters: do_resets (List[bool] or numpy.ndarray or None) – Which vectorization states to reset.

get_action(self, observation)

Get action from this policy for the input observation.

Parameters: observation (numpy.ndarray) – Observation from the environment.
Returns: The uniformly random action (np.ndarray), followed by arbitrary policy state information, agent_info (List[dict]).
Return type: tuple

get_actions(self, observations)

Get actions from this policy for the input observations.

Parameters: observations (list) – Observations from the environment.
Returns: The uniformly random actions, one per observation (np.ndarray), followed by arbitrary policy state information, agent_info (List[dict]).
Return type: tuple
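
The behaviour of get_action and get_actions described above can be illustrated with a minimal, dependency-free sketch. It does not import garage and is not the library's implementation: `BoxSpace` and `SketchUniformRandomPolicy` are hypothetical stand-ins, the action space is assumed to be a simple bounded box, and the empty dict returned as agent_info is an assumption made for illustration.

```python
import random


class BoxSpace:
    """Hypothetical stand-in for a bounded continuous action space."""

    def __init__(self, low, high, seed=None):
        self.low = list(low)
        self.high = list(high)
        self._rng = random.Random(seed)

    def sample(self):
        # Draw each dimension independently and uniformly between its bounds.
        return [self._rng.uniform(lo, hi)
                for lo, hi in zip(self.low, self.high)]


class SketchUniformRandomPolicy:
    """Illustrative policy that ignores observations (not garage's code)."""

    def __init__(self, action_space):
        self.action_space = action_space

    def get_action(self, observation):
        # The observation is unused: the action is purely random.
        # The empty dict is an assumed placeholder for agent_info.
        return self.action_space.sample(), dict()

    def get_actions(self, observations):
        # One independent sample per observation in the batch.
        actions = [self.action_space.sample() for _ in observations]
        return actions, dict()


space = BoxSpace(low=[-1.0, 0.0], high=[1.0, 2.0], seed=0)
policy = SketchUniformRandomPolicy(space)

action, agent_info = policy.get_action(observation=[0.3, 0.7])
actions, _ = policy.get_actions(
    observations=[[0.0, 0.0], [1.0, 1.0], [2.0, 2.0]])
```

Because the observation never influences the sample, a policy like this is typically useful for initial exploration or as a baseline rather than as a learned controller.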

property name(self)

Name of policy.

property env_spec(self)

Policy environment specification.

property observation_space(self)

Observation space.

property action_space(self)

Action space.
