garage.envs.step module

Environment step data type.

Step(observation, reward, done, **kwargs)[source]

Create a namedtuple from the results of environment.step(action).

Provides the option to put extra diagnostic info in the kwargs (if it exists) without demanding an explicit positional argument.

Parameters:
  • observation (object) – Agent’s observation of the current environment
  • reward (float) – Amount of reward returned after previous action
  • done (bool) – Whether the episode has ended, in which case further step() calls will return undefined results
  • kwargs – Keyword args
Returns:

A named tuple of the arguments.

Return type:

collections.namedtuple