garage.np.exploration_strategies.ou_strategy module¶
This module creates an OU exploration strategy.
Ornstein Uhlenbeck exploration strategy comes from the Ornstein-Uhlenbeck process. It is often used in DDPG algorithm because in continuous control task it is better to have temporally correlated exploration to get smoother transitions. And OU process is relatively smooth in time.
-
class
OUStrategy
(env_spec, mu=0, sigma=0.3, theta=0.15, dt=0.01, x0=None)[source]¶ Bases:
garage.np.exploration_strategies.base.ExplorationStrategy
An OU exploration strategy to add noise to environment actions.
Parameters: - env_spec – Environment for OUStrategy to explore.
- mu – A parameter to simulate the process.
- sigma – A parameter to simulate the process.
- theta – A parameter to simulate the process.
- dt – A parameter to simulate the process.
- x0 – Initial state.
Example
$ python garage/tf/exploration_strategies/ou_strategy.py