garage.np.algos.cma_es
¶
Covariance Matrix Adaptation Evolution Strategy.
- class CMAES(env_spec, policy, sampler, n_samples, discount=0.99, sigma0=1.0)¶
Bases:
garage.np.algos.rl_algorithm.RLAlgorithm
Covariance Matrix Adaptation Evolution Strategy.
Note
The CMA-ES method can hardly learn a successful policy even for simple task. It is still maintained here only for consistency with original rllab paper.
- Parameters
env_spec (EnvSpec) – Environment specification.
policy (garage.np.policies.Policy) – Action policy.
sampler (garage.sampler.Sampler) – Sampler.
n_samples (int) – Number of policies sampled in one epoch.
discount (float) – Environment reward discount.
sigma0 (float) – Initial std for param distribution.