Cross Entropy Method (CEM)
Paper: The cross-entropy method: A unified approach to Monte Carlo simulation, randomized optimization and machine learning [2]
The Cross Entropy Method (CEM) works by iteratively optimizing a Gaussian distribution over policy parameters.
In each epoch, CEM does the following:
Sample n_samples policies from a Gaussian distribution with mean cur_mean and standard deviation cur_std.
Collect episodes for each policy.
Update cur_mean and cur_std by maximum likelihood estimation over the n_best policies with the highest return.
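The three steps above can be sketched in a few lines of NumPy. This is a minimal illustration, not the library's implementation: it assumes a diagonal Gaussian over a flat parameter vector, and it replaces episode collection with a hypothetical `toy_return` function. The names `cem_epoch`, `evaluate_return`, `toy_return`, and `rng` are assumptions introduced here; only `n_samples`, `n_best`, `cur_mean`, and `cur_std` come from the description above.

```python
import numpy as np


def cem_epoch(evaluate_return, cur_mean, cur_std, n_samples, n_best, rng):
    """Run one CEM epoch and return the updated (mean, std)."""
    # Step 1: sample n_samples parameter vectors from N(cur_mean, cur_std^2).
    samples = rng.normal(cur_mean, cur_std, size=(n_samples, cur_mean.size))
    # Step 2: score each sampled policy (a stand-in for collecting episodes).
    returns = np.array([evaluate_return(theta) for theta in samples])
    # Step 3: keep the n_best samples by return and refit the Gaussian.
    # The per-dimension sample mean and (biased) sample std are the
    # maximum likelihood estimates for a diagonal Gaussian.
    best = samples[np.argsort(returns)[-n_best:]]
    return best.mean(axis=0), best.std(axis=0)


def toy_return(theta):
    # Hypothetical return: highest when theta equals the target vector.
    target = np.array([1.0, -2.0])
    return -np.sum((theta - target) ** 2)


rng = np.random.default_rng(0)
cur_mean, cur_std = np.zeros(2), 2.0 * np.ones(2)
for _ in range(30):
    cur_mean, cur_std = cem_epoch(toy_return, cur_mean, cur_std,
                                  n_samples=50, n_best=10, rng=rng)
```

On this toy problem, cur_mean should drift toward the target while cur_std shrinks as the elite samples cluster. In a real setting, `evaluate_return` would roll out the policy defined by each parameter vector and return its average episode return.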