This is an example of training MAML-TRPO on the Meta-World ML10 environment.
maml_trpo_metaworld_ml10(ctxt, seed, epochs, episodes_per_task, meta_batch_size)
Set up the environment and algorithm, then run the task.
ctxt (ExperimentContext) – Experiment configuration used by the trainer to create the snapshotter.
seed (int) – Seed for the random number generator, used to make runs reproducible.
epochs (int) – Number of training epochs.
episodes_per_task (int) – Number of episodes per epoch per task for training.
meta_batch_size (int) – Number of tasks sampled per batch.
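To make the roles of these parameters concrete, the following is a minimal sketch in plain Python, not the example's actual implementation. The task names, the `sample_schedule` helper, and the choice to sample tasks with replacement are all assumptions made for illustration. It shows how a fixed seed yields the same task sequence on every run, and how `meta_batch_size` and `episodes_per_task` together determine how many episodes one sampling round collects.

```python
import random

# Hypothetical task names standing in for the ML10 training tasks;
# the real benchmark defines its own task set.
TASKS = [f"task-{i}" for i in range(10)]


def sample_schedule(seed, epochs, episodes_per_task, meta_batch_size):
    """Return, for each epoch, the sampled task batch and its episode count.

    Illustrative helper only; it is not part of the documented example.
    """
    rng = random.Random(seed)  # same seed -> same sequence of sampled tasks
    schedule = []
    for _ in range(epochs):
        # Sample one meta-batch of tasks (with replacement, as an assumption).
        batch = [rng.choice(TASKS) for _ in range(meta_batch_size)]
        # Episodes collected in one sampling round over the whole batch.
        episodes = episodes_per_task * len(batch)
        schedule.append((batch, episodes))
    return schedule


# Illustrative values, not the example's defaults.
first = sample_schedule(seed=1, epochs=3, episodes_per_task=10, meta_batch_size=20)
second = sample_schedule(seed=1, epochs=3, episodes_per_task=10, meta_batch_size=20)
print(first == second)  # identical seeds give identical schedules
print(first[0][1])      # episodes in one sampling round
```

The count above covers a single sampling round per task. MAML typically collects data again after each inner adaptation step, so the number of episodes actually gathered per epoch may be a multiple of this figure.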