garage.np.algos.meta_rl_algorithm module

Interface of Meta-RL Algorithms.

class MetaRLAlgorithm

Bases: garage.np.algos.rl_algorithm.RLAlgorithm, abc.ABC

Base class for Meta-RL Algorithms.

adapt_policy(exploration_policy, exploration_trajectories)

Produce a policy adapted for a task.

Parameters:

exploration_policy (garage.Policy) – A policy which was returned from get_exploration_policy(), and which generated exploration_trajectories by interacting with an environment. The caller must not use this object after passing it into this method.
exploration_trajectories (garage.TrajectoryBatch) – Trajectories to adapt to, generated by exploration_policy exploring the environment.

Returns:	A policy adapted to the task represented by exploration_trajectories.
Return type:	garage.Policy

get_exploration_policy()

Return a policy used before adaptation to a specific task.

Each time it is retrieved, this policy should only be evaluated in one task.

Returns:	The policy used to obtain samples that are later used for meta-RL adaptation.
Return type:	garage.Policy
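
The two methods are meant to be called in sequence for each task: retrieve an exploration policy, collect trajectories with it in that one task, then hand both to adapt_policy(). The sketch below illustrates that calling pattern only. It does not import garage: the MetaRLAlgorithm class here is a stand-in that mirrors the two method names documented above, and BanditPolicy, GreedyMetaAlgo, meta_test, and the (action, reward) trajectory format are all hypothetical names invented for the illustration, not part of the garage API.

```python
import abc
import random


class MetaRLAlgorithm(abc.ABC):
    """Stand-in mirroring the documented interface (not garage's class)."""

    @abc.abstractmethod
    def get_exploration_policy(self):
        """Return a policy used before adaptation to a specific task."""

    @abc.abstractmethod
    def adapt_policy(self, exploration_policy, exploration_trajectories):
        """Produce a policy adapted for a task."""


class BanditPolicy:
    """Hypothetical policy: samples an arm in proportion to its weight."""

    def __init__(self, prefs):
        self.prefs = list(prefs)

    def get_action(self, rng):
        return rng.choices(range(len(self.prefs)), weights=self.prefs)[0]


class GreedyMetaAlgo(MetaRLAlgorithm):
    """Toy algorithm: explore uniformly, then commit to the best arm seen."""

    def get_exploration_policy(self):
        # A fresh object per call, since each retrieved policy is meant
        # to be evaluated in a single task.
        return BanditPolicy([1.0, 1.0])

    def adapt_policy(self, exploration_policy, exploration_trajectories):
        n_arms = len(exploration_policy.prefs)
        totals = [0.0] * n_arms
        counts = [0] * n_arms
        # Assumed trajectory format for this sketch: (action, reward) pairs.
        for action, reward in exploration_trajectories:
            totals[action] += reward
            counts[action] += 1
        means = [t / c if c else 0.0 for t, c in zip(totals, counts)]
        best = max(range(n_arms), key=means.__getitem__)
        # The caller gave up the exploration policy, so it is safe to
        # mutate it in place and return it as the adapted policy.
        exploration_policy.prefs = [
            1.0 if arm == best else 0.0 for arm in range(n_arms)
        ]
        return exploration_policy


def meta_test(algo, task_rewards, n_explore=50, seed=0):
    """Run the explore-then-adapt pattern once per task."""
    rng = random.Random(seed)
    adapted = []
    for rewards in task_rewards:  # one reward table per task
        policy = algo.get_exploration_policy()
        trajectories = []
        for _ in range(n_explore):
            action = policy.get_action(rng)
            trajectories.append((action, rewards[action]))
        # After this call, only the returned policy is used.
        adapted.append(algo.adapt_policy(policy, trajectories))
    return adapted


policies = meta_test(GreedyMetaAlgo(), task_rewards=[[0.0, 1.0], [1.0, 0.0]])
best_arms = [p.prefs.index(max(p.prefs)) for p in policies]
```

Returning the mutated exploration policy is one way to read the rule that the caller gives up the object passed to adapt_policy(); an implementation could equally return a new policy object.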