garage.torch.q_functions.continuous_mlp_q_function module

This modules creates a continuous Q-function network.

class ContinuousMLPQFunction(env_spec, **kwargs)[source]

Bases: garage.torch.modules.mlp_module.MLPModule

Implements a continuous MLP Q-value network.

It predicts the Q-value for all actions based on the input state. It uses a PyTorch neural network module to fit the function of Q(s, a).

forward(observations, actions)[source]

Return Q-value(s).