garage.tf package¶
Tensorflow Branch.
-
paths_to_tensors
(paths, max_path_length, baseline_predictions, discount, gae_lambda)[source]¶ Return processed sample data based on the collected paths.
Parameters: - paths (list[dict]) – A list of collected paths.
- max_path_length (int) – Maximum length of a single rollout.
- baseline_predictions (numpy.ndarray) – : Predicted value of GAE (Generalized Advantage Estimation) Baseline.
- discount (float) – Environment reward discount.
- gae_lambda (float) – Lambda used for generalized advantage estimation.
Returns: - Processed sample data, with key
- observations: (numpy.ndarray)
- actions: (numpy.ndarray)
- rewards: (numpy.ndarray)
- baselines: (numpy.ndarray)
- returns: (numpy.ndarray)
- valids: (numpy.ndarray)
- agent_infos: (dict)
- env_infos: (dict)
- paths: (list[dict])
Return type:
Subpackages¶
- garage.tf.algos package
- Submodules
- garage.tf.algos.ddpg module
- garage.tf.algos.dqn module
- garage.tf.algos.erwr module
- garage.tf.algos.npo module
- garage.tf.algos.ppo module
- garage.tf.algos.reps module
- garage.tf.algos.rl2 module
- garage.tf.algos.rl2ppo module
- garage.tf.algos.rl2trpo module
- garage.tf.algos.td3 module
- garage.tf.algos.te module
- garage.tf.algos.te_npo module
- garage.tf.algos.te_ppo module
- garage.tf.algos.tnpg module
- garage.tf.algos.trpo module
- garage.tf.algos.vpg module
- Submodules
- garage.tf.baselines package
- garage.tf.distributions package
- garage.tf.embeddings package
- garage.tf.misc package
- garage.tf.models package
- Submodules
- garage.tf.models.categorical_cnn_model module
- garage.tf.models.categorical_gru_model module
- garage.tf.models.categorical_lstm_model module
- garage.tf.models.categorical_mlp_model module
- garage.tf.models.cnn module
- garage.tf.models.cnn_mlp_merge_model module
- garage.tf.models.cnn_model module
- garage.tf.models.cnn_model_max_pooling module
- garage.tf.models.gaussian_cnn_model module
- garage.tf.models.gaussian_gru_model module
- garage.tf.models.gaussian_lstm_model module
- garage.tf.models.gaussian_mlp_model module
- garage.tf.models.gru module
- garage.tf.models.gru_model module
- garage.tf.models.lstm module
- garage.tf.models.lstm_model module
- garage.tf.models.mlp module
- garage.tf.models.mlp_dueling_model module
- garage.tf.models.mlp_merge_model module
- garage.tf.models.mlp_model module
- garage.tf.models.model module
- garage.tf.models.module module
- garage.tf.models.normalized_input_mlp_model module
- garage.tf.models.parameter module
- garage.tf.models.sequential module
- Submodules
- garage.tf.optimizers package
- garage.tf.plotter package
- garage.tf.policies package
- Submodules
- garage.tf.policies.categorical_cnn_policy module
- garage.tf.policies.categorical_gru_policy module
- garage.tf.policies.categorical_lstm_policy module
- garage.tf.policies.categorical_mlp_policy module
- garage.tf.policies.continuous_mlp_policy module
- garage.tf.policies.discrete_qf_derived_policy module
- garage.tf.policies.gaussian_gru_policy module
- garage.tf.policies.gaussian_lstm_policy module
- garage.tf.policies.gaussian_mlp_policy module
- garage.tf.policies.gaussian_mlp_task_embedding_policy module
- garage.tf.policies.policy module
- garage.tf.policies.task_embedding_policy module
- garage.tf.policies.uniform_control_policy module
- Submodules
- garage.tf.q_functions package
- garage.tf.regressors package
- Submodules
- garage.tf.regressors.bernoulli_mlp_regressor module
- garage.tf.regressors.categorical_mlp_regressor module
- garage.tf.regressors.categorical_mlp_regressor_model module
- garage.tf.regressors.continuous_mlp_regressor module
- garage.tf.regressors.gaussian_cnn_regressor module
- garage.tf.regressors.gaussian_cnn_regressor_model module
- garage.tf.regressors.gaussian_mlp_regressor module
- garage.tf.regressors.gaussian_mlp_regressor_model module
- garage.tf.regressors.regressor module
- Submodules
- garage.tf.samplers package