MushroomRL Benchmark

stable

Benchmarks Results:

Policy Search Benchmarks
Actor-Critic Benchmarks
Value-Based Benchmarks

API:

Core functionality
Builders
Networks
Experiment
Utils

MushroomRL Benchmark

Docs »
Overview: module code

All modules for which code is available

mushroom_rl.algorithms.policy_search.black_box_optimization.constrained_reps
mushroom_rl.algorithms.policy_search.black_box_optimization.pgpe
mushroom_rl.algorithms.policy_search.black_box_optimization.reps
mushroom_rl.algorithms.policy_search.black_box_optimization.rwr
mushroom_rl.algorithms.policy_search.policy_gradient.enac
mushroom_rl.algorithms.policy_search.policy_gradient.gpomdp
mushroom_rl.algorithms.policy_search.policy_gradient.reinforce
mushroom_rl.algorithms.value.td.double_q_learning
mushroom_rl.algorithms.value.td.q_lambda
mushroom_rl.algorithms.value.td.q_learning
mushroom_rl.algorithms.value.td.sarsa
mushroom_rl.algorithms.value.td.sarsa_lambda
mushroom_rl.algorithms.value.td.speedy_q_learning
mushroom_rl.algorithms.value.td.weighted_q_learning
mushroom_rl_benchmark.builders.actor_critic.classic_actor_critic.copdac_q
mushroom_rl_benchmark.builders.actor_critic.classic_actor_critic.stochastic_ac
mushroom_rl_benchmark.builders.actor_critic.deep_actor_critic.a2c
mushroom_rl_benchmark.builders.actor_critic.deep_actor_critic.ddpg
mushroom_rl_benchmark.builders.actor_critic.deep_actor_critic.ppo
mushroom_rl_benchmark.builders.actor_critic.deep_actor_critic.sac
mushroom_rl_benchmark.builders.actor_critic.deep_actor_critic.td3
mushroom_rl_benchmark.builders.actor_critic.deep_actor_critic.trpo
mushroom_rl_benchmark.builders.agent_builder
mushroom_rl_benchmark.builders.environment_builder
mushroom_rl_benchmark.builders.network.a2c_network
mushroom_rl_benchmark.builders.network.ddpg_network
mushroom_rl_benchmark.builders.network.sac_network
mushroom_rl_benchmark.builders.network.td3_network
mushroom_rl_benchmark.builders.network.trpo_network
mushroom_rl_benchmark.builders.policy_search.black_box_optimization
mushroom_rl_benchmark.builders.policy_search.policy_gradient
mushroom_rl_benchmark.builders.value.dqn.averaged_dqn
mushroom_rl_benchmark.builders.value.dqn.categorical_dqn
mushroom_rl_benchmark.builders.value.dqn.double_dqn
mushroom_rl_benchmark.builders.value.dqn.dqn
mushroom_rl_benchmark.builders.value.dqn.dueling_dqn
mushroom_rl_benchmark.builders.value.dqn.maxmin_dqn
mushroom_rl_benchmark.builders.value.dqn.noisy_dqn
mushroom_rl_benchmark.builders.value.dqn.prioritized_dqn
mushroom_rl_benchmark.builders.value.td.sarsa_lambda_continuous
mushroom_rl_benchmark.builders.value.td.td_finite
mushroom_rl_benchmark.builders.value.td.td_trace
mushroom_rl_benchmark.builders.value.td.true_online_sarsa_lambda
mushroom_rl_benchmark.core.experiment
mushroom_rl_benchmark.core.logger
mushroom_rl_benchmark.core.suite
mushroom_rl_benchmark.core.suite_visualizer
mushroom_rl_benchmark.core.visualizer
mushroom_rl_benchmark.experiment.run
mushroom_rl_benchmark.experiment.slurm.aggregate_results
mushroom_rl_benchmark.experiment.slurm.arguments
mushroom_rl_benchmark.experiment.slurm.slurm_script
mushroom_rl_benchmark.utils.plot
mushroom_rl_benchmark.utils.primitive
mushroom_rl_benchmark.utils.sweep
mushroom_rl_benchmark.utils.utils

© Copyright 2020, 2021 Carlo D'Eramo, Davide Tateo Revision d6638e98.

Built with Sphinx using a theme provided by Read the Docs.