26 lines
606 B
ReStructuredText
26 lines
606 B
ReStructuredText
.. only:: comment
|
|
|
|
© Crown-owned copyright 2023, Defence Science and Technology Laboratory UK
|
|
|
|
``training_config``
|
|
===================
|
|
|
|
``rl_framework``
|
|
----------------
|
|
The RL (Reinforcement Learning) Framework to use in the training session
|
|
|
|
Options available are:
|
|
|
|
- ``SB3`` (Stable Baselines 3)
|
|
- ``RLLIB_single_agent`` (Single Agent Ray RLLib)
|
|
- ``RLLIB_multi_agent`` (Multi Agent Ray RLLib)
|
|
|
|
``rl_algorithm``
|
|
----------------
|
|
The Reinforcement Learning Algorithm to use in the training session
|
|
|
|
Options available are:
|
|
|
|
- ``PPO`` (Proximal Policy Optimisation)
|
|
- ``A2C`` (Advantage Actor Critic)
|