Merged PR 517: create doc page on rewards

2024-08-23 08:54:47 +00:00
parent ff5a2e1bbe fbbaf65aab
commit 0e0fc96cd3
2 changed files with 117 additions and 0 deletions
--- a/docs/index.rst
+++ b/docs/index.rst
@@ -25,6 +25,7 @@ What is PrimAITE?
   source/game_layer
   source/simulation
   source/config
+   source/rewards
   source/customising_scenarios
   source/varying_config_files
   source/environment
--- a/docs/source/rewards.rst
+++ b/docs/source/rewards.rst
@@ -0,0 +1,116 @@
+.. only:: comment
+
+    © Crown-owned copyright 2024, Defence Science and Technology Laboratory UK
+
+Rewards
+#######
+
+Rewards in PrimAITE are based on a system of individual components that react to events in the simulation. An agent's reward function is calculated as the weighted sum of several reward components.
+
+Components
+**********
+The following API pages describe the use of each reward component and the possible configuration options. An example of configuring each via yaml is also provided.
+
+:py:class:`DummyReward`
+
+.. code-block:: yaml
+    agents:
+      - ref: agent_name
+        # ...
+        reward_function:
+          reward_components:
+            - type: DUMMY
+              weight: 1.0
+
+
+:py:class:`primaite.game.agent.rewards.DatabaseFileIntegrity`
+
+.. code-block:: yaml
+    agents:
+      - ref: agent_name
+        # ...
+        reward_function:
+          reward_components:
+            - type: DATABASE_FILE_INTEGRITY
+              weight: 1.0
+              options:
+                node_hostname: server_1
+                folder_name: database
+                file_name: database.db
+
+
+:py:class:`WebServer404Penalty`
+
+.. code-block:: yaml
+    agents:
+      - ref: agent_name
+        # ...
+        reward_function:
+          reward_components:
+            - type: WEB_SERVER_404_PENALTY
+              node_hostname: web_server
+              weight: 1.0
+              options:
+                service_name: WebService
+                sticky: false
+
+
+:py:class:`WebpageUnavailablePenalty`
+
+.. code-block:: yaml
+    agents:
+      - ref: agent_name
+        # ...
+        reward_function:
+          reward_components:
+            - type: WEBPAGE_UNAVAILABLE_PENALTY
+              node_hostname: computer_1
+              weight: 1.0
+              options:
+                sticky: false
+
+
+:py:class:`GreenAdminDatabaseUnreachablePenalty`
+
+.. code-block:: yaml
+    agents:
+      - ref: agent_name
+        # ...
+        reward_function:
+          reward_components:
+            - type: GREEN_ADMIN_DATABASE_UNREACHABLE_PENALTY
+              weight: 1.0
+              options:
+                node_hostname: admin_pc_1
+                sticky: false
+
+
+:py:class:`SharedReward`
+
+.. code-block:: yaml
+    agents:
+      - ref: scripted_agent
+        # ...
+      - ref: agent_name
+        # ...
+        reward_function:
+          reward_components:
+            - type: SHARED_REWARD
+              weight: 1.0
+              options:
+                agent_name: scripted_agent
+
+
+:py:class:`ActionPenalty`
+
+.. code-block:: yaml
+    agents:
+      - ref: agent_name
+        # ...
+        reward_function:
+          reward_components:
+            - type: ACTION_PENALTY
+              weight: 1.0
+              options:
+                  action_penalty: -0.3
+                  do_nothing_penalty: 0.0