PrimAITE/docs/source/about.rst

.. _about:

About PrimAITE
==============

Features
********

PrimAITE provides the following features:

* A flexible network / system laydown based on the Python networkx framework
* Nodes and links (edges) host Python classes in order to present attributes and methods (and hence, a more representative model of a platform / system)
* A ‘green agent’ Information Exchange Requirement (IER) function allows the representation of traffic (protocols and loading) on any / all links. Application of IERs is based on the status of node operating systems and services
* A ‘green agent’ node Pattern-of-Life (PoL) function allows the representation of core behaviours on nodes (e.g. Hardware state, Software State, Service state, File System state)
* An Access Control List (ACL) function, mimicking the behaviour of a network firewall, is applied across the model, following standard ACL rule format (e.g. DENY/ALLOW, source IP, destination IP, protocol and port). Application of IERs adheres to any ACL restrictions
* Presents an OpenAI Gym interface to the environment, allowing integration with any OpenAI Gym compliant defensive agents
* Red agent activity based on ‘red’ IERs and ‘red’ PoL
* Defined reward function for use with RL agents (based on nodes status, and green / red IER success)
* Fully configurable (network / system laydown, IERs, node PoL, ACL, episode step period, episode max steps) and repeatable to suit the training requirements of agents. Therefore, not bound to a representation of any particular platform, system or technology
* Full capture of discrete metrics relating to agent training (full system state, agent actions taken, average reward)
* Networkx provides laydown visualisation capability

Architecture - Nodes and Links
******************************

**Nodes**

An inheritance model has been adopted in order to model nodes. All nodes have the following base attributes (Class: Node):

* ID
* Name
* Type (e.g. computer, switch, RTU - enumeration)
* Priority (P1, P2, P3, P4 or P5 - enumeration)
* Hardware State (ON, OFF, RESETTING - enumeration)

Active Nodes also have the following attributes (Class: Active Node):

* IP Address
* Software State (GOOD, PATCHING, COMPROMISED - enumeration)
* File System State (GOOD, CORRUPT, DESTROYED, REPAIRING, RESTORING - enumeration)

Service Nodes also have the following attributes (Class: Service Node):

* List of Services (where service is composed of service name and port). There is no theoretical limit on the number of services that can be modelled. Services and protocols are currently intrinsically linked (i.e. a service is an application on a node transmitting traffic of this protocol type)
* Service state (GOOD, PATCHING, COMPROMISED, OVERWHELMED - enumeration)

Passive Nodes are currently not used (but may be employed for non IP-based components such as machinery actuators in future releases).

**Links**

Links are modelled both as network edges (networkx) and as Python classes, in order to extend their functionality. Links include the following attributes:

* ID
* Name
* Bandwidth (bits/s)
* Source node ID
* Destination node ID
* Protocol list (containing the loading of protocols currently running on the link)

When the simulation runs, IERs are applied to the links in order to model traffic loading, individually assigned to each protocol. This allows green (background) and red agent behaviour to be modelled, and defensive agents to identify suspicious traffic patterns at a protocol / traffic loading level of fidelity.

Information Exchange Requirements (IERs)
****************************************

PrimAITE adopts the concept of Information Exchange Requirements (IERs) to model both green agent (background) and red agent (adversary) behaviour. IERs are used to initiate modelling of traffic loading on the network, and have the following attributes:

* ID
* Start step (i.e. which step in the training episode should the IER start)
* End step (i.e. which step in the training episode should the IER end)
* Source node ID
* Destination node ID
* Load (bits/s)
* Protocol
* Port
* Running status (i.e. on / off)

The application of green agent IERs between a source and destination follows a number of rules. Specifically:

1. Does the current simulation time step fall between IER start and end step
2. Is the source node operational (both physically and at an O/S level), and is the service (protocol / port) associated with the IER (a) present on this node, and (b) in an operational state (i.e. not PATCHING)
3. Is the destination node operational (both physically and at an O/S level), and is the service (protocol / port) associated with the IER (a) present on this node, and (b) in an operational state (i.e. not PATCHING)
4. Are there any Access Control List rules in place that prevent the application of this IER
5. Are all switches in the (OSPF) path between source and destination operational (both physically and at an O/S level)

For red agent IERs, the application of IERs between a source and destination follows a number of subtly different rules. Specifically:

1. Does the current simulation time step fall between IER start and end step
2. Is the source node operational, and is the service (protocol / port) associated with the IER (a) present on that node and (b) already in a compromised state
3. Is the destination node operational, and is the service (protocol / port) associated with the IER present on that node
4. Are there any Access Control List rules in place that prevent the application of this IER
5. Are all switches in the (OSPF) path between source and destination operational (both physically and at an O/S level)

Assuming the rules pass, the IER is applied to all relevant links (based on use of OSPF) between source and destination.

Node Pattern-of-Life
********************

Every node can be impacted (i.e. have a status change applied to it) by either green agent pattern-of-life or red agent pattern-of-life. This is distinct from IERs, and allows for attacks (and defence) to be modelled purely within the confines of a node.

The status changes that can be made to a node are as follows:

* All Nodes:

   * Hardware State:

      * ON
      * OFF
      * RESETTING - when a status of resetting is entered, the node will automatically exit this state after a number of steps (as defined by the nodeResetDuration configuration item) after which it returns to an ON state

* Active Nodes and Service Nodes:

   * Software State:

      * GOOD
      * PATCHING - when a status of patching is entered, the node will automatically exit this state after a number of steps (as defined by the osPatchingDuration configuration item) after which it returns to a GOOD state
      * COMPROMISED

   * File System State:

      * GOOD
      * CORRUPT (can be resolved by repair or restore)
      * DESTROYED (can be resolved by restore only)
      * REPAIRING - when a status of repairing is entered, the node will automatically exit this state after a number of steps (as defined by the fileSystemRepairingLimit configuration item) after which it returns to a GOOD state
      * RESTORING - when a status of repairing is entered, the node will automatically exit this state after a number of steps (as defined by the fileSystemRestoringLimit configuration item) after which it returns to a GOOD state

* Service Nodes only:

   * Service State (for any associated service):

      * GOOD
      * PATCHING - when a status of patching is entered, the service will automatically exit this state after a number of steps (as defined by the servicePatchingDuration configuration item) after which it returns to a GOOD state
      * COMPROMISED
      * OVERWHELMED

Red agent pattern-of-life has an additional feature not found in the green pattern-of-life. This is the ability to influence the state of the attributes of a node via a number of different conditions:

   * DIRECT:

   The pattern-of-life described by the configuration file item will be applied regardless of any other conditions in the network. This is particularly useful for direct red agent entry into the network.

   * IER:

   The pattern-of-life described by the configuration file item will be applied to the service on the node, only if there is an IER of the same protocol / service type incoming at the specified timestep.

   * SERVICE:

   The pattern-of-life described by the configuration file item will be applied to the node based on the state of a service. The service can either be on the same node, or a different node within the network.

Access Control List modelling
*****************************

An Access Control List (ACL) is modelled to provide the means to manage traffic flows in the system. This will allow defensive agents the means to turn on / off rules, or potentially create new rules, to counter an attack.

The ACL follows a standard network firewall format. For example:

.. list-table:: ACL example
   :widths: 25 25 25 25 25
   :header-rows: 1

   * - Permission
     - Source IP
     - Dest IP
     - Protocol
     - Port
   * - DENY
     - 192.168.1.2
     - 192.168.1.3
     - HTTPS
     - 443
   * - ALLOW
     - 192.168.1.4
     - ANY
     - SMTP
     - 25
   * - DENY
     - ANY
     - 192.168.1.5
     - ANY
     - ANY

All ACL rules are considered when applying an IER. Logic follows the order of rules, so a DENY or ALLOW for the same parameters will override an earlier entry.

Observation Spaces
******************
The observation space provides the blue agent with information about the current status of nodes and links.

PrimAITE builds on top of Gym Spaces to create an observation space that is easily configurable for users. It's made up of components which are managed by the :py:class:`primaite.environment.observations.ObservationHandler`. Each training scenario can define its own observation space, and the user can choose which information to inlude, and how it should be formatted.

NodeLinkTable component
-----------------------
For example, the :py:class:`primaite.environment.observations.NodeLinkTable` component represents the status of nodes and links as a ``gym.spaces.Box`` with an example format shown below:

An example observation space is provided below:

.. list-table:: Observation Space example
   :widths: 25 25 25 25 25 25 25
   :header-rows: 1

   * -
     - ID
     - Hardware State
     - SoftwareState
     - File System State
     - Service / Protocol A
     - Service / Protocol B
   * - Node A
     - 1
     - 1
     - 1
     - 1
     - 1
     - 1
   * - Node B
     - 2
     - 1
     - 3
     - 1
     - 1
     - 1
   * - Node C
     - 3
     - 2
     - 1
     - 1
     - 3
     - 2
   * - Link 1
     - 5
     - 0
     - 0
     - 0
     - 0
     - 10000
   * - Link 2
     - 6
     - 0
     - 0
     - 0
     - 0
     - 10000
   * - Link 3
     - 7
     - 0
     - 0
     - 0
     - 5000
     - 0

For the nodes, the following values are represented:

 * ID
 * Hardware State:

    * 1 = ON
    * 2 = OFF
    * 3 = RESETTING

 * SoftwareState:

    * 1 = GOOD
    * 2 = PATCHING
    * 3 = COMPROMISED

 * Service State:

    * 1 = GOOD
    * 2 = PATCHING
    * 3 = COMPROMISED
    * 4 = OVERWHELMED

 * File System State:

    * 1 = GOOD
    * 2 = CORRUPT
    * 3 = DESTROYED
    * 4 = REPAIRING
    * 5 = RESTORING

(Note that each service available in the network is provided as a column, although not all nodes may utilise all services)

For the links, the following statuses are represented:

 * ID
 * Hardware State = N/A
 * SoftwareState = N/A
 * Protocol = loading in bits/s

NodeStatus component
----------------------
This is a MultiDiscrete observation space that can be though of as a one-dimensional vector of discrete states, represented by integers.
The example above would have the following structure:

.. code-block::

  [
    node1_info
    node2_info
    node3_info
  ]

Each ``node_info`` contains the following:

.. code-block::

  [
    hardware_state    (0=none, 1=ON, 2=OFF, 3=RESETTING)
    software_state    (0=none, 1=GOOD, 2=PATCHING, 3=COMPROMISED)
    file_system_state (0=none, 1=GOOD, 2=CORRUPT, 3=DESTROYED, 4=REPAIRING, 5=RESTORING)
    service1_state    (0=none, 1=GOOD, 2=PATCHING, 3=COMPROMISED)
    service2_state    (0=none, 1=GOOD, 2=PATCHING, 3=COMPROMISED)
  ]

In a network with three nodes and two services, the full observation space would have 15 elements. It can be written with ``gym`` notation to indicate the number of discrete options for each of the elements of the observation space. For example:

.. code-block::

  gym.spaces.MultiDiscrete([4,5,6,4,4,4,5,6,4,4,4,5,6,4,4])

LinkTrafficLevels
-----------------
This component is a MultiDiscrete space showing the traffic flow levels on the links in the network, after applying a threshold to convert it from a continuous to a discrete value.
The number of bins can be customised with 5 being the default. It has the following strucutre:
.. code-block::

  [
    link1_status
    link2_status
    link3_status
  ]

Each ``link_status`` is a number from 0-4 representing the network load in relation to bandwidth.

.. code-block::

  0 = No traffic (0%)
  1 = low traffic (<33%)
  2 = medium traffic (<66%)
  3 = high traffic (<100%)
  4 = max traffic/ overwhelmed (100%)

If the network has three links, the full observation space would have 3 elements. It can be written with ``gym`` notation to indicate the number of discrete options for each of the elements of the observation space. For example:

.. code-block::

  gym.spaces.MultiDiscrete([5,5,5])

Action Spaces
**************

The action space available to the blue agent comes in two types:

 1. Node-based
 2. Access Control List
 3. Any (Agent can take both node-based and ACL-based actions)

The choice of action space used during a training session is determined in the config_[name].yaml file.

**Node-Based**

The agent is able to influence the status of nodes by switching them off, resetting, or patching operating systems and services. In this instance, the action space is an OpenAI Gym spaces.Discrete type, as follows:

 * Dictionary item {... ,1: [x1, x2, x3,x4] ...}
   The placeholders inside the list under the key '1' mean the following:

    * [0, num nodes] - Node ID (0 = nothing, node ID)
    * [0, 4] - What property it's acting on (0 = nothing, 1 = state, 2 = SoftwareState, 3 = service state, 4 = file system state)
    * [0, 3] - Action on property (0 = nothing, 1 = on / scan, 2 = off / repair, 3 = reset / patch / restore)
    * [0, num services] - Resolves to service ID (0 = nothing, resolves to service)

**Access Control List**

The blue agent is able to influence the configuration of the Access Control List rule set (which implements a system-wide firewall). In this instance, the action space is an OpenAI spaces.Discrete type, as follows:

   * Dictionary item {... ,1: [x1, x2, x3, x4, x5, x6] ...}
   The placeholders inside the list under the key '1' mean the following:

     * [0, 2] - Action (0 = do nothing, 1 = create rule, 2 = delete rule)
     * [0, 1] - Permission (0 = DENY, 1 = ALLOW)
     * [0, num nodes] - Source IP (0 = any, then 1 -> x resolving to IP addresses)
     * [0, num nodes] - Dest IP (0 = any, then 1 -> x resolving to IP addresses)
     * [0, num services] - Protocol (0 = any, then 1 -> x resolving to protocol)
     * [0, num ports] - Port (0 = any, then 1 -> x resolving to port)

**ANY**
The agent is able to carry out both **Node-Based** and **Access Control List** operations.

This means the dictionary will contain key-value pairs in the format of BOTH Node-Based and Access Control List as seen above.

Rewards
*******

A reward value is presented back to the blue agent on the conclusion of every step. The reward value is calculated via two methods which combine to give the total value:

 1. Node and service status
 2. IER status

**Node and service status**

On every step, the status of each node is compared against both a reference environment (simulating the situation if the red and blue agents had not impacted the environment)
and the before and after state of the environment. If the comparison against the reference environment shows no difference, then the score provided is "AllOK". If there is a
difference with respect to the reference environment, the before and after states are compared, and a score determined. See :ref:`config` for details of reward values.

**IER status**

On every step, the full IER set is examined to determine whether green and red agent IERs are being permitted to run. Any red agent IERs running incur a penalty; any green agent
IERs not permitted to run also incur a penalty. See :ref:`config` for details of reward values.

Future Enhancements
*******************

The PrimAITE project has an ambition to include the following enhancements in future releases:

* Integration with a suitable standardised framework to allow multi-agent integration
* Integration with external threat emulation tools, either using off-line data, or integrating at runtime
* Provision of data such that agents can construct alternative observation spaces (as an alternative to the default PrimAITE observation space)
-												Committed the v1.1.0 code provided by James Short. Had to add setuptools==66 to setup.py as older versions of Gym are uninstallable with setuptools>=67

											
										
										
											2023-04-06 11:04:09 +01:00
+								.. _about:
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
 								About PrimAITE
 								==============
 								Features
 								********
 								PrimAITE provides the following features:
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								* A flexible network / system laydown based on the Python networkx framework
 								* Nodes and links (edges) host Python classes in order to present attributes and methods (and hence, a more representative model of a platform / system)
 								* A ‘green agent’ Information Exchange Requirement (IER) function allows the representation of traffic (protocols and loading) on any / all links. Application of IERs is based on the status of node operating systems and services
 								* A ‘green agent’ node Pattern-of-Life (PoL) function allows the representation of core behaviours on nodes (e.g. Hardware state, Software State, Service state, File System state)
 								* An Access Control List (ACL) function, mimicking the behaviour of a network firewall, is applied across the model, following standard ACL rule format (e.g. DENY/ALLOW, source IP, destination IP, protocol and port). Application of IERs adheres to any ACL restrictions
 								* Presents an OpenAI Gym interface to the environment, allowing integration with any OpenAI Gym compliant defensive agents
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								* Red agent activity based on ‘red’ IERs and ‘red’ PoL
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								* Defined reward function for use with RL agents (based on nodes status, and green / red IER success)
 								* Fully configurable (network / system laydown, IERs, node PoL, ACL, episode step period, episode max steps) and repeatable to suit the training requirements of agents. Therefore, not bound to a representation of any particular platform, system or technology
 								* Full capture of discrete metrics relating to agent training (full system state, agent actions taken, average reward)
 								* Networkx provides laydown visualisation capability
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
 								Architecture - Nodes and Links
 								******************************
 								**Nodes**
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								An inheritance model has been adopted in order to model nodes. All nodes have the following base attributes (Class: Node):
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								* ID
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								* Name
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								* Type (e.g. computer, switch, RTU - enumeration)
 								* Priority (P1, P2, P3, P4 or P5 - enumeration)
 								* Hardware State (ON, OFF, RESETTING - enumeration)
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								Active Nodes also have the following attributes (Class: Active Node):
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								* IP Address
 								* Software State (GOOD, PATCHING, COMPROMISED - enumeration)
-												Committed the v1.1.0 code provided by James Short. Had to add setuptools==66 to setup.py as older versions of Gym are uninstallable with setuptools>=67

											
										
										
											2023-04-06 11:04:09 +01:00
+								* File System State (GOOD, CORRUPT, DESTROYED, REPAIRING, RESTORING - enumeration)
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								Service Nodes also have the following attributes (Class: Service Node):
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								* List of Services (where service is composed of service name and port). There is no theoretical limit on the number of services that can be modelled. Services and protocols are currently intrinsically linked (i.e. a service is an application on a node transmitting traffic of this protocol type)
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								* Service state (GOOD, PATCHING, COMPROMISED, OVERWHELMED - enumeration)
 								Passive Nodes are currently not used (but may be employed for non IP-based components such as machinery actuators in future releases).
 								**Links**
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								Links are modelled both as network edges (networkx) and as Python classes, in order to extend their functionality. Links include the following attributes:
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								* ID
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								* Name
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								* Bandwidth (bits/s)
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								* Source node ID
 								* Destination node ID
 								* Protocol list (containing the loading of protocols currently running on the link)
 								When the simulation runs, IERs are applied to the links in order to model traffic loading, individually assigned to each protocol. This allows green (background) and red agent behaviour to be modelled, and defensive agents to identify suspicious traffic patterns at a protocol / traffic loading level of fidelity.
 								Information Exchange Requirements (IERs)
 								****************************************
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								PrimAITE adopts the concept of Information Exchange Requirements (IERs) to model both green agent (background) and red agent (adversary) behaviour. IERs are used to initiate modelling of traffic loading on the network, and have the following attributes:
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								* ID
 								* Start step (i.e. which step in the training episode should the IER start)
 								* End step (i.e. which step in the training episode should the IER end)
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								* Source node ID
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								* Destination node ID
 								* Load (bits/s)
 								* Protocol
 								* Port
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								* Running status (i.e. on / off)
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								The application of green agent IERs between a source and destination follows a number of rules. Specifically:
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+. Does the current simulation time step fall between IER start and end step
 . Is the source node operational (both physically and at an O/S level), and is the service (protocol / port) associated with the IER (a) present on this node, and (b) in an operational state (i.e. not PATCHING)
 . Is the destination node operational (both physically and at an O/S level), and is the service (protocol / port) associated with the IER (a) present on this node, and (b) in an operational state (i.e. not PATCHING)
 . Are there any Access Control List rules in place that prevent the application of this IER
 . Are all switches in the (OSPF) path between source and destination operational (both physically and at an O/S level)
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								For red agent IERs, the application of IERs between a source and destination follows a number of subtly different rules. Specifically:
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+. Does the current simulation time step fall between IER start and end step
 . Is the source node operational, and is the service (protocol / port) associated with the IER (a) present on that node and (b) already in a compromised state
 . Is the destination node operational, and is the service (protocol / port) associated with the IER present on that node
 . Are there any Access Control List rules in place that prevent the application of this IER
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+. Are all switches in the (OSPF) path between source and destination operational (both physically and at an O/S level)
 								Assuming the rules pass, the IER is applied to all relevant links (based on use of OSPF) between source and destination.
 								Node Pattern-of-Life
 								********************
 								Every node can be impacted (i.e. have a status change applied to it) by either green agent pattern-of-life or red agent pattern-of-life. This is distinct from IERs, and allows for attacks (and defence) to be modelled purely within the confines of a node.
 								The status changes that can be made to a node are as follows:
 								* All Nodes:
-												#1355 - Carried out full renaming in node.py, active_node.py, passive_node.py, and service_node.py to make params and variable names explicit.
- Made the same renaming in the yaml laydown config files.
- Added Type hints wherever I've been.
- Added a custom NodeType in custom_typing.py to encompass the Union of ActiveNode, PassiveNode, ServiceNode.

											
										
										
											2023-05-25 21:03:11 +01:00
+								   * Hardware State:
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
 								      * ON
 								      * OFF
-												Ran pre-commit hook on all files and performed changes to fix flake8 failures

											
										
										
											2023-05-25 11:42:19 +01:00
+								      * RESETTING - when a status of resetting is entered, the node will automatically exit this state after a number of steps (as defined by the nodeResetDuration configuration item) after which it returns to an ON state
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
 								* Active Nodes and Service Nodes:
-												#1355 - Carried out full renaming in node.py, active_node.py, passive_node.py, and service_node.py to make params and variable names explicit.
- Made the same renaming in the yaml laydown config files.
- Added Type hints wherever I've been.
- Added a custom NodeType in custom_typing.py to encompass the Union of ActiveNode, PassiveNode, ServiceNode.

											
										
										
											2023-05-25 21:03:11 +01:00
+								   * Software State:
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
 								      * GOOD
 								      * PATCHING - when a status of patching is entered, the node will automatically exit this state after a number of steps (as defined by the osPatchingDuration configuration item) after which it returns to a GOOD state
 								      * COMPROMISED
-												Committed the v1.1.0 code provided by James Short. Had to add setuptools==66 to setup.py as older versions of Gym are uninstallable with setuptools>=67

											
										
										
											2023-04-06 11:04:09 +01:00
+								   * File System State:
 								      * GOOD
 								      * CORRUPT (can be resolved by repair or restore)
 								      * DESTROYED (can be resolved by restore only)
 								      * REPAIRING - when a status of repairing is entered, the node will automatically exit this state after a number of steps (as defined by the fileSystemRepairingLimit configuration item) after which it returns to a GOOD state
 								      * RESTORING - when a status of repairing is entered, the node will automatically exit this state after a number of steps (as defined by the fileSystemRestoringLimit configuration item) after which it returns to a GOOD state
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								* Service Nodes only:
 								   * Service State (for any associated service):
 								      * GOOD
 								      * PATCHING - when a status of patching is entered, the service will automatically exit this state after a number of steps (as defined by the servicePatchingDuration configuration item) after which it returns to a GOOD state
 								      * COMPROMISED
 								      * OVERWHELMED
-												Committed the v1.1.0 code provided by James Short. Had to add setuptools==66 to setup.py as older versions of Gym are uninstallable with setuptools>=67

											
										
										
											2023-04-06 11:04:09 +01:00
+								Red agent pattern-of-life has an additional feature not found in the green pattern-of-life. This is the ability to influence the state of the attributes of a node via a number of different conditions:
 								   * DIRECT:
 								   The pattern-of-life described by the configuration file item will be applied regardless of any other conditions in the network. This is particularly useful for direct red agent entry into the network.
 								   * IER:
 								   The pattern-of-life described by the configuration file item will be applied to the service on the node, only if there is an IER of the same protocol / service type incoming at the specified timestep.
 								   * SERVICE:
 								   The pattern-of-life described by the configuration file item will be applied to the node based on the state of a service. The service can either be on the same node, or a different node within the network.
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								Access Control List modelling
 								*****************************
-												#915 - Created app dirs and set as constants in the top-level init.
- renamed _config_values_main to training_config.py and renamed the ConfigValuesMain class to TrainingConfig.
Moved training_config.py to src/primaite/config/training_config.py
- Renamed all training config yaml file keys to make creating an instance of TrainingConfig easier.
Moved action_type and num_steps over to the training config.
- Decoupled the training config and lay down config.
- Refactored main.py so that it can be ran from CLI and can take a training config path and a lay down config path.
- refactored all outputs so that they save to the session dir.
- Added some necessary setup scripts that handle creating app dirs, fronting example config files to the user, fronting demo notebooks to the user, performing clean-up in between installations etc.
- Added functions that attempt to retrieve the file path of users example config files that have been fronted by the primaite setup.
- Added logging config and a getLogger function in the top-level init.
- Refactored all logs entries logged to use a logger using the primaite logging config.
- Added basic typer CLI for doing things like setup, viewing logs, viewing primaite version, running a basic session.
- Updated test to use new features and config structures.
- Began updating docs. More to do here.

											
										
										
											2023-06-07 22:40:16 +01:00
+								An Access Control List (ACL) is modelled to provide the means to manage traffic flows in the system. This will allow defensive agents the means to turn on / off rules, or potentially create new rules, to counter an attack.
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
 								The ACL follows a standard network firewall format. For example:
 								.. list-table:: ACL example
 								   :widths: 25 25 25 25 25
 								   :header-rows: 1
 								   * - Permission
 								     - Source IP
 								     - Dest IP
 								     - Protocol
 								     - Port
 								   * - DENY
 								     - 192.168.1.2
 								     - 192.168.1.3
 								     - HTTPS
 								     - 443
 								   * - ALLOW
 								     - 192.168.1.4
 								     - ANY
 								     - SMTP
 								     - 25
 								   * - DENY
 								     - ANY
 								     - 192.168.1.5
 								     - ANY
 								     - ANY
 								All ACL rules are considered when applying an IER. Logic follows the order of rules, so a DENY or ALLOW for the same parameters will override an earlier entry.
 								Observation Spaces
 								******************
-												Update docs page on observations

											
										
										
											2023-06-01 21:42:34 +01:00
+								The observation space provides the blue agent with information about the current status of nodes and links.
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
-												Update docs page on observations

											
										
										
											2023-06-01 21:42:34 +01:00
+								PrimAITE builds on top of Gym Spaces to create an observation space that is easily configurable for users. It's made up of components which are managed by the :py:class:`primaite.environment.observations.ObservationHandler`. Each training scenario can define its own observation space, and the user can choose which information to inlude, and how it should be formatted.
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
-												Update docs page on observations

											
										
										
											2023-06-01 21:42:34 +01:00
+								NodeLinkTable component
 								-----------------------
 								For example, the :py:class:`primaite.environment.observations.NodeLinkTable` component represents the status of nodes and links as a ``gym.spaces.Box`` with an example format shown below:
-												Update docs on MultiDiscrete observation spaces.

											
										
										
											2023-05-30 16:54:34 +01:00
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								An example observation space is provided below:
 								.. list-table:: Observation Space example
-												Committed the v1.1.0 code provided by James Short. Had to add setuptools==66 to setup.py as older versions of Gym are uninstallable with setuptools>=67

											
										
										
											2023-04-06 11:04:09 +01:00
+								   :widths: 25 25 25 25 25 25 25
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								   :header-rows: 1
-												Ran pre-commit hook on all files and performed changes to fix flake8 failures

											
										
										
											2023-05-25 11:42:19 +01:00
+								   * -
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								     - ID
-												#1355 - Carried out full renaming in node.py, active_node.py, passive_node.py, and service_node.py to make params and variable names explicit.
- Made the same renaming in the yaml laydown config files.
- Added Type hints wherever I've been.
- Added a custom NodeType in custom_typing.py to encompass the Union of ActiveNode, PassiveNode, ServiceNode.

											
										
										
											2023-05-25 21:03:11 +01:00
+								     - Hardware State
 								     - SoftwareState
-												Committed the v1.1.0 code provided by James Short. Had to add setuptools==66 to setup.py as older versions of Gym are uninstallable with setuptools>=67

											
										
										
											2023-04-06 11:04:09 +01:00
+								     - File System State
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								     - Service / Protocol A
 								     - Service / Protocol B
 								   * - Node A
 								     - 1
 								     - 1
 								     - 1
 								     - 1
 								     - 1
-												Committed the v1.1.0 code provided by James Short. Had to add setuptools==66 to setup.py as older versions of Gym are uninstallable with setuptools>=67

											
										
										
											2023-04-06 11:04:09 +01:00
+								     - 1
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								   * - Node B
 								     - 2
 								     - 1
 								     - 3
 								     - 1
 								     - 1
-												Committed the v1.1.0 code provided by James Short. Had to add setuptools==66 to setup.py as older versions of Gym are uninstallable with setuptools>=67

											
										
										
											2023-04-06 11:04:09 +01:00
+								     - 1
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								   * - Node C
 								     - 3
 								     - 2
 								     - 1
-												Committed the v1.1.0 code provided by James Short. Had to add setuptools==66 to setup.py as older versions of Gym are uninstallable with setuptools>=67

											
										
										
											2023-04-06 11:04:09 +01:00
+								     - 1
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								     - 3
 								     - 2
 								   * - Link 1
 								     - 5
-												Committed the v1.1.0 code provided by James Short. Had to add setuptools==66 to setup.py as older versions of Gym are uninstallable with setuptools>=67

											
										
										
											2023-04-06 11:04:09 +01:00
+								     - 0
 								     - 0
 								     - 0
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								     - 0
 								     - 10000
 								   * - Link 2
 								     - 6
-												Committed the v1.1.0 code provided by James Short. Had to add setuptools==66 to setup.py as older versions of Gym are uninstallable with setuptools>=67

											
										
										
											2023-04-06 11:04:09 +01:00
+								     - 0
 								     - 0
 								     - 0
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								     - 0
 								     - 10000
 								   * - Link 3
 								     - 7
 								     - 0
 								     - 0
-												Committed the v1.1.0 code provided by James Short. Had to add setuptools==66 to setup.py as older versions of Gym are uninstallable with setuptools>=67

											
										
										
											2023-04-06 11:04:09 +01:00
+								     - 0
 								     - 5000
 								     - 0
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
 								For the nodes, the following values are represented:
 								 * ID
-												#1355 - Carried out full renaming in node.py, active_node.py, passive_node.py, and service_node.py to make params and variable names explicit.
- Made the same renaming in the yaml laydown config files.
- Added Type hints wherever I've been.
- Added a custom NodeType in custom_typing.py to encompass the Union of ActiveNode, PassiveNode, ServiceNode.

											
										
										
											2023-05-25 21:03:11 +01:00
+								 * Hardware State:
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
 								    * 1 = ON
 								    * 2 = OFF
 								    * 3 = RESETTING
-												#1355 - Carried out full renaming in node.py, active_node.py, passive_node.py, and service_node.py to make params and variable names explicit.
- Made the same renaming in the yaml laydown config files.
- Added Type hints wherever I've been.
- Added a custom NodeType in custom_typing.py to encompass the Union of ActiveNode, PassiveNode, ServiceNode.

											
										
										
											2023-05-25 21:03:11 +01:00
+								 * SoftwareState:
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
 								    * 1 = GOOD
 								    * 2 = PATCHING
 								    * 3 = COMPROMISED
 								 * Service State:
 								    * 1 = GOOD
 								    * 2 = PATCHING
 								    * 3 = COMPROMISED
 								    * 4 = OVERWHELMED
-												Committed the v1.1.0 code provided by James Short. Had to add setuptools==66 to setup.py as older versions of Gym are uninstallable with setuptools>=67

											
										
										
											2023-04-06 11:04:09 +01:00
+								 * File System State:
 								    * 1 = GOOD
 								    * 2 = CORRUPT
 								    * 3 = DESTROYED
 								    * 4 = REPAIRING
 								    * 5 = RESTORING
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								(Note that each service available in the network is provided as a column, although not all nodes may utilise all services)
 								For the links, the following statuses are represented:
 								 * ID
-												#1355 - Carried out full renaming in node.py, active_node.py, passive_node.py, and service_node.py to make params and variable names explicit.
- Made the same renaming in the yaml laydown config files.
- Added Type hints wherever I've been.
- Added a custom NodeType in custom_typing.py to encompass the Union of ActiveNode, PassiveNode, ServiceNode.

											
										
										
											2023-05-25 21:03:11 +01:00
+								 * Hardware State = N/A
 								 * SoftwareState = N/A
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								 * Protocol = loading in bits/s
-												Update docs page on observations

											
										
										
											2023-06-01 21:42:34 +01:00
+								NodeStatus component
 								----------------------
 								This is a MultiDiscrete observation space that can be though of as a one-dimensional vector of discrete states, represented by integers.
-												Update docs on MultiDiscrete observation spaces.

											
										
										
											2023-05-30 16:54:34 +01:00
+								The example above would have the following structure:
 								.. code-block::
 								  [
 								    node1_info
 								    node2_info
 								    node3_info
 								  ]
 								Each ``node_info`` contains the following:
 								.. code-block::
 								  [
 								    hardware_state    (0=none, 1=ON, 2=OFF, 3=RESETTING)
 								    software_state    (0=none, 1=GOOD, 2=PATCHING, 3=COMPROMISED)
 								    file_system_state (0=none, 1=GOOD, 2=CORRUPT, 3=DESTROYED, 4=REPAIRING, 5=RESTORING)
 								    service1_state    (0=none, 1=GOOD, 2=PATCHING, 3=COMPROMISED)
 								    service2_state    (0=none, 1=GOOD, 2=PATCHING, 3=COMPROMISED)
 								  ]
-												Update docs page on observations

											
										
										
											2023-06-01 21:42:34 +01:00
+								In a network with three nodes and two services, the full observation space would have 15 elements. It can be written with ``gym`` notation to indicate the number of discrete options for each of the elements of the observation space. For example:
 								.. code-block::
 								  gym.spaces.MultiDiscrete([4,5,6,4,4,4,5,6,4,4,4,5,6,4,4])
 								LinkTrafficLevels
 								-----------------
 								This component is a MultiDiscrete space showing the traffic flow levels on the links in the network, after applying a threshold to convert it from a continuous to a discrete value.
 								The number of bins can be customised with 5 being the default. It has the following strucutre:
 								.. code-block::
 								  [
 								    link1_status
 								    link2_status
 								    link3_status
 								  ]
 								Each ``link_status`` is a number from 0-4 representing the network load in relation to bandwidth.
-												Update docs on MultiDiscrete observation spaces.

											
										
										
											2023-05-30 16:54:34 +01:00
 								.. code-block::
 = No traffic (0%)
 = low traffic (<33%)
 = medium traffic (<66%)
 = high traffic (<100%)
 = max traffic/ overwhelmed (100%)
-												Update docs page on observations

											
										
										
											2023-06-01 21:42:34 +01:00
+								If the network has three links, the full observation space would have 3 elements. It can be written with ``gym`` notation to indicate the number of discrete options for each of the elements of the observation space. For example:
-												Update docs on MultiDiscrete observation spaces.

											
										
										
											2023-05-30 16:54:34 +01:00
 								.. code-block::
-												Update docs page on observations

											
										
										
											2023-06-01 21:42:34 +01:00
+								  gym.spaces.MultiDiscrete([5,5,5])
-												Update docs on MultiDiscrete observation spaces.

											
										
										
											2023-05-30 16:54:34 +01:00
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								Action Spaces
 								**************
 								The action space available to the blue agent comes in two types:
 . Node-based
 . Access Control List
-- applied changes raised during PR

											
										
										
											2023-06-06 13:12:28 +01:00
+. Any (Agent can take both node-based and ACL-based actions)
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
 								The choice of action space used during a training session is determined in the config_[name].yaml file.
 								**Node-Based**
-- updated the docs to reflect changes made to action space

											
										
										
											2023-06-06 11:57:04 +01:00
+								The agent is able to influence the status of nodes by switching them off, resetting, or patching operating systems and services. In this instance, the action space is an OpenAI Gym spaces.Discrete type, as follows:
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
-- updated the docs to reflect changes made to action space

											
										
										
											2023-06-06 11:57:04 +01:00
+								 * Dictionary item {... ,1: [x1, x2, x3,x4] ...}
 								   The placeholders inside the list under the key '1' mean the following:
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
-- updated the docs to reflect changes made to action space

											
										
										
											2023-06-06 11:57:04 +01:00
+								    * [0, num nodes] - Node ID (0 = nothing, node ID)
 								    * [0, 4] - What property it's acting on (0 = nothing, 1 = state, 2 = SoftwareState, 3 = service state, 4 = file system state)
 								    * [0, 3] - Action on property (0 = nothing, 1 = on / scan, 2 = off / repair, 3 = reset / patch / restore)
 								    * [0, num services] - Resolves to service ID (0 = nothing, resolves to service)
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
 								**Access Control List**
-- updated the docs to reflect changes made to action space

											
										
										
											2023-06-06 11:57:04 +01:00
+								The blue agent is able to influence the configuration of the Access Control List rule set (which implements a system-wide firewall). In this instance, the action space is an OpenAI spaces.Discrete type, as follows:
-- applied changes raised during PR

											
										
										
											2023-06-06 13:12:28 +01:00
+								   * Dictionary item {... ,1: [x1, x2, x3, x4, x5, x6] ...}
 								   The placeholders inside the list under the key '1' mean the following:
-- updated the docs to reflect changes made to action space

											
										
										
											2023-06-06 11:57:04 +01:00
 								     * [0, 2] - Action (0 = do nothing, 1 = create rule, 2 = delete rule)
 								     * [0, 1] - Permission (0 = DENY, 1 = ALLOW)
 								     * [0, num nodes] - Source IP (0 = any, then 1 -> x resolving to IP addresses)
 								     * [0, num nodes] - Dest IP (0 = any, then 1 -> x resolving to IP addresses)
 								     * [0, num services] - Protocol (0 = any, then 1 -> x resolving to protocol)
 								     * [0, num ports] - Port (0 = any, then 1 -> x resolving to port)
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
-- updated the docs to reflect changes made to action space

											
										
										
											2023-06-06 11:57:04 +01:00
+								**ANY**
 								The agent is able to carry out both **Node-Based** and **Access Control List** operations.
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
-- applied changes raised during PR

											
										
										
											2023-06-06 13:12:28 +01:00
+								This means the dictionary will contain key-value pairs in the format of BOTH Node-Based and Access Control List as seen above.
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
 								Rewards
 								*******
 								A reward value is presented back to the blue agent on the conclusion of every step. The reward value is calculated via two methods which combine to give the total value:
 . Node and service status
 . IER status
 								**Node and service status**
-												Ran pre-commit hook on all files and performed changes to fix flake8 failures

											
										
										
											2023-05-25 11:42:19 +01:00
+								On every step, the status of each node is compared against both a reference environment (simulating the situation if the red and blue agents had not impacted the environment)
 								and the before and after state of the environment. If the comparison against the reference environment shows no difference, then the score provided is "AllOK". If there is a
-												Initial commit of v1.0.0. Updated the .gitignore for the standard Python gitignore. Added Azure DevOps release pipeline for proper artifact release from the start.

											
										
										
											2023-03-28 17:33:34 +01:00
+								difference with respect to the reference environment, the before and after states are compared, and a score determined. See :ref:`config` for details of reward values.
 								**IER status**
 								On every step, the full IER set is examined to determine whether green and red agent IERs are being permitted to run. Any red agent IERs running incur a penalty; any green agent
 								IERs not permitted to run also incur a penalty. See :ref:`config` for details of reward values.
 								Future Enhancements
 								*******************
 								The PrimAITE project has an ambition to include the following enhancements in future releases:
 								* Integration with a suitable standardised framework to allow multi-agent integration
 								* Integration with external threat emulation tools, either using off-line data, or integrating at runtime
 								* Provision of data such that agents can construct alternative observation spaces (as an alternative to the default PrimAITE observation space)