PrimAITE

Author	SHA1	Message	Date
SunilSamra	257be9532f	#901 - Changed num_eval_steps back to 1 in ppo_seeded_training_config.yaml	2023-07-17 15:54:15 +01:00
Sunil Samra	dd21f9440f	Apply suggestions from code review	2023-07-17 14:21:37 +00:00
SunilSamra	2526427f2f	#901 - Fixed bug in implicit rule - comparing it to string ALLOW or DENY in access_control_list.py	2023-07-17 13:58:06 +01:00
SunilSamra	78d7f39342	#901 - Removed flatten from training configs - Added flatten operation in observations.py when there are multiple obs components - Updated config.rst docs	2023-07-17 13:44:16 +01:00
SunilSamra	da20c0e9e6	#901 - Removed bool apply_implicit_rule - Set default implicit_rule to EXPLICIT DENY - Added position to ACLs in laydown configs - Removed apply_implicit_rule from training configs	2023-07-17 13:00:58 +01:00
SunilSamra	707d8f6189	#901 - Added check in access_control_list.py which sets implicit permission to NA if boolean is False - Changed the defaults in training_config.py so that each scenario has an EXPLICIT ALLOW rule as default implicit rule - Updated the test_seeding_and_deterministic_session.py because of change no2 adds an extra rule to that scenario	2023-07-17 10:27:56 +01:00
SunilSamra	9df8d132fc	#901 - added to config.rst and added new ACL main config options	2023-07-17 10:08:12 +01:00
SunilSamra	8aa71c3ff8	#901 - amended comment in observations.py	2023-07-14 16:04:13 +01:00
SunilSamra	1b6244d13f	#901 - amended comment in training_config_main.yaml	2023-07-14 15:49:18 +01:00
SunilSamra	661c865108	#901 - - Added comments in access_control_list.py - Changed obs_shape to max_number_acl_rules from max_number_acl_rules + 1 as index starts from 1 - Commented episode and step print line from test_single_action_space.py	2023-07-14 15:27:37 +01:00
SunilSamra	eb75d15722	901 - Added another test and tidied up comments in test_observation_space.py and tidied up comments in observations.py	2023-07-14 14:51:26 +01:00
Chris McCarthy	f9c7cafe87	#901 - Dropped temp_primaite_sessiion_2 from conftest.py. - Re-added the hard-coded mean rewards per episode values from a rpe-trained agent to the deterministic test in test_seeding_and_deterministic_session.py - Partially tidies up some tests in test_observation_space.py; Still some work to be done on this at a later date.	2023-07-14 14:13:11 +01:00
SunilSamra	e743b2380c	901 - fixed test_observation_space.py, added test fixture for test_seeding_and_deterministic_session.py and increased default max number of acls	2023-07-14 12:29:50 +01:00
SunilSamra	558223e8b6	901 - removed print statements and merged with dev	2023-07-13 17:14:59 +01:00
SunilSamra	77f717c649	Merge remote-tracking branch 'origin/dev' into feature/901-change-functionality-acl-rules	2023-07-13 16:48:02 +01:00
SunilSamra	0ab4dab72a	901 - fixed test_single_action_space.py test	2023-07-13 11:45:23 +01:00
SunilSamra	f8cb18c654	901 - changed acl current obs from list to numpy.array, changed default ACL list in training_config.py to FALSE, and tried to make test_seeding_and_deterministic_session.py test without fixed reward results	2023-07-13 11:04:11 +01:00
SunilSamra	06c20f6984	Merge remote-tracking branch 'origin/dev' into feature/901-change-functionality-acl-rules # Conflicts: # src/primaite/acl/access_control_list.py	2023-07-12 10:45:03 +01:00
SunilSamra	f817efdc69	901 - fixed how acls are added into list with new logic - agent cannot overwrite another acl in the list	2023-07-12 09:47:16 +01:00
SunilSamra	350b3db3f6	901 - changed implicit_acl_rule from str to enum name	2023-07-11 12:36:22 +01:00
SunilSamra	6b59ce960d	Merge remote-tracking branch 'origin/dev' into feature/1566-configure_episode-steps-learn-eval # Conflicts: # src/primaite/config/training_config.py	2023-07-11 11:39:21 +01:00
SunilSamra	563ff72fd6	1566 - fixed the test_training_config.py test file by removing num_steps from init	2023-07-10 13:24:34 +01:00
SunilSamra	921dc934c2	1566 - added correct num_train_episodes etc values to configs, fixed test_reward.py	2023-07-10 11:25:26 +01:00
Marek Wolan	bd6f9fc309	Merge remote-tracking branch 'origin/dev' into bugfix/1587-hardcoded-agent	2023-07-10 09:15:25 +01:00
Marek Wolan	91287f8666	Merge remote-tracking branch 'origin/dev' into feature/1572-fix-docs-formatting	2023-07-09 18:13:57 +01:00
Marek Wolan	605a5b4cd6	Merge remote-tracking branch 'origin/dev' into bugfix/1587-hardcoded-agent	2023-07-09 18:07:30 +01:00
Marek Wolan	17894376c6	Removed comment	2023-07-09 18:07:21 +01:00
Marek Wolan	677d12b550	Merged PR 106: Resolve TODOs about documenting functions ## Summary - Added type hints and docstrings to functions imported from ADSP. - Imported `get_relevant_rules` which was referenced but didn't exist. - Removed duplicated function definitions in `agents.utils` ## Test process The changes in this PR are almost exclusively cosmetic. I can confirm that after adding/removing functions, the unit tests passed fine. I was also able to run the Hardcoded node and ACL agents without problems. ## Checklist - [x] This PR is linked to a work item - [x] I have performed self-review of the code - [na] I have written tests for any new functionality added with this PR - [na] I have updated the documentation if this PR changes or adds functionality - [x] I have run pre-commit checks for code style Related work items: #1575	2023-07-07 15:10:44 +00:00
Chris McCarthy	40381833d3	#1566 - Refactored the test_train_eval_episode_steps.py to sue TempPrimaiteSession. - Fixed all errors that were caused b fixing the above. - Some tests still fail, these are for SS to fix. - Dropped the old run_generic stuff from conftest.py	2023-07-07 15:50:14 +01:00
SunilSamra	35b481a2f3	Merge remote-tracking branch 'origin/dev' into feature/901-change-functionality-acl-rules	2023-07-07 15:14:05 +01:00
Chris McCarthy	d49f73f139	Merge remote-tracking branch 'origin/dev' into 1566-configure-episode-steps-learn-eval # Conflicts: # src/primaite/agents/rllib.py	2023-07-07 14:34:20 +01:00
SunilSamra	e03c29b921	1566 - added test file and edited configs to include types of num steps and modifed agents to use correct step and episode counts	2023-07-07 14:13:47 +01:00
Marek Wolan	7e0eee5d73	Merge remote-tracking branch 'origin/dev' into feature/1572-fix-docs-formatting	2023-07-07 10:30:11 +01:00
Marek Wolan	f4b98542b6	Standardise docstring summary line placement.	2023-07-07 10:28:00 +01:00
Czar Echavez	04e52453b1	Merge branch 'dev' into feature/1386-enable-a-repeatable-or-deterministic-baseline-test	2023-07-06 22:22:37 +01:00
Christopher McCarthy	207601b81f	Merged PR 109: Auto save agent at end of training ## Summary * Made RLlib and SB3 agents save at the end of each learning session by default using a common file naming format. Also now agents only checkpoint every n and not on the final episode. ## Test process Tests saved agent file in the test_primaite_session test. ## Checklist - [X] This PR is linked to a work item* - [X] I have performed self-review of the code - [X] I have written tests for any new functionality added with this PR - [ ] I have updated the documentation if this PR changes or adds functionality - [X] I have run pre-commit checks for code style Related work items: #1593	2023-07-06 16:29:48 +00:00
Marek Wolan	86725064ec	Added docstrings to class intialisers	2023-07-06 16:08:51 +01:00
Chris McCarthy	c9f4741655	#1593 - Ran pre-commit hook	2023-07-06 14:18:49 +01:00
Chris McCarthy	159d47fd6c	#1963 - Made RLlib and SB3 agents save at the end of each learning session by default using a common file naming format. Also now agents only checkpoint every n and not on the final episode	2023-07-06 13:56:12 +01:00
Marek Wolan	c5d7d55747	Change reward to float and divide by 10000	2023-07-06 12:52:14 +01:00
Czar Echavez	99f1f7cfc1	#1386 : remove setting of global seed + running pre-commit checks	2023-07-06 12:10:26 +01:00
Chris McCarthy	3438ce7e09	#1386 - Updated tests in test_seeding_and_deterministic_session.py to use TempPrimaiteSession. - Added test_seeded_learning test and test_deterministic_evaluation test. - Passed config values seed and deterministic to ppo agent - Dropped deterministic override in evaluate functions - TempPrimaiteSession now writes files to a UUID folder rather than datetime - Added seed to Ray RLlib agent setup in rllib.py - Added seed to SB3 agent setup in sb3.py	2023-07-06 11:35:44 +01:00
SunilSamra	4371ca13fc	1566 - added train_episodes, train_steps, eval_episodes and eval_steps to training_config_main.yaml	2023-07-06 11:12:51 +01:00
SunilSamra	f651937759	901 - changed how acl rules are added to access control list and added structure to AccessControlList observation	2023-07-06 11:07:21 +01:00
Marek Wolan	e174db5d9e	Rescaled default rewards by a factor of 1/10000	2023-07-06 10:51:34 +01:00
Marek Wolan	87bdaa1ec3	Updated documentation	2023-07-06 10:34:27 +01:00
Marek Wolan	c38dda34b9	Removed duplicated function definitions	2023-07-06 10:23:14 +01:00
Chris McCarthy	8faf9d70a0	temp	2023-07-06 10:07:54 +01:00
Marek Wolan	b426d5802e	Updated docstrings	2023-07-05 16:46:23 +01:00
Marek Wolan	5c167293e3	Add docstrings and type hints.	2023-07-05 16:19:43 +01:00

1 2 3 4 5

218 Commits