PrimAITE

Author	SHA1	Message	Date
Marek Wolan	47d7e9f3f6	Merged PR 104: Fix formatting in docstrings ## Summary Fixes some incorrectly formatted documentation, such as in the observation module. Also adds some missing module-level docstrings. Also adds a PrimAITE Favicon to docs. Removed Primaite-dependencies.rst as it's autogenerated. ## Test process Purely cosmetic, so functionality not tested. I did render the HTML output to observe that some mistakes have been fixed. ## Checklist - [x] This PR is linked to a work item - [x] I have performed self-review of the code - [na] I have written tests for any new functionality added with this PR - [x] I have updated the documentation if this PR changes or adds functionality - [ ] I have run pre-commit checks for code style Related work items: #1572	2023-07-10 08:14:08 +00:00
Marek Wolan	0145532103	Update docs	2023-07-09 20:23:53 +01:00
Marek Wolan	91287f8666	Merge remote-tracking branch 'origin/dev' into feature/1572-fix-docs-formatting	2023-07-09 18:13:57 +01:00
Marek Wolan	605a5b4cd6	Merge remote-tracking branch 'origin/dev' into bugfix/1587-hardcoded-agent	2023-07-09 18:07:30 +01:00
Marek Wolan	17894376c6	Removed comment	2023-07-09 18:07:21 +01:00
Marek Wolan	9d49406df6	Merge remote-tracking branch 'origin/dev' into feature/1596-better-observation-docs	2023-07-09 18:05:13 +01:00
Brian Kanyora	23adc740cd	Resolved more syntax errors	2023-07-07 16:32:35 +01:00
SunilSamra	41fab6562e	1566 - updated configs to correct values of step count and number of episodes	2023-07-07 16:26:12 +01:00
Brian Kanyora	752a611b89	Fixed the rst syntax	2023-07-07 16:25:55 +01:00
Marek Wolan	677d12b550	Merged PR 106: Resolve TODOs about documenting functions ## Summary - Added type hints and docstrings to functions imported from ADSP. - Imported `get_relevant_rules` which was referenced but didn't exist. - Removed duplicated function definitions in `agents.utils` ## Test process The changes in this PR are almost exclusively cosmetic. I can confirm that after adding/removing functions, the unit tests passed fine. I was also able to run the Hardcoded node and ACL agents without problems. ## Checklist - [x] This PR is linked to a work item - [x] I have performed self-review of the code - [na] I have written tests for any new functionality added with this PR - [na] I have updated the documentation if this PR changes or adds functionality - [x] I have run pre-commit checks for code style Related work items: #1575	2023-07-07 15:10:44 +00:00
Chris McCarthy	40381833d3	#1566 - Refactored the test_train_eval_episode_steps.py to use TempPrimaiteSession. - Fixed all errors that were caused by fixing the above. - Some tests still fail, these are for SS to fix. - Dropped the old run_generic stuff from conftest.py	2023-07-07 15:50:14 +01:00
SunilSamra	35b481a2f3	Merge remote-tracking branch 'origin/dev' into feature/901-change-functionality-acl-rules	2023-07-07 15:14:05 +01:00
Chris McCarthy	d49f73f139	Merge remote-tracking branch 'origin/dev' into 1566-configure-episode-steps-learn-eval # Conflicts: # src/primaite/agents/rllib.py	2023-07-07 14:34:20 +01:00
Marek Wolan	d7bf678b1f	Reworded observation description	2023-07-07 14:24:37 +01:00
SunilSamra	e03c29b921	1566 - added test file and edited configs to include types of num steps and modified agents to use correct step and episode counts	2023-07-07 14:13:47 +01:00
Marek Wolan	bbb305d561	Update observation space documentation	2023-07-07 13:52:14 +01:00
Brian Kanyora	4ef7831bfa	Added a getting started file	2023-07-07 11:37:57 +01:00
Marek Wolan	7e0eee5d73	Merge remote-tracking branch 'origin/dev' into feature/1572-fix-docs-formatting	2023-07-07 10:30:11 +01:00
Marek Wolan	f4b98542b6	Standardise docstring summary line placement.	2023-07-07 10:28:00 +01:00
Czar Echavez	036e0fe342	Merged PR 89: #1386 Enable a repeatable/deterministic baseline test ## Summary - Added the fix from #1535 with minor changes to make sure that the `primaite_env.step()` function can properly parse the action - added the config deterministic and seed to training config - added the deterministic and seed to the Training config class, with defaults `False` and `None` respectively - minor fix to `primaite_env.close()` function so that it now works ## Test process Added e2e tests for generic, ppo and a2c which evaluates a trained agent twice to make sure that the seeding and deterministic action works ## Checklist - [x] This PR is linked to a work item - [x] I have performed self-review of the code - [x] I have written tests for any new functionality added with this PR - [x] I have updated the documentation if this PR changes or adds functionality - [x] I have run pre-commit checks for code style #1386: added the ability to set deterministic and seeding RNG when training and evaluating + the fix provided in #1535 Related work items: #1386, #1535	2023-07-07 09:22:47 +00:00
Czar Echavez	04e52453b1	Merge branch 'dev' into feature/1386-enable-a-repeatable-or-deterministic-baseline-test	2023-07-06 22:22:37 +01:00
Christopher McCarthy	207601b81f	Merged PR 109: Auto save agent at end of training ## Summary * Made RLlib and SB3 agents save at the end of each learning session by default using a common file naming format. Also now agents only checkpoint every n and not on the final episode. ## Test process Tests saved agent file in the test_primaite_session test. ## Checklist - [X] This PR is linked to a work item* - [X] I have performed self-review of the code - [X] I have written tests for any new functionality added with this PR - [ ] I have updated the documentation if this PR changes or adds functionality - [X] I have run pre-commit checks for code style Related work items: #1593	2023-07-06 16:29:48 +00:00
Marek Wolan	3a75ed8ccc	Merged PR 108: Divide default rewards by 10000 ## Summary As per the discussion this morning, this PR reimplements changes that were made by ADSP to make the default rewards smaller. This also adds type hints for rewards as floats. ## Test process I checked that sessions are able to run and that they report values similar to what we are used to but smaller by a factor of 10000. I did not change the reward values in the integration test configs, and the tests still pass. ## Checklist - [x] This PR is linked to a work item - [x] I have performed self-review of the code - [x] I have written tests for any new functionality added with this PR - [x] I have updated the documentation if this PR changes or adds functionality - [x] I have run pre-commit checks for code style Related work items: #889, #1586	2023-07-06 15:17:47 +00:00
Marek Wolan	86725064ec	Added docstrings to class initialisers	2023-07-06 16:08:51 +01:00
Marek Wolan	2a08d3a2a5	Removed reference to file that no longer exists	2023-07-06 15:18:49 +01:00
Marek Wolan	82a5122276	Add __init__ to class special members doc	2023-07-06 15:18:33 +01:00
Marek Wolan	4c03aaee24	undeleted api (lol)	2023-07-06 15:05:39 +01:00
Marek Wolan	1ade92f55c	Deleted icon	2023-07-06 15:04:46 +01:00
Chris McCarthy	c9f4741655	#1593 - Ran pre-commit hook	2023-07-06 14:18:49 +01:00
Chris McCarthy	82d7c168fe	#1593 - Check that agent saved file exists	2023-07-06 14:13:02 +01:00
Chris McCarthy	159d47fd6c	#1963 - Made RLlib and SB3 agents save at the end of each learning session by default using a common file naming format. Also now agents only checkpoint every n and not on the final episode	2023-07-06 13:56:12 +01:00
Czar Echavez	46b44f9e23	#1386 : remove redundant config files + test fixtures + fixing deterministic and seed config description in documentation to avoid misunderstandings	2023-07-06 13:27:44 +01:00
Marek Wolan	3b91a99070	Updated rewards type description in docs	2023-07-06 12:56:24 +01:00
Marek Wolan	c5d7d55747	Change reward to float and divide by 10000	2023-07-06 12:52:14 +01:00
Czar Echavez	99f1f7cfc1	#1386 : remove setting of global seed + running pre-commit checks	2023-07-06 12:10:26 +01:00
Chris McCarthy	3438ce7e09	#1386 - Updated tests in test_seeding_and_deterministic_session.py to use TempPrimaiteSession. - Added test_seeded_learning test and test_deterministic_evaluation test. - Passed config values seed and deterministic to ppo agent - Dropped deterministic override in evaluate functions - TempPrimaiteSession now writes files to a UUID folder rather than datetime - Added seed to Ray RLlib agent setup in rllib.py - Added seed to SB3 agent setup in sb3.py	2023-07-06 11:35:44 +01:00
SunilSamra	4371ca13fc	1566 - added train_episodes, train_steps, eval_episodes and eval_steps to training_config_main.yaml	2023-07-06 11:12:51 +01:00
SunilSamra	f651937759	901 - changed how acl rules are added to access control list and added structure to AccessControlList observation	2023-07-06 11:07:21 +01:00
Marek Wolan	e174db5d9e	Rescaled default rewards by a factor of 1/10000	2023-07-06 10:51:34 +01:00
Marek Wolan	87bdaa1ec3	Updated documentation	2023-07-06 10:34:27 +01:00
Marek Wolan	c38dda34b9	Removed duplicated function definitions	2023-07-06 10:23:14 +01:00
Chris McCarthy	8faf9d70a0	temp	2023-07-06 10:07:54 +01:00
Marek Wolan	b426d5802e	Updated docstrings	2023-07-05 16:46:23 +01:00
Marek Wolan	5c167293e3	Add docstrings and type hints.	2023-07-05 16:19:43 +01:00
Marek Wolan	0ae7158859	Merge branch 'bugfix/1587-hardcoded-agent' into feature/1575-docstring-param-desc	2023-07-05 15:22:13 +01:00
Czar Echavez	713225b432	#1386 : remove unneeded configs + setting the seed globally + temp test	2023-07-05 15:02:41 +01:00
Marek Wolan	7482aead76	typo	2023-07-05 14:50:03 +01:00
Marek Wolan	f62b2aef1c	Fix minor typos in docstrings	2023-07-05 14:13:43 +01:00
Marek Wolan	171b5cb58e	Imported ADSP function for ACL	2023-07-05 14:10:52 +01:00
Marek Wolan	b3d4eb4ec0	Changed hardcoded agent helper for new obs space	2023-07-05 13:58:46 +01:00

493 Commits