PrimAITE

Author	SHA1	Message	Date
Marek Wolan	07a0581fce	Merge remote-tracking branch 'origin/dev' into feature/1572-fix-docs-formatting	2023-07-09 18:13:57 +01:00
Marek Wolan	72aef78391	Merged PR 106: Resolve TODOs about documenting functions ## Summary - Added type hints and docstrings to functions imported from ADSP. - Imported `get_relevant_rules` which was referenced but didn't exist. - Removed duplicated function definitions in `agents.utils` ## Test process The changes in this PR are almost exclusively cosmetic. I can confirm that after adding/removing functions, the unit tests passed fine. I was also able to run the Hardcoded node and ACL agents without problems. ## Checklist - [x] This PR is linked to a work item - [x] I have performed self-review of the code - [na] I have written tests for any new functionality added with this PR - [na] I have updated the documentation if this PR changes or adds functionality - [x] I have run pre-commit checks for code style Related work items: #1575	2023-07-07 15:10:44 +00:00
Marek Wolan	1d1f3f2403	Merge remote-tracking branch 'origin/dev' into feature/1572-fix-docs-formatting	2023-07-07 10:30:11 +01:00
Marek Wolan	5618283cc5	Standardise docstring summary line placement.	2023-07-07 10:28:00 +01:00
Czar Echavez	4acb220c0c	Merged PR 89: #1386 Enable a repeatable/deterministic baseline test ## Summary - Added the fix from #1535 with minor changes to make sure that the `primaite_env.step()` function can properly parse the action - added the config deterministic and seed to training config - added the deterministic and seed to the Training config class, with defaults `False` and `None` respectively - minor fix to `primaite_env.close()` function so that it now works ## Test process Added e2e tests for generic, ppo and a2c which evaluates a trained agent twice to make sure that the seeding and deterministic action works ## Checklist - [x] This PR is linked to a work item - [x] I have performed self-review of the code - [x] I have written tests for any new functionality added with this PR - [x] I have updated the documentation if this PR changes or adds functionality - [x] I have run pre-commit checks for code style #1386: added the ability to set deterministic and seeding RNG when training and evaluating + the fix provided in #1535 Related work items: #1386, #1535	2023-07-07 09:22:47 +00:00
Czar Echavez	76997f403e	Merge branch 'dev' into feature/1386-enable-a-repeatable-or-deterministic-baseline-test	2023-07-06 22:22:37 +01:00
Christopher McCarthy	5ac196b3cb	Merged PR 109: Auto save agent at end of training ## Summary * Made RLlib and SB3 agents save at the end of each learning session by default using a common file naming format. Also now agents only checkpoint every n and not on the final episode. ## Test process Tests saved agent file in the test_primaite_session test. ## Checklist - [X] This PR is linked to a work item* - [X] I have performed self-review of the code - [X] I have written tests for any new functionality added with this PR - [ ] I have updated the documentation if this PR changes or adds functionality - [X] I have run pre-commit checks for code style Related work items: #1593	2023-07-06 16:29:48 +00:00
Marek Wolan	87710ec22b	Merged PR 108: Divide default rewards by 10000 ## Summary As per the discussion this morning, this PR reimplements changes that were made by ADSP to make the default rewards smaller. This also adds float type hints for rewards. ## Test process I checked that sessions are able to run and that they report values similar to what we are used to but smaller by a factor of 10000. I did not change the reward values in the integration test configs, and the tests still pass. ## Checklist - [x] This PR is linked to a work item - [x] I have performed self-review of the code - [x] I have written tests for any new functionality added with this PR - [x] I have updated the documentation if this PR changes or adds functionality - [x] I have run pre-commit checks for code style Related work items: #889, #1586	2023-07-06 15:17:47 +00:00
Marek Wolan	653d76ec62	Added docstrings to class initialisers	2023-07-06 16:08:51 +01:00
Marek Wolan	9167816896	Removed reference to file that no longer exists	2023-07-06 15:18:49 +01:00
Marek Wolan	8d466accf5	Add __init__ to class special members doc	2023-07-06 15:18:33 +01:00
Marek Wolan	eb068e22b6	undeleted api (lol)	2023-07-06 15:05:39 +01:00
Marek Wolan	70bde700b7	Deleted icon	2023-07-06 15:04:46 +01:00
Chris McCarthy	ddabf991ce	#1593 - Ran pre-commit hook	2023-07-06 14:18:49 +01:00
Chris McCarthy	fc98441a11	#1593 - Check that agent saved file exists	2023-07-06 14:13:02 +01:00
Chris McCarthy	1e7f5b62f3	#1963 - Made RLlib and SB3 agents save at the end of each learning session by default using a common file naming format. Also now agents only checkpoint every n and not on the final episode	2023-07-06 13:56:12 +01:00
Czar Echavez	08220ff6ea	#1386 : remove redundant config files + test fixtures + fixing deterministic and seed config description in documentation to avoid misunderstandings	2023-07-06 13:27:44 +01:00
Marek Wolan	33f6f8bc34	Updated rewards type description in docs	2023-07-06 12:56:24 +01:00
Marek Wolan	dd8593e489	Change reward to float and divide by 10000	2023-07-06 12:52:14 +01:00
Czar Echavez	bb9bfc50a5	#1386 : remove setting of global seed + running pre-commit checks	2023-07-06 12:10:26 +01:00
Chris McCarthy	a35c363345	#1386 - Updated tests in test_seeding_and_deterministic_session.py to use TempPrimaiteSession. - Added test_seeded_learning test and test_deterministic_evaluation test. - Passed config values seed and deterministic to ppo agent - Dropped deterministic override in evaluate functions - TempPrimaiteSession now writes files to a UUID folder rather than datetime - Added seed to Ray RLlib agent setup in rllib.py - Added seed to SB3 agent setup in sb3.py	2023-07-06 11:35:44 +01:00
Marek Wolan	30b08fd48b	Rescaled default rewards by a factor of 1/10000	2023-07-06 10:51:34 +01:00
Marek Wolan	d9394d274d	Updated documentation	2023-07-06 10:34:27 +01:00
Marek Wolan	b9549497d2	Removed duplicated function definitions	2023-07-06 10:23:14 +01:00
Chris McCarthy	f92d2fb65d	temp	2023-07-06 10:07:54 +01:00
Marek Wolan	ead02ed691	Updated docstrings	2023-07-05 16:46:23 +01:00
Marek Wolan	013dcb94a8	Add docstrings and type hints.	2023-07-05 16:19:43 +01:00
Marek Wolan	d598ffaa65	Merge branch 'bugfix/1587-hardcoded-agent' into feature/1575-docstring-param-desc	2023-07-05 15:22:13 +01:00
Czar Echavez	0068092d8b	#1386 : remove unneeded configs + setting the seed globally + temp test	2023-07-05 15:02:41 +01:00
Marek Wolan	8f4e8bf538	typo	2023-07-05 14:50:03 +01:00
Marek Wolan	5f98c9b1bd	Fix minor typos in docstrings	2023-07-05 14:13:43 +01:00
Marek Wolan	376ff9f597	Imported ADSP function for ACL	2023-07-05 14:10:52 +01:00
Marek Wolan	0664389bdc	Changed hardcoded agent helper for new obs space	2023-07-05 13:58:46 +01:00
Czar Echavez	b0c83d7148	#1386 : fix saving of agent	2023-07-05 11:41:18 +01:00
Marek Wolan	247136ed6d	Merge branch 'feature/1572-fix-docs-formatting' of https://dev.azure.com/ma-dev-uk/PrimAITE/_git/PrimAITE into feature/1572-fix-docs-formatting	2023-07-05 10:14:20 +01:00
Marek Wolan	fa6dbd8338	Move class docstrings out of init function.	2023-07-05 10:14:16 +01:00
Marek Wolan	d7bf90e6f4	Updated access_control_list.py	2023-07-05 09:00:41 +00:00
Marek Wolan	b81c29d46e	Update some param descriptions for hardcoded agent	2023-07-05 09:54:50 +01:00
Marek Wolan	24a4f96ed0	Add blank lines at the end of file.	2023-07-05 09:22:49 +01:00
Marek Wolan	17b5c6bf92	Add missing module level docstrings.	2023-07-05 09:19:58 +01:00
Czar Echavez	818d64f330	#1386 : fix bug with agent zip file not being saved after run	2023-07-04 16:30:31 +01:00
Marek Wolan	544d8777ea	add module level docstrings	2023-07-04 13:11:06 +01:00
Marek Wolan	3aacd71a5e	remove primaite dependencies as it's autogenerated	2023-07-04 11:57:10 +01:00
Marek Wolan	5db8bd7c4c	Resolve remaining build warnings for docs	2023-07-04 11:34:36 +01:00
Marek Wolan	9244c160b1	Format docstrings	2023-07-04 11:11:52 +01:00
Marek Wolan	91273b2f99	fix formatting on Observation docs	2023-07-04 10:57:00 +01:00
Marek Wolan	2bcaf79a51	Add Favicon	2023-07-04 10:55:07 +01:00
Czar Echavez	c7de7bf21b	Merge branch 'dev' into feature/1386-enable-a-repeatable-or-deterministic-baseline-test	2023-07-04 09:41:07 +01:00
Christopher McCarthy	6006f022a1	Merged PR 101: Integrate ADSP RLlib and use PrimaiteSession for running between agent frameworks ## Summary * Brought over the RLlib, hardcoded agents, and simple agents from ADSP 1.1.0. This opened a can of worms... ADSP got their stuff working in notebooks (*_stares at data scientists!_ 😂) but hadn't integrated it into the PrimAITE package or made the other PrimAITE functionality work with it. * RLlib agents have been fully integrated with the wider PrimAITE package. This was done by: * The creation of the `AgentSessionABC` and `HardCodedAgentSessionABC` classes. * `SB3Agent` and `RLlibAgent` classes then inherited from `AgentSessionABC`. * The ADSP hardcoded agents were integrated into subclasses of `HardCodedAgentSessionABC`. * The random and dummy agents were also integrated into subclasses of `HardCodedAgentSessionABC`. * A set of session output directories was created and managed by the agent session to enable consistent storage of session outputs in a common format regardless of the agent type. * The main config was refactored so that it had * agent_framework - To identify whether SB3, RLlib, or Custom. * agent_identifier - To identify whether PPO, A2C, hardcoded, random, or dummy. * deep_learning_framework - To identify which framework to use for RLlib. * Transactions have been overhauled to simplify the process. It also means that they're written in real time so they're not lost if the agent crashes. * Tests completely overhauled to use `PrimaiteSession`, or at least a test subclass, `TempPrimaiteSession`. It's temp because it uses a temp directory rather than the main primaite session directory, and it cleans up after itself. * All the crap removed from `main.py` and made it so that it just runs `PrimaiteSession`. Now this is where I went off on a tangent... * CLI added to just make my life and everyone else's life easier. * Primaite app config added to hold things like logging format, levels etc. * A `primaite.data_viz.session_plots` module added so that the average reward per episode for each session is plotted and saved (this helped while we were testing and bug fixing). ## Test process * All tests use `TempPrimaiteSession`, which uses `PrimaiteSession`. * I still need to write a test that runs the RLlib, hardcoded, and random/dummy agents. I'll do that now while this is being reviewed. ## Still to do * Update docs. I'm getting this PR up now so we can get it in to make use of the features. I'll get the docs updated today either on this branch or another branch (depending on how long this review takes). ## Checklist - [X] This PR is linked to a work item - [X] I have performed self-review of the code - [X] I have written tests for any new functionality added with this PR - [ ] I have updated the documentation if this PR changes or adds functionality - [X] I have run pre-commit checks for code style Related work items: #917, #1563	2023-07-04 08:08:31 +00:00
Chris McCarthy	27e22edaf1	#917 - Reinstalled the pre-commit hook	2023-07-03 20:40:38 +01:00

256 Commits