TaskEnvironment #49
Merged
TomGeorge1234 merged 67 commits into RatInABox-Lab:dev from SynapticSage:task_environments on May 30, 2023
Conversation
… some function structure.
…rminate the episode
…ode terminates/refreshes.
…ce yet, no reward func
…ction_space gym.Dict for the environment.
…ion)`. added `reward_space`.
…for object types. Small fixes.
… by default have certain things like agents and environments that they can gym.render(). The SpatialGoal env merely adds an ability at the end of that pipeline to render the spatial goal.
…ions now for distance. option added to teleport agent to new location at end of each episode.
…ize curve. Size and color, however, change dynamically in plot_trajectory, so blitting will not faithfully reproduce them with only a set_offset() call. Added an _agent_style() static method that temporarily keeps that styling similar (dynamically recomputing color and size). This means some changes there will not fully propagate here until Agent has its own render() method.
…itialize causes regression when the second agent appears. I may have to purely mimic the style and not initialize with it for now.
…naling logic. Solved a bug where rewards need to copy the default object instead of referencing it.
…de counter or write new episode data
…goal which allow calling set() on a collection of goals. goalcache returns the full set of unique goals with get_goals() or agent-specific goals with get_agent_goals() (a rough sketch of this interface follows the commit list).
…oo's official test suite, too. made some changes in order to pass the tests -- a pettingzoo env requires a list of active, non-terminated agents, env.agents: list[str]
… field is now TaskEnvironments.Ags. Also simplified the goal_cache/taskenvironment relation: it is less for a programmer to learn and a little cleaner. helpful changes to tutorials. test_taskenvironment refactors for breaking changes from dev.
…put of reset() to match new API and set setup.cfg requirement
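As a very rough illustration of the goal-cache interface described in the commits above, here is a minimal sketch. It is hypothetical, not the RatInABox implementation: only the method names set(), get_goals(), and get_agent_goals() come from the commit messages; the class body and storage scheme are assumptions used purely to show the intended behaviour.

```python
# Hypothetical sketch of the goal-cache behaviour described in the commits;
# not the actual RatInABox implementation. A cache maps agents to goals,
# can set() a collection of goals at once, and exposes either the full set
# of unique goals or the goals belonging to a single agent.
from collections import defaultdict


class GoalCache:
    def __init__(self):
        # agent name -> list of goals currently assigned to that agent
        self._agent_goals = defaultdict(list)

    def set(self, goals, agents=("agent_0",)):
        """Assign a collection of goals, optionally restricted to some agents."""
        for agent in agents:
            self._agent_goals[agent].extend(goals)

    def get_goals(self):
        """Return the full set of unique goals across all agents."""
        seen, unique = set(), []
        for goals in self._agent_goals.values():
            for goal in goals:
                if id(goal) not in seen:
                    seen.add(id(goal))
                    unique.append(goal)
        return unique

    def get_agent_goals(self, agent):
        """Return only the goals assigned to a specific agent."""
        return list(self._agent_goals[agent])
```

The design point suggested by the commits is that the environment talks to a single cache rather than juggling per-agent goal lists, which is what "simplified the goal_cache/taskenvironment relation" refers to.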
Excellent work, very excited to give this a go! Feel free to keep the PRs coming as/when you change or improve things. Thanks again for the excellent work here and for making it public; I think this will benefit a lot of users.
This PR introduces a TaskEnvironment that inherits from ratinabox.Environment and pettingzoo's env. The main goal is to provide a streamlined interface for encoding and executing tasks and to improve the integration of RIB with reinforcement learning tools compatible with Gym APIs, such as Stable Baselines3 and RLlib.
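The kind of interaction loop this aims to enable is sketched below. The commented-out import path, the run_episode helper, and the random policy are assumptions for illustration only, not the merged API; what the sketch shows is the general pattern the PR targets: a Gymnasium-style reset()/step() cycle over the active agents listed in env.agents.

```python
# Illustrative Gym/PettingZoo-style interaction loop. Import path and
# constructor are assumptions and may not match the merged code -- this only
# demonstrates the style of API the PR aims to provide.
import numpy as np

# hypothetical import; the actual module/class names may differ
# from ratinabox.contribs.TaskEnvironment import SpatialGoalEnvironment


def run_episode(env, policy, max_steps=1000):
    """Drive a PettingZoo parallel-style multi-agent environment for one episode."""
    observations, infos = env.reset()  # new Gymnasium-style reset API: (obs, info)
    for _ in range(max_steps):
        # env.agents is the list of active, non-terminated agent names
        actions = {agent: policy(observations[agent]) for agent in env.agents}
        observations, rewards, terminateds, truncateds, infos = env.step(actions)
        if not env.agents:  # all agents terminated or truncated
            break


def random_policy(observation):
    """Toy policy: small random 2D velocity action (assumed action format)."""
    return np.random.randn(2) * 0.1
```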