74 Policy evaluation and training cli (rllib) #85

chenkins · 2024-11-01T08:36:25Z

Changes

extract ml dependencies (torch, gym, ray, etc.) to optional dependencies in pyproject.toml, new requirements-ml.txt and slimmer requirements[-dev].txt, keep core gym free.
accordingly, move corresponding code to new flatland.ml module
accordingly, move corresponding tests to tests.ml (tests becomes a Python module)
preparations for rllib (ray.io) and gym(nasium) compatibility #73 / 73 Get Pettingzoo example to work again. #102 (cleanup of flatland.contrib s)

Related issues

Closes #74.
Fixes #23.
Closes #75

Checklist

Tests are included for relevant behavior changes.
Documentation is added in the docs folder for relevant behavior changes. If you made important user-facing
changes, describe them under the [Unreleased] tag in CHANGELOG.md.
New package dependencies are declared in the pyproject.toml file.
Requirement files have been updated by running tox -e requirements.
Code works with all supported Python versions (3.8, 3.9 and 3.10). Checks run with all three version and are
required to run successfully.
Code is formatted according to PEP 8 (an IDE like PyCharm can do this for you).
Technical guidelines listed in CONTRIBUTING.md are followed.

chenkins · 2024-11-01T08:45:08Z

flatland/envs/rail_env.py

@@ -192,8 +192,6 @@ def __init__(self,
        self.num_resets = 0
        self.distance_map = DistanceMap(self.agents, self.height, self.width)

-        self.action_space = [5]


Is this removal safe? Remove from flatland.core.Environment.env as well or redefine there?

What does mean removal safe - remove from ... ? You like to delete the file or just action_space ?

flatland/ml/observations/flatten_tree_observation_for_rail_env.py

tests/test_flatland_malfunction.py

flatland/ml/observations/flatten_tree_observation_for_rail_env.py

chenkins · 2025-01-10T13:58:10Z

flatland/ml/ray/ray_multi_agent_rail_env.py

+            self.env_renderer = RenderTool(wrap)
+
+        self.action_space: gym.spaces.Dict = spaces.Dict({
+            # TODO document why str is necessary - is it?


TODO document why str is necessary - is it?

str(i) === i don't understand this !!!

Do you like to get for each agent a Discrete(5) : |agents| x |Actions space|

chenkins · 2025-01-10T13:58:36Z

tests/ml/ray/examples/test_flatland_training_with_parameter_sharing.py

+            # TODO dqn not working:  rewards = scipy.signal.lfilter([1], [1, -gamma], raw_rewards[::-1], axis=0)[
+            #                                                      ~~~~~~~~~~~^^^^^^
+            # TypeError: unhashable type: 'slice'
+            # "DQN",


TODO dqn not working

Co-authored-by: Serge Croisé <[email protected]>

manuschn · 2025-01-24T13:28:02Z

flatland/env_generation/env_creator.py

+
+
+# defaults from Flatland 3 Round 2 Test_0, see https://flatland.aicrowd.com/challenges/flatland3/envconfig.html
+def env_creator(n_agents=7,


maybe use env_generator to use same naming as for rail and line generators?

manuschn · 2025-01-24T13:31:40Z

flatland/ml/ray/wrappers.py

+    return RayMultiAgentWrapper(wrap, render_mode)
+
+
+def ray_env_creator(render_mode: Optional[str] = None, **kwargs) -> RayMultiAgentWrapper:


same questions as above regarding naming _creator or _generator

aiAdrian · 2025-02-03T12:16:11Z

flatland/envs/rail_env.py

@@ -192,8 +192,6 @@ def __init__(self,
        self.num_resets = 0
        self.distance_map = DistanceMap(self.agents, self.height, self.width)

-        self.action_space = [5]


What does mean removal safe - remove from ... ? You like to delete the file or just action_space ?

aiAdrian · 2025-02-03T12:21:47Z

flatland/ml/ray/ray_multi_agent_rail_env.py

+            self.env_renderer = RenderTool(wrap)
+
+        self.action_space: gym.spaces.Dict = spaces.Dict({
+            # TODO document why str is necessary - is it?


str(i) === i don't understand this !!!

Do you like to get for each agent a Discrete(5) : |agents| x |Actions space|

chenkins changed the base branch from main to python-base-version-310 November 1, 2024 08:37

chenkins commented Nov 1, 2024

View reviewed changes

flatland/ml/observations/flatten_tree_observation_for_rail_env.py Outdated Show resolved Hide resolved

Base automatically changed from python-base-version-310 to main November 18, 2024 14:36

chenkins changed the title ~~74 policy evaluation and training cli~~ 74 Policy evaluation and training cli (rllib) Nov 20, 2024

chenkins added 7 commits December 4, 2024 14:13

Add flatland training cli with ray.

68f7394

Fix error from ray passive_env_checker.

f62da96

Fix flattened gym observation builders.

6f36350

Unit test for (gym) observation builder returned (d)types sizes/shapes.

63a65f3

Extract ray examples.

7e682de

Update TODO.

6abc591

Add ray address cli param.

3f903ca

chenkins force-pushed the 74-policy-evaluation-and-training-cli branch from daec963 to 3f903ca Compare December 4, 2024 13:13

SergeCroise reviewed Dec 4, 2024

View reviewed changes

tests/test_flatland_malfunction.py Outdated Show resolved Hide resolved

SergeCroise reviewed Dec 4, 2024

View reviewed changes

tests/test_flatland_malfunction.py Outdated Show resolved Hide resolved

SergeCroise reviewed Dec 4, 2024

View reviewed changes

tests/test_flatland_malfunction.py Outdated Show resolved Hide resolved

chenkins added 14 commits December 6, 2024 09:52

Cleanup training cli example.

3fe01dc

Create test stub for env_creator.

e9d389b

Mark training test as slow.

ecf6deb

Gym obs builder unit tests.

5c7a17e

Code cleanup gym obs builders.

73d6bd2

Code cleanup gym obs builders.

37fe60f

Add option checkpointing.

113e719

Add rllib_demo.ipynb.

b5e4681

Remove obsolete ray shutdown.

e2a758c

Add rllib_demo.ipynb.

952688d

Cleanup example cli interface.

8b1f65a

Fix FlattenTreeObservation to contain all 12 features.

a778e8c

Add TreeObs TODOs.

043c412

Update TODOs.

ab29664

chenkins added 7 commits December 20, 2024 18:34

Add regression test for tree obs.

9acf958

Split flattening and normalization in FlattenTreeObsForRailEnv.

ea222fa

Split flattening and normalization in FlattenTreeObsForRailEnv.

e07eccc

Update TODOs.

2af24f0

Add documentation on feature groups in tree obs flattening.

f3547b7

Refactor tree obs normalization.

058d349

Refactor tree obs normalization.

63de68c

SergeCroise reviewed Dec 22, 2024

View reviewed changes

flatland/ml/observations/flatten_tree_observation_for_rail_env.py Outdated Show resolved Hide resolved

Cleanup.

3cea4ce

chenkins mentioned this pull request Jan 10, 2025

Bump ray from 1.5.2 to 2.8.1 in /flatland/contrib #108

Closed

chenkins commented Jan 10, 2025

View reviewed changes

Apply suggestions from code review

1c4f41b

Co-authored-by: Serge Croisé <[email protected]>

chenkins requested a review from manuschn January 10, 2025 14:01

chenkins marked this pull request as ready for review January 10, 2025 14:02

chenkins requested a review from a team as a code owner January 10, 2025 14:02

chenkins added this to the 4.0.4 milestone Jan 17, 2025

manuschn reviewed Jan 24, 2025

View reviewed changes

aiAdrian approved these changes Feb 3, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

74 Policy evaluation and training cli (rllib) #85

74 Policy evaluation and training cli (rllib) #85

chenkins commented Nov 1, 2024 •

edited

Loading

chenkins Nov 1, 2024

aiAdrian Feb 3, 2025

chenkins Jan 10, 2025

aiAdrian Feb 3, 2025

chenkins Jan 10, 2025

manuschn Jan 24, 2025

manuschn Jan 24, 2025

aiAdrian Feb 3, 2025

aiAdrian Feb 3, 2025



		# defaults from Flatland 3 Round 2 Test_0, see https://flatland.aicrowd.com/challenges/flatland3/envconfig.html
		def env_creator(n_agents=7,

		return RayMultiAgentWrapper(wrap, render_mode)


		def ray_env_creator(render_mode: Optional[str] = None, **kwargs) -> RayMultiAgentWrapper:

74 Policy evaluation and training cli (rllib) #85

Are you sure you want to change the base?

74 Policy evaluation and training cli (rllib) #85

Conversation

chenkins commented Nov 1, 2024 • edited Loading

Changes

Related issues

Checklist

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

chenkins commented Nov 1, 2024 •

edited

Loading