74 Policy evaluation and training cli (rllib) #85

Open
wants to merge 30 commits into
base: main

Conversation

chenkins
Contributor

@chenkins chenkins commented Nov 1, 2024

Changes

Related issues

Closes #74.
Fixes #23.
Closes #75

Checklist

  • Tests are included for relevant behavior changes.
  • Documentation is added in the docs folder for relevant behavior changes. If you made important user-facing
    changes, describe them under the [Unreleased] tag in CHANGELOG.md.
  • New package dependencies are declared in the pyproject.toml file.
    Requirement files have been updated by running tox -e requirements.
  • Code works with all supported Python versions (3.8, 3.9 and 3.10). Checks run with all three versions and are
    required to pass.
  • Code is formatted according to PEP 8 (an IDE like PyCharm can do this for you).
  • Technical guidelines listed in CONTRIBUTING.md are followed.

@chenkins chenkins changed the base branch from main to python-base-version-310 November 1, 2024 08:37
@@ -192,8 +192,6 @@ def __init__(self,
self.num_resets = 0
self.distance_map = DistanceMap(self.agents, self.height, self.width)

self.action_space = [5]
Contributor Author

Is this removal safe? Remove from flatland.core.Environment.env as well or redefine there?

Contributor

What do you mean by "removal safe" and "remove from ..."? Would you like to delete the file or just action_space?

Base automatically changed from python-base-version-310 to main November 18, 2024 14:36
@chenkins chenkins changed the title 74 policy evaluation and training cli 74 Policy evaluation and training cli (rllib) Nov 20, 2024
@chenkins chenkins force-pushed the 74-policy-evaluation-and-training-cli branch from daec963 to 3f903ca Compare December 4, 2024 13:13
self.env_renderer = RenderTool(wrap)

self.action_space: gym.spaces.Dict = spaces.Dict({
# TODO document why str is necessary - is it?
Contributor Author

TODO document why str is necessary - is it?

Contributor

str(i): I don't understand this.

Would you like a Discrete(5) for each agent, i.e. |agents| x |action space|?
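
To make the question concrete, here is a plain-Python sketch of what the str(i) keying implies: one entry of 5 discrete actions per agent, keyed by the stringified agent index, plus the conversion back to integer handles. The helper names build_action_space_spec and to_env_actions are hypothetical and not part of this PR, and the sketch assumes the wrapped environment expects integer agent handles. Whether string keys are actually required remains the open TODO.

```python
from typing import Dict

# Assumed: 5 discrete actions per agent, matching the removed `self.action_space = [5]`.
N_ACTIONS = 5


def build_action_space_spec(n_agents: int) -> Dict[str, int]:
    """Hypothetical helper: one action count per agent, keyed by str(i)."""
    return {str(i): N_ACTIONS for i in range(n_agents)}


def to_env_actions(action_dict: Dict[str, int]) -> Dict[int, int]:
    """Hypothetical helper: map string agent ids back to integer handles."""
    return {int(agent_id): action for agent_id, action in action_dict.items()}


spec = build_action_space_spec(3)
env_actions = to_env_actions({"0": 2, "1": 4, "2": 0})
```

If the string keys turn out to be unnecessary, both helpers collapse to using range(n_agents) directly and the conversion step disappears.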

Comment on lines +26 to +29
# TODO dqn not working: rewards = scipy.signal.lfilter([1], [1, -gamma], raw_rewards[::-1], axis=0)[
# ~~~~~~~~~~~^^^^^^
# TypeError: unhashable type: 'slice'
# "DQN",
Contributor Author

TODO dqn not working
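
One possible reading of the traceback, not confirmed in this thread: the expression raw_rewards[::-1] fails when raw_rewards is a dict (for example, keyed by agent id) rather than a list or array, because a dict cannot be indexed with a slice. The filter call itself computes a discounted cumulative sum over the reversed sequence. The sketch below reproduces both points in plain Python; discount_cumsum and slicing_fails are illustrative names, not the library's code.

```python
from typing import List


def discount_cumsum(rewards: List[float], gamma: float) -> List[float]:
    """Discounted return per step: out[t] = rewards[t] + gamma * out[t + 1]."""
    out = [0.0] * len(rewards)
    running = 0.0
    # Walk backwards, which is what filtering the reversed sequence amounts to.
    for t in range(len(rewards) - 1, -1, -1):
        running = rewards[t] + gamma * running
        out[t] = running
    return out


def slicing_fails(obj) -> bool:
    """Return True if obj[::-1] raises, as it does for a dict."""
    try:
        obj[::-1]
    except (TypeError, KeyError):
        # Older Python versions report "unhashable type: 'slice'" as a TypeError;
        # newer versions may raise KeyError instead.
        return True
    return False


returns = discount_cumsum([1.0, 1.0, 1.0], gamma=0.5)
dict_fails = slicing_fails({"0": 1.0, "1": 1.0})
list_fails = slicing_fails([1.0, 1.0])
```

If this reading is right, the fix would be to pass per-agent reward sequences rather than a dict into the discounting step, but that needs to be checked against the actual batch layout.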

@chenkins chenkins requested a review from manuschn January 10, 2025 14:01
@chenkins chenkins marked this pull request as ready for review January 10, 2025 14:02
@chenkins chenkins requested a review from a team as a code owner January 10, 2025 14:02
@chenkins chenkins added this to the 4.0.4 milestone Jan 17, 2025


# defaults from Flatland 3 Round 2 Test_0, see https://flatland.aicrowd.com/challenges/flatland3/envconfig.html
def env_creator(n_agents=7,
Contributor

Maybe use env_generator, to match the naming of the rail and line generators?

return RayMultiAgentWrapper(wrap, render_mode)


def ray_env_creator(render_mode: Optional[str] = None, **kwargs) -> RayMultiAgentWrapper:
Contributor

Same question as above regarding naming: _creator or _generator?

Successfully merging this pull request may close these issues.

  • Algorithm/Policy Abstraction
  • Flatland policy evaluation and training cli.
  • DummyObservationBuilder fix
4 participants