-
Notifications
You must be signed in to change notification settings - Fork 16
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
66a0e00
commit 42e5bb3
Showing
8 changed files
with
330 additions
and
37 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1 @@ | ||
Eduardo Pignatelli |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,18 @@ | ||
Copyright 2023 The Helx Authors. | ||
|
||
Licensed to the Apache Software Foundation (ASF) under one | ||
or more contributor license agreements. See the NOTICE file | ||
distributed with this work for additional information | ||
regarding copyright ownership. The ASF licenses this file | ||
to you under the Apache License, Version 2.0 (the | ||
"License"); you may not use this file except in compliance | ||
with the License. You may obtain a copy of the License at | ||
|
||
http://www.apache.org/licenses/LICENSE-2.0 | ||
|
||
Unless required by applicable law or agreed to in writing, | ||
software distributed under the License is distributed on an | ||
"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY | ||
KIND, either express or implied. See the License for the | ||
specific language governing permissions and limitations | ||
under the License. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
Apache epignatelli/helx | ||
Copyright 2023 The Helx Authors. | ||
|
||
This product includes software developed at | ||
The Apache Software Foundation (http://www.apache.org/). |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,2 +1,170 @@ | ||
# navix | ||
Minigrid in jax | ||
[](https://github.com/epignatelli/helx/actions/workflows/CI.yml) | ||
[](https://github.com/epignatelli/helx/actions/workflows/CD.yml) | ||
|
||
 | ||
 | ||
 | ||
|
||
[](https://www.repostatus.org/#wip) | ||
[](https://opensource.org/licenses/Apache-2.0) | ||
|
||
-------------- | ||
|
||
# Helx | ||
|
||
Helx aims to excel at two things: | ||
|
||
*1.* To **interoperate** between a variety of Reinforcement Learning (RL) environments and Deep Learning libraries with a **single interface**. | ||
For example, you can use `gym` with a `pytorch`, `jax` or `tensofrlow` agent, and reuse the same agents with a `bsuite` or a `dm_control` environment **without changing your code**. | ||
|
||
*2.* To easily **implement** a new Deep RL algorithm that one can plug-and-play in the ecosystem described above. | ||
|
||
|
||
## Installation | ||
```bash | ||
pip install git+https://github.com/epignatelli/helx | ||
``` | ||
|
||
If you also want to download the binaries for `mujoco`, both `gym` and `dm_control`, and `atari`: | ||
```bash | ||
helx-download-extras | ||
``` | ||
|
||
And then tell the system where the mujoco binaries are: | ||
```bash | ||
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/path/to/mujoco/lib | ||
export MJLIB_PATH=/path/to/home/.mujoco/mujoco210/bin/libmujoco210.so | ||
export MUJOCO_PY_MUJOCO_PATH=/path/to/home/.mujoco/mujoco210 | ||
``` | ||
|
||
--- | ||
## Example | ||
|
||
A typical use case is to design an agent, and toy-test it on `catch` before evaluating it on more complex environments, such as atari, procgen or mujoco. | ||
|
||
```python | ||
import bsuite | ||
import gym | ||
|
||
import helx.environment | ||
import helx.experiment | ||
import helx.agents | ||
|
||
# create the enviornment in you favourite way | ||
env = bsuite.load_from_id("catch/0") | ||
# convert it to an helx environment | ||
env = helx.environment.to_helx(env) | ||
# create the agent | ||
hparams = helx.agents.Hparams(env.obs_space(), env.action_space()) | ||
agent = helx.agents.Random(hparams) | ||
|
||
# run the experiment | ||
helx.experiment.run(env, agent, episodes=100) | ||
``` | ||
|
||
|
||
Switching to a different environment is as simple as changing the `env` variable. | ||
|
||
|
||
```diff | ||
import bsuite | ||
import gym | ||
|
||
import helx.environment | ||
import helx.experiment | ||
import helx.agents | ||
|
||
# create the enviornment in you favourite way | ||
-env = bsuite.load_from_id("catch/0") | ||
+env = gym.make("procgen:procgen-coinrun-v0") | ||
# convert it to an helx environment | ||
env = helx.environment.to_helx(env) | ||
# create the agent | ||
hparams = helx.agents.Hparams(env.obs_space(), env.action_space()) | ||
agent = helx.agents.Random(hparams) | ||
|
||
# run the experiment | ||
helx.experiment.run(env, agent, episodes=100) | ||
``` | ||
|
||
--- | ||
## Supported libraries | ||
|
||
We currently support these external environment models: | ||
- [dm_env](https://github.com/deepmind/dm_env) | ||
- [bsuite](https://github.com/deepmind/bsuite) | ||
- [dm_control](https://github.com/deepmind/dm_control), including | ||
- [Mujoco](https://mujoco.org) | ||
- [gym](https://github.com/openai/gym) and [gymnasium](https://github.com/Farama-Foundation/Gymnasium), including | ||
- The [minigrid]() family | ||
- The [minihack]() family | ||
- The [atari](https://github.com/mgbellemare/Arcade-Learning-Environment) family | ||
- The legacy [mujoco](https://www.roboti.us/download.html) family | ||
- And the standard gym family | ||
- [gym3](https://github.com/openai/gym3), including | ||
- [procgen](https://github.com/openai/procgen) | ||
|
||
#### On the road: | ||
- [gymnax](https://github.com/RobertTLange/gymnax) | ||
- [ivy_gym](https://github.com/unifyai/gym) | ||
--- | ||
## Adding a new agent (`helx.agents.Agent`) | ||
|
||
An `helx` agent interface is designed as the minimal set of functions necessary to *(i)* interact with an environment and *(ii)* reinforcement learn. | ||
|
||
```python | ||
class Agent(ABC): | ||
"""A minimal RL agent interface.""" | ||
|
||
@abstractmethod | ||
def sample_action(self, timestep: Timestep) -> Action: | ||
"""Applies the agent's policy to the current timestep to sample an action.""" | ||
|
||
@abstractmethod | ||
def update(self, timestep: Timestep) -> Any: | ||
"""Updates the agent's internal state (knowledge), such as a table, | ||
or some function parameters, e.g., the parameters of a neural network.""" | ||
``` | ||
|
||
--- | ||
## Adding a new environment library (`helx.environment.Environment`) | ||
|
||
To add a new library requires three steps: | ||
1. Implement the `helx.environment.Environment` interface for the new library. | ||
See the [dm_env](helx/environment/dm_env.py) implementation for an example. | ||
1. Implement serialisation (to `helx`) of the following objects: | ||
- `helx.environment.Timestep` | ||
- `helx.spaces.Discrete` | ||
- `helx.spaces.Continuous` | ||
2. Add the new library to the [`helx.environment.to_helx`](helx/environment/interop.py#L16) function to tell `helx` about the new protocol. | ||
|
||
--- | ||
## Cite | ||
If you use `helx` please consider citing it as: | ||
|
||
```bibtex | ||
@misc{helx, | ||
author = {Pignatelli, Eduardo}, | ||
title = {Helx: Interoperating between Reinforcement Learning Experimental Protocols}, | ||
year = {2021}, | ||
publisher = {GitHub}, | ||
journal = {GitHub repository}, | ||
howpublished = {\url{https://github.com/epignatelli/helx}} | ||
} | ||
``` | ||
|
||
--- | ||
## A note on maintainance | ||
This repository was born as the recipient of personal research code that was developed over the years. | ||
Its maintainance is limited by the time and the resources of a research project resourced with a single person. | ||
Even if I would like to automate many actions, I do not have the time to maintain the whole body of automation that a well maintained package deserves. | ||
This is the reason of the WIP badge, which I do not plan to remove soon. | ||
Maintainance will prioritise the code functionality over documentation and automation. | ||
|
||
Any help is very welcome. | ||
A quick guide to interacting with this repository: | ||
- If you find a bug, please open an issue, and I will fix it as soon as I can. | ||
- If you want to request a new feature, please open an issue, and I will consider it as soon as I can. | ||
- If you want to contribute yourself, please open an issue first, let's discuss objective, plan a proposal, and open a pull request to act on it. | ||
|
||
If you would like to be involved further in the development of this repository, please contact me directly at: `edu dot pignatelli at gmail dot com`. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,94 @@ | ||
# This toml file is in comliance with PEP 517, PEP 518 and PEP 621 | ||
# However, setuptools support for these is only in Beta | ||
# For now, `setup.py` is still the first option for pip | ||
[build-system] | ||
build-backend = "setuptools.build_meta" | ||
requires = ["setuptools >= 50", "setuptools-scm[toml]>=6.2", "wheel"] | ||
|
||
[project] | ||
name = "Helx" | ||
dynamic = ["version", "dependencies"] | ||
description = "Interoperate among reinforcement learning libraries with jax, pytorch, gym and dm_env" | ||
requires-python = ">=3.9" | ||
readme = "README.md" | ||
license = {file = "LICENSE", name = "Apache-2.0"} | ||
authors = [ | ||
{name = "Eduardo Pignatelli", email = "[email protected]"}, | ||
] | ||
maintainers = [ | ||
{name = "Eduardo Pignatelli", email = "[email protected]"}, | ||
] | ||
classifiers = [ | ||
"Development Status :: 3 - Alpha", | ||
"Intended Audience :: Developers", | ||
"Intended Audience :: Science/Research", | ||
"Intended Audience :: Financial and Insurance Industry", | ||
"Intended Audience :: Healthcare Industry", | ||
"Environment :: GPU", | ||
"Environment :: GPU :: NVIDIA CUDA", | ||
"Environment :: GPU :: NVIDIA CUDA :: 11.0", | ||
"Environment :: GPU :: NVIDIA CUDA :: 11.1", | ||
"Environment :: GPU :: NVIDIA CUDA :: 11.2", | ||
"Environment :: GPU :: NVIDIA CUDA :: 11.3", | ||
"Environment :: GPU :: NVIDIA CUDA :: 11.4", | ||
"Environment :: GPU :: NVIDIA CUDA :: 11.5", | ||
"Environment :: GPU :: NVIDIA CUDA :: 11.6", | ||
"Environment :: GPU :: NVIDIA CUDA :: 11.7", | ||
"Environment :: GPU :: NVIDIA CUDA :: 11.8", | ||
"Framework :: Pytest", | ||
"License :: OSI Approved :: Apache Software License", | ||
"Operating System :: OS Independent", | ||
"Programming Language :: Python", | ||
"Programming Language :: Python :: 3", | ||
"Programming Language :: Python :: 3.9", | ||
"Programming Language :: Python :: 3.10", | ||
"Programming Language :: Python :: 3.11", | ||
"Topic :: Scientific/Engineering", | ||
"Topic :: Scientific/Engineering :: Artificial Intelligence", | ||
"Topic :: Software Development", | ||
"Topic :: Software Development :: Libraries", | ||
"Topic :: Software Development :: Libraries :: Python Modules", | ||
] | ||
|
||
|
||
[project.urls] | ||
homepage = "https://github.com/epignatelli/helx" | ||
repository = "https://github.com/epignatelli/helx" | ||
bug_tracker = "https://github.com/epignatelli/helx/issues" | ||
|
||
|
||
[project.optional-dependencies] | ||
atari = [ | ||
"AutoROM.accept-rom-license==0.4.2" | ||
] | ||
mujoco = [ | ||
|
||
] | ||
all = [ | ||
"AutoROM.accept-rom-license==0.4.2" | ||
] | ||
|
||
|
||
[project.scripts] | ||
helx-download-extras = "helx.download:download_all" | ||
|
||
|
||
[tool.setuptools.dynamic] | ||
dependencies = {file = "./requirements.txt"} | ||
|
||
|
||
[tool.setuptools.packages.find] | ||
include = ["helx*"] | ||
exclude = ["test", "examples"] | ||
|
||
|
||
[tool.distutils.bdist_wheel] | ||
universal = true | ||
|
||
|
||
[tool.setuptools_scm] | ||
write_to = "helx/version.py" | ||
|
||
|
||
[tool.black] | ||
line-length = 88 |
Oops, something went wrong.