Infrastructure already provided:

The OpenAI gym specification provides a widely recognized standard API for specifying sequential decision problems (PO)MDPs (Brockman et al) which is compatible with most leading RL frameworks. A minimal gym object is a Python class with at least three core methods: init, reset, and step, like so:

class MyEnv(gym.Env):
    def __init__(self, env_config):
        self.action_space = <gym.Space>
        self.observation_space = <gym.Space>
    def reset(self):
        return <obs>
    def step(self, action):
        return <obs>, <reward: float>, <done: bool>, <info: dict>

In practice, gyms will often define different versions of the class corresponding to alternate and often more complex variations of the same objective. Class inheritance comes in handy in such cases. In principle it is possible to allow alternative configurations (i.e. alternative parameter values) for a given gym through the env_config dictionary in the init method, but in practice methods are usually benchmarked against the default configuration, so choose these carefully.
A minimally functional example showing two versions, class inheritence, and 'registering' of the gym are included here. More information can be found in the gym repository (including its extensive collection of gyms included there-in) and by examining other existing gyms.

Tasks to deploy template:

Inital steps

Update setup.py metadata
Rename gym_template directory to match your module name
Update module name in Makefile
Update tests to reflect module name.

Building the gym

Edit the methods and class definitions in renamed gym_template directory to define the core gym methods.

Infrastructure already provided:

This template includes automatic linting and testing using GitHub Actions and a Makefile for automating common tasks:

make check-codestyle Check code style conforms to black code formatter (with 79 char soft character limit per line). (see format below to auto-reformat valid code)
make format Auto-reformats code using isort (imports) and black.
make type runs pytype to check Python classes.
make lint runs flake8 to check for syntax errors in Python code
make commit-checks: runs make calls for format, type, and lint. (Ideally should be run before any git commit)
make install installs the python module
make pytest runs pytest suite using configuration in scripts/run_tests.sh
make release: Publish a releast to pypi.org using twine, will ask for required credentials
Note that additional tools for documentation via sphinx: make spelling, make doc, and make clean` are not fully flushed out here.

Makefile automation is based on (MIT-licensed) stable-baselines3

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.github/workflows		.github/workflows
gym_template		gym_template
scripts		scripts
tests		tests
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
setup.cfg		setup.cfg
setup.py		setup.py
shell		shell

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Inital steps

Building the gym

Infrastructure already provided:

About

Releases

Packages

Languages

License

mlap/gym_template

Folders and files

Latest commit

History

Repository files navigation

Inital steps

Building the gym

Infrastructure already provided:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages