Skip to content

Latest commit

 

History

History
111 lines (97 loc) · 4.04 KB

README.md

File metadata and controls

111 lines (97 loc) · 4.04 KB

rl4fisheries

RL and Bayesian optimization methodologies for harvest control rule optimization in fisheries. Includes:

  • A gymnasium environment for a Walleye population dynamics model
  • Policy functions for different commonly-tested policies (including those in the paper)
  • Scripts to optimize RL policies and non-RL policies
  • Notebooks to reproduce paper figures
  • Templates to train new RL policies on our Walleye environment

Installation

To install this source code, you need to have git, Python and pip installed. To quickly check whether these are installed you can open the terminal and run the following commands:

git version
pip --version
python -V

If the commands are not recognized by the terminal, refer to here for git installation instructions, here for Python installation instructions and/or here for pip installation instructions.

To install this source code, run

git clone https://github.com/boettiger-lab/rl4fisheries.git
cd rl4fisheries
pip install -e .

Optimized policies

The optimized policies presented in the paper---both RL policies and non-RL policies such as the precautionary policy---are saved in a public hugging-face repository. RL policies are saved as zip files named PPO-AsmEnv-(...)-UMx-(...).zip since the RL algorithm PPO was used to optimize them. Here UM stands for utility model and x=1, 2, or 3 designates which utility model the policy was optimized for. Precautionary policies are named cr-UMx.pkl (CR stands for "cautionary rule", an acronym we used during the research phase of this collaboration). Similarly, constant escapement policies are saved as esc-UMx.pkl and FMSY policies are saved as msy-UMx.pkl.

Reproducing paper figures

The Jupyter notebooks found at rl4fisheries/notebooks/for_results may be used to recreate the figures found in the paper. Notice that the data for the plots is re-generated each time the notebook is run so, e.g., the time-series plots will look different.

To reproduce these figures in your own machine you need to have Jupyter Notebooks installed, however you can navigate to https://github.com/boettiger-lab/rl4fisheries and click on code > codespaces > Create codespace on main to open the notebooks in a Github codespace.

Optimizing RL policies

To optimize an RL policy from scratch, use the command

python scripts/train.py -f path/to/config/file.yml

You can use the following template config file:

python scripts/train.py -f hyperpars/RL-template.yml

The config files we used for the policies in our paper are found at hyperpars/for_results/. For example this config file was used to train 1-Obs. RL in Scenario 1 (utility = total harvest). The trained model is automatically pushed to hugging-face if a hugging-face token is provided.

Source code structure

rl4fisheries
|
|-- hyperpars
|   |
|   |-- configuration yaml files
|
|-- notebooks
|   |
|   |-- Jupyter notebooks
|
|-- src/rl4fisheries
|   |
|   |-- agents
|   |   |
|   |   |-- interfaces for policies such as Precautionary Policy, FMSY, Constant Escapement
|   |
|   |-- envs
|   |   |
|   |   |-- Gymnasium environments used in our study.
|   |       (specifically, asm_env.py is used for our paper).
|   |
|   |-- utils
|       |
|       |-- ray.py: RL training within Ray framework (not used in paper)
|       |
|       |-- sb3.py: RL training within Stable Baselines framework (used in paper)
|       |
|       |-- simulation.py: helper functions to simulate the system dynamics using a policy
|    
|-- tests
|   |
|   |-- continuous integration tests to ensure code quality in pull requests
|
|-- noxfile.py: file used to run continuous integration tests
|
|-- pyproject.toml: file used in the installation of this source code
|
|-- README.md