This repository bundles the code and experiments for my Master's thesis on online adaptation in multi-agent reinforcement learning (MARL). The thesis is jointly supervised by Jakob Foerster, with support from his research group at FLAIR, and Arnaud Doucet from the Department of Statistics at the University of Oxford.
The top-level scripts demonstrate the basic functionality and structure of this project.
- 01_step_through_env.py shows how to initialize and step through an iterated lever environment with custom parameters and partner policies (see the first sketch after this list).
- 02_q_learning.py combines the environment with a learner of class `DQNAgent` to perform vanilla Q-learning (a sketch of the training loop follows the list).
- 03_es_meta_learning.py exemplifies how the `OpenES` class, which implements the evolution strategies algorithm Open-ES, can be used to learn initial network weights capable of remembering a fixed partner pattern of length three (see the Open-ES sketch after this list).
- 04_es_learn_history_representations.py shows how evolution strategies can be used to learn the parameters of an LSTM that yields a history representation suitable for effective Q-learning.
- 05_learning_with_drqn.py exemplifies the adaptation baseline, a simple deep recurrent Q-learner based on the work by Hausknecht and Stone (a sketch of its acting loop follows the list).
- 06_step_through_marl_env.py shows how to step through the iterated lever environment with a pair of (possibly learning) agents instead of a fixed partner policy (see the final sketch below).
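
The following sketches give a rough feel for how the scripts above are used. They are illustrations only: apart from `DQNAgent` and `OpenES`, every class name, import path, constructor argument, and method signature is an assumption; the scripts themselves are the authoritative reference.

A minimal sketch of what 01_step_through_env.py does, assuming a gym-style environment class `IteratedLeverEnvironment` and a scripted `FixedPatternPartner` policy (both names hypothetical):

```python
# Sketch of 01_step_through_env.py (all names and signatures are assumptions).
from levers import IteratedLeverEnvironment, FixedPatternPartner  # hypothetical imports

# Environment with per-lever payoffs, an episode length, and a scripted partner
# that repeats a fixed pattern of levers.
env = IteratedLeverEnvironment(
    payoffs=[1.0, 1.0, 1.0],
    n_iterations=100,
    partner=FixedPatternPartner(pattern=[0, 1, 2]),
)

obs = env.reset()
done = False
while not done:
    action = env.action_space.sample()      # random lever pull, just for illustration
    obs, reward, done = env.step(action)
    print(f"action={action} reward={reward}")
```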
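02_q_learning.py pairs the environment with a `DQNAgent`. The class name comes from the script; the constructor arguments and method names below are assumptions about its interface. A vanilla Q-learning loop might look roughly like this:

```python
# Rough shape of the training loop in 02_q_learning.py (method names are assumptions).
from levers import IteratedLeverEnvironment, FixedPatternPartner  # hypothetical imports
from levers.learner import DQNAgent                               # DQNAgent is named in the script

env = IteratedLeverEnvironment(payoffs=[1.0, 1.0, 1.0], n_iterations=100,
                               partner=FixedPatternPartner(pattern=[0, 1, 2]))
agent = DQNAgent(obs_dim=env.observation_space.shape[0],
                 n_actions=env.action_space.n,
                 learning_rate=1e-3)

for episode in range(500):
    obs = env.reset()
    done = False
    while not done:
        action = agent.act(obs)                              # epsilon-greedy action
        next_obs, reward, done = env.step(action)
        agent.update(obs, action, reward, next_obs, done)    # one Q-learning update
        obs = next_obs
```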
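03_es_meta_learning.py relies on the `OpenES` class to evolve initial network weights, and 04_es_learn_history_representations.py applies the same ask/evaluate/tell pattern to the parameters of an LSTM history encoder. Only the class name `OpenES` comes from the repository; the constructor arguments, the `ask`/`tell` methods, and the fitness function below are assumptions used to show the generic Open-ES loop:

```python
# Generic Open-ES loop, as used (roughly) by 03_es_meta_learning.py and
# 04_es_learn_history_representations.py. Constructor arguments and methods are assumptions.
import numpy as np
from levers.learner import OpenES  # hypothetical import path

def evaluate(flat_params: np.ndarray) -> float:
    # Placeholder fitness. In the scripts, the candidate parameters are loaded into a
    # network (initial weights in 03, LSTM encoder weights in 04) and fitness is the
    # performance achieved on the lever environment.
    return -float(np.sum(flat_params ** 2))

es = OpenES(n_params=1234, population_size=64, sigma=0.1, learning_rate=0.01)
for generation in range(200):
    population = es.ask()                                # sample perturbed parameter vectors
    fitness = np.array([evaluate(p) for p in population])
    es.tell(population, fitness)                         # gradient-estimate update of the mean
print("evolved mean parameters:", es.mean)               # attribute name is an assumption
```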
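05_learning_with_drqn.py is the adaptation baseline: a recurrent Q-learner in the spirit of Hausknecht and Stone's DRQN. The sketch below only illustrates how a recurrent hidden state is threaded through the acting loop; the agent class and all of its methods are assumptions.

```python
# Sketch of the acting loop of a recurrent Q-learner (05_learning_with_drqn.py).
# DRQNAgent and its methods are assumptions; see the script for the real API.
from levers import IteratedLeverEnvironment, FixedPatternPartner  # hypothetical imports
from levers.learner import DRQNAgent                              # hypothetical

env = IteratedLeverEnvironment(payoffs=[1.0, 1.0, 1.0], n_iterations=100,
                               partner=FixedPatternPartner(pattern=[0, 1, 2]))
agent = DRQNAgent(obs_dim=env.observation_space.shape[0], n_actions=env.action_space.n)

for episode in range(500):
    obs = env.reset()
    hidden = agent.initial_hidden()        # recurrent state summarizes the episode so far
    done = False
    while not done:
        action, hidden = agent.act(obs, hidden)
        next_obs, reward, done = env.step(action)
        agent.store(obs, action, reward, next_obs, done)   # sequence replay buffer
        obs = next_obs
    agent.train_on_sequences()             # BPTT update on sampled episode sequences
```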
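06_step_through_marl_env.py replaces the fixed partner with two (possibly learning) agents acting in the same environment. A sketch of the joint-action stepping loop, again with all names beyond `DQNAgent` being assumptions:

```python
# Sketch of stepping a two-agent lever environment (06_step_through_marl_env.py).
# IteratedLeverMARLEnvironment and the per-agent interface are assumptions.
from levers import IteratedLeverMARLEnvironment  # hypothetical import
from levers.learner import DQNAgent              # DQNAgent is named in 02_q_learning.py

env = IteratedLeverMARLEnvironment(payoffs=[1.0, 1.0, 1.0], n_iterations=100)
agents = [DQNAgent(obs_dim=env.observation_space.shape[0], n_actions=env.action_space.n)
          for _ in range(2)]

obs = env.reset()                                 # one observation per agent
done = False
while not done:
    actions = [agent.act(o) for agent, o in zip(agents, obs)]
    next_obs, rewards, done = env.step(actions)   # joint action in, per-agent rewards out
    for agent, o, a, r, o2 in zip(agents, obs, actions, rewards, next_obs):
        agent.update(o, a, r, o2, done)           # each agent learns from its own transition
    obs = next_obs
```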