A repository for code for empirical investigations of pessimistic agents

j-bernardi/pessimistic-agents

Pessimistic Agents

Pessimistic Agents are ask-for-help reinforcement learning agents that offer guarantees of:

  1. Eventually outperforming the mentor
  2. Eventually stopping querying the mentor
  3. Never causing unprecedented events, with arbitrarily high probability

In this repository, we investigate their behaviour in the faithful setting, and explore approximations that allow them to be used in real-world RL problems.

Overview - see individual README.md files for more detail.


Distributional Q Learning - dist_q_learning/

We introduce a tractable implementation of Pessimistic Agents: the Bayesian world and mentor models are approximated as a distribution over the epistemic uncertainty of Q values. By acting on a pessimistic (low) quantile of this distribution, we demonstrate the expected behaviour and safety results for a pessimistic agent.
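As a rough sketch of the idea (illustrative names only, not the repo's actual API), pessimistic action selection over a sampled epistemic Q distribution might look like:

```python
import numpy as np

def pessimistic_action(q_samples, quantile=0.05):
    """Choose the action maximising a low quantile of the epistemic
    Q-value distribution, here represented by samples per action.

    q_samples: array of shape (n_samples, n_actions), e.g. drawn
    from a posterior over Q values.
    """
    pessimistic_q = np.quantile(q_samples, quantile, axis=0)
    return int(np.argmax(pessimistic_q)), pessimistic_q

def should_query_mentor(pessimistic_q, mentor_value_estimate):
    # Defer to the mentor when even the best pessimistic value falls
    # below the estimated value of following the mentor's policy.
    return bool(pessimistic_q.max() < mentor_value_estimate)
```

As the agent gathers data and its epistemic uncertainty shrinks, the low quantile rises toward the mean estimate, so mentor queries naturally become rarer.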

Work Status
  Finite-state Q-table proof of concept: DONE
  Continuous deep Q-learning implementation: WIP

Faithful implementation - cliffworld/

Implement and investigate a faithful representation of a Bayesian Pessimistic Agent.

Work Status
  Environment: DONE
  Agent: HOLD

On hold; some progress has been made in implementing the environment and mentor models.


Pessimistic RL - pessimistic_prior/

Apply the pessimism approximation to neural-network-based deep Q-learning agents.
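One common way to approximate epistemic pessimism in deep Q learning (a sketch under assumed names, not necessarily how this repo implements it) is to train an ensemble of Q networks and build TD targets from a low quantile of their predictions:

```python
import numpy as np

def pessimistic_td_target(next_q_ensemble, reward, gamma=0.99, quantile=0.1):
    """Form a pessimistic TD target from an ensemble's next-state Q values.

    next_q_ensemble: array of shape (n_networks, n_actions) giving each
    ensemble member's Q estimates at the next state.
    """
    # A low quantile across ensemble members stands in for a low
    # quantile of the epistemic Q-value distribution.
    pessimistic_next_q = np.quantile(next_q_ensemble, quantile, axis=0)
    return reward + gamma * pessimistic_next_q.max()
```

Each ensemble member would then regress toward this shared pessimistic target on its own bootstrapped data.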

Work Status
  DQN proof of concept: HOLD

Setup

A supported conda environment is provided. With Anaconda installed, create it with:

conda env create -f torch_env_cpu.yml
