mope

martingale off-policy evaluation

conda installation

conda env create -f environment.yml

outline

opebet.py contains the code for MOPE (wealth_lb_2d) and its ablations:
- wealth_lb_1d: scalar betting
- wealth_2d: exact wealth maximization
- wealth_lb_2d_individual_qps: individual bets per value on a grid
opebetrp.py contains code for reward predictors and gated deployment
- wealth_lb_rp subtracts the reward predictor control variate from w*r
- wealth_lb_rp_double_hedge the double hedging strategy
- wealth_lb_gd confidence sequence for gated deployment

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
environments		environments
.gitignore		.gitignore
Betting-Coverage.ipynb		Betting-Coverage.ipynb
Betting-Width.ipynb		Betting-Width.ipynb
Gated-Deployment.ipynb		Gated-Deployment.ipynb
LICENSE		LICENSE
Mnist-Policies.ipynb		Mnist-Policies.ipynb
README.md		README.md
Reward-Predictor.ipynb		Reward-Predictor.ipynb
bad_case.json		bad_case.json
cs_via_supermartingale.py		cs_via_supermartingale.py
environment.yml		environment.yml
experiments.py		experiments.py
opebet.py		opebet.py
opebetrp.py		opebetrp.py
width.pdf		width.pdf
width1d_10_0.05.pkl		width1d_10_0.05.pkl
width1d_10_0.5.pkl		width1d_10_0.5.pkl
width1d_50_0.05.pkl		width1d_50_0.05.pkl
width1d_50_0.5.pkl		width1d_50_0.5.pkl
width2d_10_0.5.pkl		width2d_10_0.5.pkl
width2d_50_0.05.pkl		width2d_50_0.05.pkl
width2d_50_0.5.pkl		width2d_50_0.5.pkl
widthEWA_10_0.05.pkl		widthEWA_10_0.05.pkl
widthEWA_10_0.5.pkl		widthEWA_10_0.5.pkl
widthEWA_50_0.05.pkl		widthEWA_50_0.05.pkl
widthEWA_50_0.5.pkl		widthEWA_50_0.5.pkl
widthEWAslow_10_0.05.pkl		widthEWAslow_10_0.05.pkl
widthEWAslow_10_0.5.pkl		widthEWAslow_10_0.5.pkl
widthEWAslow_50_0.05.pkl		widthEWAslow_50_0.05.pkl
widthEWAslow_50_0.5.pkl		widthEWAslow_50_0.5.pkl
widthiqp_10_0.05.pkl		widthiqp_10_0.05.pkl
widthiqp_10_0.5.pkl		widthiqp_10_0.5.pkl
widthiqp_50_0.05.pkl		widthiqp_50_0.05.pkl
widthiqp_50_0.5.pkl		widthiqp_50_0.5.pkl
widthlog_10_0.05.pkl		widthlog_10_0.05.pkl
widthlog_10_0.5.pkl		widthlog_10_0.5.pkl
widthlog_50_0.05.pkl		widthlog_50_0.05.pkl
widthlog_50_0.5.pkl		widthlog_50_0.5.pkl
widthpointasym_10_0.05.pkl		widthpointasym_10_0.05.pkl
widthpointasym_10_0.5.pkl		widthpointasym_10_0.5.pkl
widthpointasym_50_0.05.pkl		widthpointasym_50_0.05.pkl
widthpointasym_50_0.5.pkl		widthpointasym_50_0.5.pkl
widthsupermartingale1d_10_0.05.pkl		widthsupermartingale1d_10_0.05.pkl
widthsupermartingale1d_10_0.5.pkl		widthsupermartingale1d_10_0.5.pkl
widthsupermartingale1d_50_0.05.pkl		widthsupermartingale1d_50_0.05.pkl
widthsupermartingale1d_50_0.5.pkl		widthsupermartingale1d_50_0.5.pkl
widthsupermartingale_10_0.05.pkl		widthsupermartingale_10_0.05.pkl
widthsupermartingale_10_0.5.pkl		widthsupermartingale_10_0.5.pkl
widthsupermartingale_50_0.05.pkl		widthsupermartingale_50_0.05.pkl
widthsupermartingale_50_0.5.pkl		widthsupermartingale_50_0.5.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

mope

conda installation

outline

About

Releases

Packages

Languages

License

zmhammedi/mope

Folders and files

Latest commit

History

Repository files navigation

mope

conda installation

outline

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages