Skip to content

Latest commit

 

History

History
6 lines (4 loc) · 324 Bytes

README.md

File metadata and controls

6 lines (4 loc) · 324 Bytes

Reinforcement-Learning

Here is a study of the Multi Armed Bandit Applied to A/B testing in a Marketing Use Case. The aim of this project was to vulgarize the paper "A framework for Multi-A(rmed)/B(andit) testing with online FDR control" from Fanny Yang, Aaditya Ramdas, Kevin Jamieson, Martin J. Wainwright.

Hugo Mallet