Contextual bandit example

This repo contains a simple Python implementation of a contextual bandit, and an example showing how to use it to optimise click-through rates for different advertisements. The bandit maintains one regression model per arm to predict that arm's expected cost (i.e. negative reward). Exploration happens 10% of the time -- you can change this by editing the epsilon parameter in app.py.
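
To make the idea concrete, here is a minimal sketch of an epsilon-greedy contextual bandit with one cost model per arm. It is not the code in bandit.py: the names EpsilonGreedyBandit, select_arm, update and simulate are hypothetical, the click probabilities are made up, and the per-arm model is a plain-Python linear regressor (no intercept, trained by stochastic gradient descent) standing in for whatever regression model the repo actually uses.

```python
import random


class EpsilonGreedyBandit:
    """Hypothetical sketch: one linear cost model per arm, epsilon-greedy choice."""

    def __init__(self, n_arms, n_features, epsilon=0.1, learning_rate=0.05, seed=None):
        self.epsilon = epsilon
        self.learning_rate = learning_rate
        self.rng = random.Random(seed)
        # One weight vector per arm; all predictions start at zero cost.
        self.weights = [[0.0] * n_features for _ in range(n_arms)]

    def predict_cost(self, arm, context):
        # Expected cost (negative reward) of playing `arm` in this context.
        return sum(w * x for w, x in zip(self.weights[arm], context))

    def select_arm(self, context):
        # Explore: with probability epsilon, pick an arm uniformly at random.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.weights))
        # Exploit: pick the arm with the lowest predicted cost.
        costs = [self.predict_cost(a, context) for a in range(len(self.weights))]
        return costs.index(min(costs))

    def update(self, arm, context, cost):
        # One SGD step on squared error, applied only to the arm that was played.
        error = self.predict_cost(arm, context) - cost
        w = self.weights[arm]
        for i, x in enumerate(context):
            w[i] -= self.learning_rate * error * x


def simulate(rounds=5000, seed=0):
    """Toy ad simulation: two user segments, two ads, made-up click rates."""
    rng = random.Random(seed)
    bandit = EpsilonGreedyBandit(n_arms=2, n_features=2, epsilon=0.1, seed=seed)
    click_prob = [[0.9, 0.1], [0.1, 0.9]]  # click_prob[segment][ad]
    for _ in range(rounds):
        segment = rng.randrange(2)
        context = [1.0, 0.0] if segment == 0 else [0.0, 1.0]
        ad = bandit.select_arm(context)
        clicked = rng.random() < click_prob[segment][ad]
        # A click is a reward of 1, i.e. a cost of -1.
        bandit.update(ad, context, -1.0 if clicked else 0.0)
    return bandit
```

Calling simulate() returns a trained bandit; setting its epsilon to 0 afterwards turns off exploration, so select_arm returns the arm with the lowest predicted cost for a given context.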

Requirements

The bandit requires Python 3.5+ and the packages scikit-learn, scipy and numpy. To install them all:

 pip install -U scikit-learn scipy numpy

Running the simulation

python3 app.py

Running the demo application

Note: the demo requires Docker to be installed on your machine.

Build the Docker image

From the root directory of the repository:

docker build -t bandit-demo .
docker run -p 8000:8000 -v $(pwd)/static:/bandit/static bandit-demo

Once the container is running, open the demo at http://0.0.0.0:8000 in any browser. If you want to make changes to the demo, edit static/index.html and reload the browser tab.

Todo

Remove base64 encoding to server logic
Add support for shared features
Add support for model loading via queue
Add support for model training via queue
Add support for Boltzmann exploration
