This project contains a working example of a contextual multi-armed bandit.
I wrote this after becoming interested in the contextual bandit problem for providing personalized recommendations, but not being able to find any working code. So I made this to understand how the algorithm works :)
This repository indexes high on code and low on docs. Briefly, it contains:

- A driver IPython notebook, `contextual_bandit_sim.ipynb`. Start here to understand the contents.
- A data generator. We initialize hidden contextual variables that are used to create synthetic samples. Let's call this latent contextual variable set `L`, and the synthetic data `X` (see the first sketch after this list).
- Two strategies / reinforcement functions `S` by which the bandits can be evaluated: binomial (`BinaryStrategy.py`) and continuous (`PositiveStrategy.py`). A sketch of the idea follows the list.
- A simulator, `Simulator.py`, that maintains a set of variables `M` estimating `L`. By observing `X` and evaluating those observations with the strategies `S`, the simulator updates `M` so that it converges to the hidden `L` (see the last sketch below).
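
To make the moving parts concrete, here is a minimal sketch of the data-generation idea, assuming a linear model per arm. This is not the repo's actual generator; the names (`n_arms`, `make_context`) and the Gaussian draws are assumptions for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n_arms, n_features = 3, 5  # hypothetical sizes, not the repo's defaults

# Hidden latent variable set L: one weight vector per arm.
L = rng.normal(size=(n_arms, n_features))

def make_context():
    """Draw one synthetic context vector (one row of the data X)."""
    return rng.normal(size=n_features)

x = make_context()
print(L @ x)  # each arm's latent affinity for this context
```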
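The strategies then turn an arm's latent score for a context into an observed reward. Here is a hedged sketch of what binomial vs. continuous evaluation might look like; the function names and the sigmoid/Gaussian-noise choices are mine, not necessarily what `BinaryStrategy.py` and `PositiveStrategy.py` do.

```python
import numpy as np

def binary_reward(score, rng):
    """Binomial-style strategy: reward ~ Bernoulli(sigmoid(score))."""
    p = 1.0 / (1.0 + np.exp(-score))
    return float(rng.random() < p)

def positive_reward(score, rng):
    """Continuous strategy: a noisy, non-negative reward around the score."""
    return max(0.0, score + rng.normal(scale=0.1))

rng = np.random.default_rng(0)
print(binary_reward(0.7, rng), positive_reward(0.7, rng))
```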
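Finally, a sketch of the simulator loop: keep estimates `M`, pick an arm for each context (here with epsilon-greedy exploration, which is an assumption; `Simulator.py` may use a different policy), observe a reward generated from the hidden `L`, and nudge the pulled arm's row of `M` toward `L` with a least-squares SGD step.

```python
import numpy as np

rng = np.random.default_rng(1)
n_arms, n_features, lr, eps = 3, 5, 0.05, 0.1  # hypothetical settings

L = rng.normal(size=(n_arms, n_features))  # hidden truth
M = np.zeros((n_arms, n_features))         # running estimates of L

for t in range(5000):
    x = rng.normal(size=n_features)       # observe a context
    if rng.random() < eps:
        a = rng.integers(n_arms)          # explore a random arm
    else:
        a = int(np.argmax(M @ x))         # exploit current estimates
    r = x @ L[a] + rng.normal(scale=0.1)  # reward drawn from the hidden L
    M[a] += lr * (r - M[a] @ x) * x       # SGD step toward L[a]

print(np.abs(M - L).mean())  # mean absolute error between M and L
```

The update minimizes the squared error between the predicted and observed reward for the pulled arm, so the mean absolute error printed at the end should shrink as `M` converges toward the hidden `L`.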
If you find this useful, please share, retweet, and follow me @allenday on Twitter.
Have fun!