This respository contains code for the following paper:
C.J. Yuan, Y. Chandak, S. Giguere, P.S. Thomas, and S. Niekum. SOPE: Spectrum of Off-Policy Estimators. Neural Information Processing Systems (NeurIPS), December 2021.
The code built upon Caltech OPE Benchmarking Suite (COBS) Library.
Please use Python 3.6+.
pip install -r requirements.txt
pip install -e .
To reproduce the plots from the paper, see:
run_graph.py
run_toymc.py
@article{yuan2021sope,
title={SOPE: Spectrum of Off-Policy Estimators},
author={Yuan, Christina J and Chandak, Yash and Giguere, Stephen and Thomas, Philip S and Niekum, Scott},
journal={arXiv preprint arXiv:2111.03936},
year={2021}
}