diff --git a/README.md b/README.md index 98899a9..9837ab2 100644 --- a/README.md +++ b/README.md @@ -7,7 +7,7 @@ Half-life regression (HLR) is a model for spaced repetition practice, with parti This repository contains a public release of the data and code used for several experiments in the following paper (which introduces HLR): > B. Settles and B. Meeder. 2016. [A Trainable Spaced Repetition Model for Language Learning](settles.acl16.pdf). -> In _Proceedings of the Association for Computational Linguistics (ACL)_, to appear. +> In _Proceedings of the Association for Computational Linguistics (ACL)_, pages 1848-1858. When using this data set and/or software, please cite this publication. A BibTeX record is: @@ -19,6 +19,8 @@ When using this data set and/or software, please cite this publication. A BibTeX Publisher = {ACL}, Title = {A Trainable Spaced Repetition Model for Language Learning}, Year = {2016} + DOI = {10.18653/v1/P16-1174}, + URL = {http://www.aclweb.org/anthology/P16-1174} } ``` @@ -32,7 +34,7 @@ The file ``evaluation.r`` implements an R function, ``sr_evaluate()``, which tak ## Data Set and Format -The data set is available here: [settles.acl16.learning_traces.13m.csv.gz](https://s3.amazonaws.com/duolingo-papers/publications/settles.acl16.learning_traces.13m.csv.gz) (361 MB). This is a gzipped CSV file containing the 13 million Duolingo student learning traces used in our experiments. +The data set is available on [Dataverse](https://dataverse.harvard.edu/dataset.xhtml?persistentId=doi:10.7910/DVN/N8XJME) (361 MB). This is a gzipped CSV file containing the 13 million Duolingo student learning traces used in our experiments. The columns are as follows: