A starter pack for running online experiments on Heroku using Psiturk or Prolific.
Note: Looking for the previous version using JsPsych? See the jspsych branch.
However, I encourage you to give the master version a try. I worked with jspsych for a long time, and ultimately found that it limited what I could do and forced me into bad coding patterns. Unlike jspsych, the code here takes full advantage of the power of modern javascript (in particular, classes and async/await). It also uses a more flexible (and less error-prone) event-based data-recording system.
If you haven't already, start by clicking the green "Use this template" on Github to make
your own repository based on this one. If you don't want to make a repo on GitHub, you can
also just clone it: git clone https://github.com/fredcallaway/heroku-experiment
.
Make sure you have all of these installed before continuing:
- Python 3.9 - 3.11 see note below
- Postgres: https://www.postgresql.org/download/ or
brew install postgresql
- Heroku CLI: https://devcenter.heroku.com/articles/heroku-cli
Currently, the package does not work with Python 3.12 due to the removal
of the six library. You'll need to use Python 3.11 if you want to test the server
locally (which is necessary to test data saving and loading).
pyenv
is one easy way to do this. After installing it, run pyenv local 3.11
,
and pyenv will automatically use this python version in this directory
(if it doesn't work, check python --version
and which python
; it should
print "$HOME/.pyenv/shims/python").
Create a new repository using this repository as a template (on github there is a green "Use this template" button at the top of the page). Clone the new repository to your machine and cd
into the directory from a terminal.
Create a virtual environment and install the requirements with the following commands.
python3 -m venv env
source env/bin/activate
pip install -r requirements.txt
You can then see the experiment by running make dev
and opening the printed link (cmd-click). By default the experiment will be served at http://localhost:22363. You can change the port number in config.txt (e.g., to allow previewing multiple experiments at once).
- Update the email in experiment.js
- Put your IRB-approved consent form in templates/consent.html.
- If using prolific: update the [Prolific] section of config.txt
- If using mturk: do a search for "bodacious" to find places where you should change info, mostly in ad.html and config.txt
Make sure you're logged into the correct Heroku account using the Heroku CLI (use heroku auth
to see useful commands).
Create a new app and add a Postgres database. Note: these commands must be run from the project directory (the one containing this README.md). You should probably change the name of your app to something less silly.
heroku create dizzydangdoozle --buildpack heroku/python
heroku git:remote -a dizzydangdoozle
heroku addons:create heroku-postgresql
You can confirm that the heroku site has been created with the heroku domains
, which will print the domain of your shiny new website!
Make some changes and commit them using git. You can then deploy all commited changes with
git push -f heroku HEAD:master
This makes heroku build your app, which can take a minute or so. Then your website will be updated. The HEAD:master
part ensures that you post your current git branch (you can use branches to keep track of different versions of the experiment).
- The structure of the experiment is defined in static/js/experiment.js
- Add any additional experiment files and dependencies to templates/exp.html.
- Run
make dev
to preview your experiment (or use your text editor's preview function). - Edit, refresh, edit, refresh, edit, refresh....
Important note: Make sure you have disabled the browser cache, either using the developer tools of your browser or by using another tool (such as a live preview tool) that does this for you. Otherwise, you will see an old version of the code in your browser.
From here, you can probably figure out everything else by looking at the code in static/js/experiment.js and static/js/intructions.js. However, you may want to quickly skim the rest of this section before you do that.
This template makes heavy use of asynchronous programming (Promises), using the async/await
syntax. If this is new to you, you should look at this tutorial before diving in. Javascript async
support is incredible and it is exactly what you want when developing complex highly interactive web interfaces (e.g., experiments).
If you are familiar with javascript Promises, you'll notice that we do some non-standard things with them, like writing promise.reject()
. This is because we use the Deferred Promise design (see deferredPromise
in utils.js).
You can control many things about the experiment using URL parameters. For example: http://0.0.0.0:22363/?instruct=2&showSecretStage=true will show you the second stage of the instructions, setting PARAMS.showSecretStage = true
. You can also jump to a stage of the instructions by name instruct=introduce_task
or to a different block of the experiment with block=main
or you. This is extremeley useful when debugging an experiment, as you can imagine.
Important note: The first parameter must be preceeded by ?
and all others must be preceeded by &
.
Data is recorded with the DATA.recordEvent
function, e.g. DATA.recordEvent('trial.complete', {choice, rt})
. You can also use the DATA.setKeyValue
function for high-level
information that will go into the summary participants.csv file.
It's up to you how you want to handle data representation. Frameworks like jsPsych often batch up all the data for a trial into one object. You can do that if you want; just call DATA.recordEvent
at the end of each trial passing a big object with all the data. I prefer to just log everything that happens and then I worry about formatting it later. This is the safest way to ensure that you record everything you might need.
By default, data will not be saved when running locally. If you want to save data while debugging, follow these steps:
- Make sure you're using
make dev
and not viewing index.html as a static page. - Visit http://localhost:22363/test. The port (22363) is configured in config.txt. The critical addition is to add "/test" at the end of the URL.
- The fields necessary to store your data will be automatically added to the URL. Take note of or change the workerid (something like debug58523) if you wish. Note that if you use the same workerid twice, it will overwrite the previous data.
- The data will be saved to the local participants.db sqlite database.
- See the "Downloading Data" section below for details.
I have included general-purpose tools in the static/js/tools/ directory.
- inputs.js: classes/functions to handle all the standard user inputs (buttons, sliders, textboxes) ensuring that all interactions are recorded in the data
- alerts.js: modal alerts with personality, powered by sweetalert2
- component.js: defines the abstact
Component
class that handles a lot of the boilerplate associated with data recording, configuration, and trial sequencing. - instructions.js: navigable instructions that can gracefully handle complex user interactions on each page
- quiz.js: define a multiple-choice quiz in text format to put at the end of your instructions
- survey.js: define complex surveys with a JSON object that you can build with a gui
- canvas.js: basic drawing functionality
- status-bar: a trial- and point-counter with a help button
These tools might make it easier for you to quickly design experiments based on this template. However, you do not have to use them to use this template! If you just want to use this template to streamline the annoying backend stuff (data managment), then you can do that without using anything in tools/
First, update codeversion in config.txt. This is how the database knows to keep different versions of your study separate. What you do next depends on the recruitment service.
For your first pass, you should create the study with Prolific's web interface.
- Set the URL to.
https://<YOUR_APP_DOMAIN>.herokuapp.com/consent?mode=live&workerId={{%PROLIFIC_PID%}}&hitId=prolific&assignmentId={{%SESSION_ID%}}
. Make sure to replace<YOUR_APP_DOMAIN>
in the link with the current domain, which you can see with theheroku domains
command. - Make sure "I'll use URL parameters" is checked.
- Select "I'll redirect them using a URL". Copy the code and set it as
PROLIFIC_CODE
in experiment.js, e.g.const PROLIFIC_CODE = "6A5FDC7A"
. - As always, do a dry run with Prolific's "preview" mechanism before actually posting the study. I also recommend running only a couple people on your first go in case there are unforseen issues.
We also provide an alpha-release CLI for Prolific, using the Prolific API. Run bin/prolific.py
to see the available commands. The most useful ones are
pay
approves subbmissions and assigns bonuses using the bonus.csv file produced bybin/fetch_data.py
post_duplicate
posts a copy of your last study (as if you had used Prolific's "duplicate study" feature) with an updated name. You can update the pay and number of places in config.txt. It won't actually post the study without you confirming (after printing a link to preview it on Prolific).
You'll need to install two additional dependencies for this script: pip install markdown fire
. Note that if you run this command with the virtual environment active, then you'll need to activate the virtual environment running bin/prolific.py in the future. I recommend installing these packages in your global python environment.
Give participants with this format: https://<YOUR_APP_DOMAIN>.herokuapp.com/consent?mode=live&workerId=XXXX&hitId=XXXXX&assignmentId=XXXXX
. You can set hitId and assignmentId however you want. The critical thing is to make sure workerId is unique for each participant, so as not to overwrite data.
This has not been tested, but you should be able to redirect participants to SONA at the end of the experiment if you set the COMPLETION_LINK variable. For example, at NYU we would use something like this (changing the XXXX of course)
COMPLETION_LINK = `nyu-psych.sona-systems.com/webstudy_credit.aspx?experiment_id=XXXX&credit_token=XXXX&survey_code=${workerId}`
I haven't used MTurk in a while, so I'm not sure this actually works, but...
Start the psiturk shell with the command psiturk
. Run hit create 30 1.50 0.5
to create 30 hits, each of which pays $1.50 and has a 30 minute time limit. You'll get a warning about your server not running. You are using an external server process, so you can press y
to bypass the error message.
To download data for a given version run bin/fetch_data.py <VERSION>
. If you don't provide a version, it will use the current one in config.txt.
You will find the data in data/raw/[codeversion]/events/
. There is one file per participant. It is a json list with one object for every time you called DATA.recordEvent
. This data has identifiers and should not be shared. Make sure not to accidentally put it on github (data is in .gitignore so this shouldn't be a problem). The mapping from the anonymized "wid" to "workerid" is saved in data/raw//identifiers.csv.
What if you don't see the data? If you're looking for data that you generated while testing, make sure you used the /test URL as described above.
If you want to "download" data from the local participants.db database (if you were testing using make dev
, not on the live heroku page), then use bin/fetch_data.py --local
. If you want to include data you generated while testing the heroku site (using the /test URL), then use the --debug
flag. By default, bin/fetch_data.py
will not download data with "debug" in the workerId or assignmentId.
If you run bin/fetch_data.py --process
, the script will also create a directory data/process/[codeversion]/ with processed CSV files. The preprocessing is defined in bin/preprocessing.py. Obviously, you'll need to adjust this file to reflect the structure of your own experiment. However, if you implement your task using the Component
class, you will be able to reuse a lot of this code.
It is often useful to have a permanent link to different versions of the experiment. This is easy to do if you have your own personal website that you can rsync to. First set the relevant parameters in bin/post_static. Then you can run e.g. bin/post_static v1
.
I recommend working in VS Code or one of its clones. The Live Preview functionality integrates the developer console into your editor so you can click on error message and jump to the code. Also, LLMs are great at writing javascript, and you can often get a long way just by describing the task and asking it to write the code for you. If possible, you should use an LLM tool that integrates with your editor. This way, it will automatically write code that is consistent with the template. As of October 2024, I'm using Cursor and loving it.
Can I check how many participants there are without downloading the full dataset?
Yes. Use e.g. heroku pg:psql -c "select count(*) from participants where codeversion = 'v1'"
. You can also open an interactive SQL terminal with just heroku pg:psql
. Another useful query is select workerid,codeversion,cond,beginhit,endhit from participants order by beginhit desc;
Why are you still using jQuery / what year is it?
I have yet to find an option that is as easy as jQuery but doesn't require compiling your code (which in my experience makes server-side errors very difficult to debug). If you're interested in developing a version of the template that uses React/Vue/Alpine etc..., please get in touch though because I'm curious what this would look like.
Why are you using psiTurk when you run the study on Prolific?
The main reason I haven't ditched psiTurk is because it provides balanced condition assignment. I would love to ditch this dependency though, so if this is something you're interested in working on, let me know.
- Fred Callaway
- Carlos Correa