Yuchen Cui*, Qiping Zhang*, Alessandro Allievi, Peter Stone, Scott Niekum, W. Bradley Knox
View paper on arXiv | Project Website
Overview of the EMPATHIC framework:
This repository contains code used to conduct experiments reported in the paper "The EMPATHIC Framework for Task Learning from Implicit Human Feedback" published at CoRL 2020.
If you find this repository useful in your research, please cite the paper:
@inproceedings{cui2020empathic,
  title={The EMPATHIC Framework for Task Learning from Implicit Human Feedback},
  author={Cui, Yuchen and Zhang, Qiping and Allievi, Alessandro and Stone, Peter and Niekum, Scott and Knox, W Bradley},
  booktitle={Conference on Robot Learning},
  year={2020},
  organization={PMLR}
}
To clone this repository along with its submodules, run:
git clone --recursive https://github.com/Pearl-UTexas/EMPATHIC.git
All modules require Python 3.6 or above.
To install all Python dependencies, run:
python -m pip install --upgrade -r requirements.txt
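Since all modules require Python 3.6 or above, you can confirm that your interpreter meets the requirement before installing (a generic check, not part of the repository):

```python
import sys

# Abort early with a clear message if the interpreter is too old.
version = ".".join(map(str, sys.version_info[:3]))
if sys.version_info < (3, 6):
    raise RuntimeError("EMPATHIC requires Python 3.6 or above, found " + version)
print("Python version OK:", version)
```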
Specify the path to your OpenFace installation in start_openface.bash.
Run online learning (a webcam that can see your face is required):
python online_learning.py
(You may need to kill the process manually after it finishes.)
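If online_learning.py does not exit on its own, one option is to launch it through a small wrapper that force-kills it after a time limit. The wrapper below is a generic sketch, not part of the repository, and the 10-minute limit in the usage comment is an arbitrary example:

```python
import subprocess
import sys


def run_with_timeout(cmd, timeout_s):
    """Run a command, force-killing it if it runs past timeout_s seconds."""
    proc = subprocess.Popen(cmd)
    try:
        proc.wait(timeout=timeout_s)
    except subprocess.TimeoutExpired:
        proc.kill()  # the session is over; stop the lingering process
        proc.wait()
    return proc.returncode


# Usage (allow the online-learning session at most 10 minutes):
#   run_with_timeout([sys.executable, "online_learning.py"], timeout_s=600)
```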
Download the pre-processed dataset from here, and extract the files into a directory called detected/.
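Each subject's processed data lives in a subdirectory of detected/ named after their subject ID. A small hypothetical helper (illustrative only, not part of the repository) to sanity-check that a subject's data is in place before training:

```python
import os


def has_processed_data(subject_id, data_root="detected"):
    """Return True if <data_root>/<subject_id>/ exists and is non-empty.

    Illustrative helper only; not part of the EMPATHIC repository.
    """
    subject_dir = os.path.join(data_root, subject_id)
    return os.path.isdir(subject_dir) and bool(os.listdir(subject_dir))
```

For example, has_processed_data("WkOsToXr9v") should return True once the dataset has been extracted.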
python train_mlp_net_facs.py WkOsToXr9v
This will generate a model file "WkOsToXr9v_[lowest_test_loss].pkl" of the trained MLP in the directory MLP_facs_reward_models/, for testing on the human subject data with ID "WkOsToXr9v".
Note that to train a model for another subject, the processed data for that subject must already exist in a subdirectory named after their ID under the detected/ folder.
Per-subject models are used for the random search over hyperparameters.
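Since the random search can leave several "subject-ID_test-loss.pkl" files in MLP_facs_reward_models/, a helper like the following can pick the one with the lowest test loss. This is a sketch, not part of the repository, and it assumes the filename suffix after the subject ID is the numeric test loss, as described for the output of train_mlp_net_facs.py:

```python
import glob
import os


def best_model_path(subject_id, model_dir="MLP_facs_reward_models"):
    """Return the saved model with the lowest test loss for one subject.

    Assumes filenames of the form '<subject_id>_<test_loss>.pkl'.
    """
    candidates = glob.glob(os.path.join(model_dir, subject_id + "_*.pkl"))
    if not candidates:
        raise FileNotFoundError(
            "no models for %s in %s" % (subject_id, model_dir))

    def test_loss(path):
        stem = os.path.splitext(os.path.basename(path))[0]
        return float(stem[len(subject_id) + 1:])  # part after '<id>_'

    return min(candidates, key=test_loss)
```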
python train_mlp_net_facsall.py
This will generate a model file "allsubjects_[lowest_test_loss].pkl" of the trained MLP in the directory MLP_facs_reward_models/.
The trained model is used for evaluating data in the holdout set.
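The saved .pkl files are Python pickles; a minimal loading sketch follows, assuming a standard pickle and that any classes referenced inside it (e.g. the repository's MLP class) are importable at load time:

```python
import pickle


def load_reward_model(path):
    """Load a trained reward model saved by the training scripts.

    Unpickling fails unless the classes stored in the file are
    importable, so run this from the repository root.
    """
    with open(path, "rb") as f:
        return pickle.load(f)
```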
python test_facs.py WkOsToXr9v
To play the Robotaxi game, update the environment, replay recorded trajectories, or collect new user-study data, refer to the Robotaxi repository.