Transformer-based Reinforcement Learning

Final Project COMP2050 - VinUniversity

Team: Tran Quoc Bao, Tran Huy Hoang Anh, Le Chi Cuong

Description:

Goal 1: Implement a minimal version of the Decision Transformer model to play the Atari Breakout game
Goal 2: Train the model on multiple environments and test its ability to generalize to new distributions
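
A Decision Transformer conditions each action on the return-to-go, i.e. the sum of rewards still to be collected from the current timestep onward. The snippet below is a minimal sketch of that quantity, not the code used in this repository; the function name `returns_to_go` and the optional `gamma` discount are assumptions made for illustration.

```python
def returns_to_go(rewards, gamma=1.0):
    """Return a list whose t-th entry is sum_{k >= t} gamma**(k - t) * rewards[k].

    Hypothetical helper for illustration. gamma=1.0 gives the plain
    (undiscounted) return-to-go.
    """
    rtg = [0.0] * len(rewards)
    running = 0.0
    # Walk the trajectory backwards so each step reuses the suffix sum.
    for t in reversed(range(len(rewards))):
        running = rewards[t] + gamma * running
        rtg[t] = running
    return rtg


if __name__ == "__main__":
    # Toy trajectory: two rewarded steps out of four.
    print(returns_to_go([0.0, 1.0, 0.0, 4.0]))  # [5.0, 5.0, 4.0, 4.0]
```

At evaluation time the first entry is typically replaced by a chosen target return, and the observed reward is subtracted from it after every step.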

How to run

Create a new environment

conda env create -f environment.yml
conda activate transformer-based-rl

To use the Atari environments, you must first import the Atari ROMs by following this instruction

cd data
pip install git+https://github.com/takuseno/d4rl-atari
python download_dataset.py --mix_games False

Use --mix_games True to download the synthetic dataset used for the distribution shift experiment.

Data options: mixed, medium, expert
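
As a rough sketch of how a data option maps to a dataset, d4rl-atari registers Gym environments with IDs of the form game-option-vN, for example breakout-mixed-v0. The helper names `dataset_env_id` and `load_dataset` below are hypothetical, and whether download_dataset.py builds the ID this way is an assumption.

```python
DATA_OPTIONS = ("mixed", "medium", "expert")


def dataset_env_id(game, option, index=0):
    """Build a d4rl-atari style environment ID such as 'breakout-mixed-v0'.

    Hypothetical helper; `index` selects one of the dataset variants.
    """
    if option not in DATA_OPTIONS:
        raise ValueError(f"unknown data option {option!r}, expected one of {DATA_OPTIONS}")
    return f"{game}-{option}-v{index}"


def load_dataset(game="breakout", option="expert", index=0):
    """Load one offline dataset through d4rl-atari (downloads it on first use)."""
    # Imported lazily so the ID helper works without gym installed.
    import gym
    import d4rl_atari  # noqa: F401  (importing registers the environments)

    env = gym.make(dataset_env_id(game, option, index))
    return env.get_dataset()


if __name__ == "__main__":
    print(dataset_env_id("breakout", "mixed"))  # breakout-mixed-v0
```

Calling `load_dataset` requires the ROMs and the d4rl-atari package from the steps above.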
