Repository for the paper produced for the course IN5490.
Project paper can be found at doc/paper.pdf.
Requires python>=3.10. To install the necessary packages, do the following after cloning the repository:
$ python3 -m venv env
$ source env/bin/activate
$ pip install -r requirements.txt
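If the install step fails, a common cause is an interpreter older than the required 3.10. The following is a minimal sketch for checking this from inside the activated environment. The helper name version_ok and the constant MIN_VERSION are illustrative and not part of this repository.

```python
import sys

# Minimum version stated in this README.
MIN_VERSION = (3, 10)


def version_ok(version_info=sys.version_info, minimum=MIN_VERSION):
    """Return True if the (major, minor) part of version_info meets the minimum."""
    return tuple(version_info[:2]) >= tuple(minimum)


if __name__ == "__main__":
    found = ".".join(str(part) for part in sys.version_info[:3])
    if version_ok():
        print(f"Python {found} is new enough")
    else:
        wanted = ".".join(str(part) for part in MIN_VERSION)
        print(f"Python {found} is too old, need >= {wanted}")
```

Run it with the python3 from the activated env so that the check applies to the interpreter the project will actually use.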
You will also need the Llama2-7B-chat model. To download it, you need an access key from Meta, which can be acquired by filling out this form. After receiving the email, run download.sh and select 7B-chat.
Change the parameters in src/hyperparams.json to what you want to test. Run ./start.sh ppo.py. Run python3 plot_rewards.py to see the results afterwards. You need a GPU with 24GB of VRAM.
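Before starting a run, it can help to confirm that the machine meets the VRAM requirement. The sketch below assumes an NVIDIA GPU with the nvidia-smi tool on the PATH; it is not part of this repository, and the function names are illustrative. The threshold of 24000 MiB is an assumed approximation of "24GB", chosen because cards sold as 24GB often report slightly less than 24576 MiB.

```python
import shutil
import subprocess

# Assumption: rough threshold for a "24GB" card, in MiB as reported by nvidia-smi.
REQUIRED_MIB = 24_000


def parse_memory_mib(csv_text):
    """Parse one integer (MiB) per non-empty line of nvidia-smi CSV output."""
    return [int(line.strip()) for line in csv_text.splitlines() if line.strip()]


def query_gpu_memory_mib():
    """Return total memory per GPU in MiB, or an empty list if it cannot be queried."""
    if shutil.which("nvidia-smi") is None:
        return []
    result = subprocess.run(
        ["nvidia-smi", "--query-gpu=memory.total", "--format=csv,noheader,nounits"],
        capture_output=True,
        text=True,
        check=False,
    )
    if result.returncode != 0:
        return []
    return parse_memory_mib(result.stdout)


def has_enough_vram(memory_mib, required=REQUIRED_MIB):
    """Return True if at least one GPU reports the required amount of memory."""
    return any(total >= required for total in memory_mib)


if __name__ == "__main__":
    gpus = query_gpu_memory_mib()
    if not gpus:
        print("No GPU information available (is nvidia-smi installed?)")
    elif has_enough_vram(gpus):
        print(f"Found a GPU with enough memory: {gpus} MiB")
    else:
        print(f"No GPU meets the requirement: {gpus} MiB")
```

The check only looks at total memory, so a GPU that is already in use by another process may still run out of memory during training.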