oskrim/finetune

Repository files navigation

Finetune_GPTNEO_GPTJ6B

Overview

This repo contains code to fine-tune GPT-J-6B on a famous-quotes dataset. Originally, the repo downloaded and converted the model weights itself, since GPT-J had not yet been added to the Hugging Face transformers package; that code is still available on the original_youtube branch.

/quotes_dataset contains the dataset, properly formatted for fine-tuning. See the repo used to create this dataset here
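As a rough illustration of what "properly formatted" might mean, here is a minimal sketch that round-trips a tiny CSV the way a fine-tuning script would read train.csv. The single "text" column and the sample quotes are assumptions for illustration; the actual column layout comes from the dataset repo linked above.

```python
import csv
import io

# Assumed layout: one "text" column per row, each row holding one quote.
# The real train.csv / validation.csv may use different columns.
rows = [
    {"text": "Be yourself; everyone else is already taken."},
    {"text": "The only true wisdom is in knowing you know nothing."},
]

# Write the rows as CSV in memory, then read them back the way a
# fine-tuning script would consume train.csv.
buf = io.StringIO()
writer = csv.DictWriter(buf, fieldnames=["text"])
writer.writeheader()
writer.writerows(rows)

buf.seek(0)
loaded = list(csv.DictReader(buf))
print(len(loaded), "rows loaded")
```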

/finetuning_repo contains code originally from the repo here, which I have modified to work with GPT-J-6B.

Walkthrough

See the video here for a tutorial on the original repo's code.

  1. First create a conda environment and activate it
  2. Run the ./install_requirements.sh script
  3. Copy train.csv and validation.csv from /quotes_dataset to the /finetuning_repo folder
  4. Run the fine-tuning code with the appropriate flags to fine-tune the model. See example_run.txt for an example invocation
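The steps above might look like the following shell session. The environment name, Python version, script name, and flags are all illustrative assumptions, not the repo's actual values; example_run.txt has the real invocation.

```shell
# Illustrative only: env name and Python version are assumptions.
conda create -n gptj-finetune python=3.8 -y
conda activate gptj-finetune

# Install dependencies with the repo's script.
./install_requirements.sh

# Copy the formatted dataset next to the fine-tuning code.
cp quotes_dataset/train.csv quotes_dataset/validation.csv finetuning_repo/

# Run fine-tuning; replace <finetune_script.py> and the flags with the
# actual command shown in example_run.txt.
cd finetuning_repo
python <finetune_script.py> --train_file train.csv --validation_file validation.csv
```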
