This repository contains code for data preparation, training, and evaluation of various classifiers on the GoEmotions dataset.
Use this file to see how the original 28-class problem was mapped to a 6-class problem.
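As a rough illustration, such a mapping can be expressed as a dictionary from coarse Ekman-style categories to the fine-grained GoEmotions labels. The grouping below is only a sketch; the authoritative mapping is the one in this file.

```python
# Illustrative Ekman-style grouping; the exact 28 -> 6 mapping lives in this file.
EKMAN_MAP = {
    "anger": ["anger", "annoyance", "disapproval"],
    "disgust": ["disgust"],
    "fear": ["fear", "nervousness"],
    "joy": ["admiration", "amusement", "approval", "caring", "desire",
            "excitement", "gratitude", "joy", "love", "optimism",
            "pride", "relief"],
    "sadness": ["disappointment", "embarrassment", "grief", "remorse", "sadness"],
    "surprise": ["confusion", "curiosity", "realization", "surprise"],
}

# Invert to a fine-grained -> coarse lookup for relabeling examples.
FINE_TO_COARSE = {fine: coarse
                  for coarse, fines in EKMAN_MAP.items()
                  for fine in fines}
```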
This file implements a sequence classification pipeline with HuggingFace Transformers. It also includes a hyperparameter search built on the Ray library and the HuggingFace Trainer module.
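A minimal sketch of that setup, assuming an encoder classifier such as `bert-base-uncased`, the `simplified` GoEmotions config, and a single-label reduction (model name, search space, and simplification are illustrative, not the repo's exact settings):

```python
from datasets import load_dataset
from ray import tune
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

MODEL_NAME = "bert-base-uncased"  # assumption: any encoder classifier works here

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
raw = load_dataset("go_emotions", "simplified")

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True,
                     padding="max_length", max_length=128)

# GoEmotions is multi-label; this sketch keeps only the first label per example.
ds = raw.map(tokenize, batched=True)
ds = ds.map(lambda ex: {"labels": ex["labels"][0]})

def model_init():
    # Trainer re-creates the model from scratch for every Ray trial.
    return AutoModelForSequenceClassification.from_pretrained(MODEL_NAME,
                                                              num_labels=28)

def hp_space(_):
    return {
        "learning_rate": tune.loguniform(1e-5, 5e-4),
        "per_device_train_batch_size": tune.choice([16, 32]),
        "num_train_epochs": tune.choice([2, 3]),
    }

trainer = Trainer(
    model_init=model_init,
    args=TrainingArguments(output_dir="hp_search", evaluation_strategy="epoch"),
    train_dataset=ds["train"],
    eval_dataset=ds["validation"],
)
best_run = trainer.hyperparameter_search(hp_space=hp_space, backend="ray",
                                         n_trials=10)
print(best_run.hyperparameters)
```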
This file contains Lightning modules for data preparation and training with the Lightning framework. I used 4-bit quantization from bitsandbytes and the peft library from HuggingFace to train a 13-billion-parameter model on a 16GB GPU.
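The core of that recipe looks roughly like the sketch below (the LoRA rank, alpha, dropout, and target modules are illustrative values, not necessarily the ones used in this repo):

```python
import torch
from peft import LoraConfig, get_peft_model, prepare_model_for_kbit_training
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_use_double_quant=True,
)

# The base weights are loaded in 4-bit; only the LoRA adapters are trained.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-13b-hf",  # gated model; requires HF access approval
    quantization_config=bnb_config,
    device_map="auto",
)
model = prepare_model_for_kbit_training(model)

lora_config = LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,  # illustrative hyperparameters
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # adapters are a tiny fraction of 13B
```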
This file is used together with the previous file to run supervised finetuning under various parameter settings.
I extract the adapters from the checkpoints created by supervised_finetuning.py. This saves space because the base Llama2-13B model is frozen during supervised finetuning and therefore doesn't need to be saved.
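The idea reduces to filtering the LoRA weights out of a full checkpoint; a minimal sketch, with hypothetical paths (the actual checkpoint layout depends on the Lightning run):

```python
import torch

# Hypothetical paths; adjust to the actual Lightning checkpoint layout.
ckpt = torch.load("checkpoints/sft-step-1000.ckpt", map_location="cpu")
state_dict = ckpt.get("state_dict", ckpt)

# Keep only the LoRA adapter weights; the frozen Llama2-13B base is dropped.
adapter_state = {k: v for k, v in state_dict.items() if "lora_" in k}
torch.save(adapter_state, "adapters/sft-step-1000.pt")
```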
Here I load the adapters saved in the previous step into the Llama2-13B model and generate text for the validation split of the GoEmotions dataset. The module takes command-line arguments to select among checkpoints.
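A minimal sketch of attaching a saved adapter and generating, assuming the adapter was saved in peft's `save_pretrained` format (the adapter path is a placeholder for whatever checkpoint is chosen on the command line):

```python
import torch
from peft import PeftModel
from transformers import AutoModelForCausalLM, AutoTokenizer

base = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-13b-hf", torch_dtype=torch.bfloat16, device_map="auto"
)
# Placeholder adapter directory, selected via command-line arguments in the repo.
model = PeftModel.from_pretrained(base, "adapters/sft-step-1000")
tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-13b-hf")

inputs = tokenizer("I can't believe this actually worked!",
                   return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=16)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```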
This is a short script that calls the OpenAI text completion API to generate labels for the validation split in a zero-shot fashion.
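The script's exact prompt and model are not shown here; the sketch below assumes the `gpt-3.5-turbo-instruct` completion model, the openai>=1.0 client, and a hypothetical 6-label set:

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

LABELS = "anger, disgust, fear, joy, sadness, surprise"  # assumed label set

def classify(text: str) -> str:
    # Zero-shot: the prompt names the label set, no examples are given.
    prompt = (
        f"Classify the emotion of the following text as one of: {LABELS}.\n"
        f"Text: {text}\nEmotion:"
    )
    resp = client.completions.create(
        model="gpt-3.5-turbo-instruct",  # assumption: any completion model works
        prompt=prompt,
        max_tokens=4,
        temperature=0,
    )
    return resp.choices[0].text.strip()

print(classify("I can't believe this actually worked!"))
```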
Here I list the samples generated by the GPT-3.5-Turbo and finetuned Llama2-13B models. The model predictions are also available in pkl format.
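Loading the pkl predictions is a one-liner (the filename below is a placeholder):

```python
import pickle

# Placeholder filename; substitute the actual prediction file.
with open("predictions/llama2_13b_val.pkl", "rb") as f:
    preds = pickle.load(f)
print(type(preds), len(preds))
```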
A notebook to visualize a PyTorch model with adapter modules.
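One simple way to inspect where the adapters sit, sketched here on a small stand-in model (gpt2 with a LoRA config on `c_attn`) so it runs cheaply; the notebook targets the real model:

```python
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

# Small stand-in model; the same loop works on the full finetuned model.
model = get_peft_model(
    AutoModelForCausalLM.from_pretrained("gpt2"),
    LoraConfig(r=8, target_modules=["c_attn"], task_type="CAUSAL_LM"),
)
for name, module in model.named_modules():
    if "lora" in name:
        print(name, type(module).__name__)
```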
A folder with various debugging scripts and notebooks.
A file with the commands I used for supervised finetuning.