Fine-tune RoBERTa/DeBERTa models to predict emotion labels on text data

This project sets up a pipeline for fine-tuning pretrained large language models to automatically predict emotion labels on text data. Users can customize hyperparameters in config.py and tuner.py. This work is part of a dissertation project examining the moment-to-moment dynamics of suicide-related emotions and suicide risk. The target emotions are guilt, shame, loneliness, anger, sadness, depression, and anxiety.

Models include Facebook AI RoBERTa and Microsoft DeBERTa (base and/or large, ranging from roughly 100M to 300M+ parameters). Hyperparameter tuning is performed with the Microsoft NNI (Neural Network Intelligence) toolkit 2.5. Computational cost is reduced through distributed training on multiple GPUs, automatic mixed precision (AMP) training, and low-rank adaptation (LoRA).
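To make the LoRA and AMP parts concrete, the following is a minimal sketch of how such a setup is typically wired together, assuming the Hugging Face transformers and peft libraries; the rank, learning rate, label count, and other values here are illustrative assumptions, not the repository's actual settings.

# Hypothetical sketch (not the repository's train.py): a pretrained encoder
# wrapped with a LoRA adapter, with mixed-precision (AMP) training enabled.
from transformers import AutoModelForSequenceClassification, AutoTokenizer, TrainingArguments
from peft import LoraConfig, get_peft_model, TaskType

model_name = "facebookAI/roberta-base"  # one of the models listed in the tuner search space
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name, num_labels=7)

# Low-rank adaptation: only the small adapter matrices are updated during fine-tuning.
lora_config = LoraConfig(task_type=TaskType.SEQ_CLS, r=8, lora_alpha=16, lora_dropout=0.1)
model = get_peft_model(model, lora_config)

# fp16=True turns on automatic mixed precision; multi-GPU distributed training
# is handled by launching the script with torchrun or accelerate.
training_args = TrainingArguments(
    output_dir="checkpoints",
    per_device_train_batch_size=16,
    learning_rate=2e-5,
    num_train_epochs=3,
    fp16=True,
)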

Special thanks to Maitrey Mehta and Mattia Medina-Grespan for their suggestions on the code.

Clone the repository

git clone https://github.com/XinY-Z/bert-text2emo

Installation

pip install -r requirements.txt

Data

The data used in this project came from the following datasets:

Usage

First, customize the config.py file to your own data.

config['train_path'] = <your training data path>
config['dev_path'] = <your development data path>
config['test_path'] = <your test data path>
config['x'] = <your text column>
config['y'] = <your label column>
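For reference, a filled-in config.py might look like the sketch below; the file paths and column names are placeholder assumptions for illustration only.

# Hypothetical config.py sketch; only the keys shown above are documented,
# the paths and column names are illustrative placeholders.
config = {
    'train_path': 'data/train.csv',
    'dev_path': 'data/dev.csv',
    'test_path': 'data/test.csv',
    'x': 'text',     # name of the text column
    'y': 'emotion',  # name of the label column
}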

Second, customize the tuner.py file with your own hyperparameter tuning settings. For example:

# specify the search space of the models
experiment.config.search_space = {
    "model_name": {
        "_type": "choice",
        "_value": ["facebookAI/roberta-base", "facebookAI/roberta-large", "microsoft/deberta-base", "microsoft/deberta-large"]
    },
}
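For context, the NNI 2.x experiment API wraps a search space like the one above in an Experiment object roughly as sketched below; the trial command, tuner choice, trial counts, and the extra learning-rate entry are illustrative assumptions, not the repository's actual tuner.py.

# Hypothetical sketch of an NNI experiment around the search space above.
from nni.experiment import Experiment

experiment = Experiment('local')
experiment.config.trial_command = 'python3 train.py'  # trial script reporting metrics via nni.report_final_result
experiment.config.trial_code_directory = '.'
experiment.config.search_space = {
    "model_name": {
        "_type": "choice",
        "_value": ["facebookAI/roberta-base", "facebookAI/roberta-large", "microsoft/deberta-base", "microsoft/deberta-large"]
    },
    "lr": {"_type": "loguniform", "_value": [1e-5, 1e-4]},
}
experiment.config.tuner.name = 'TPE'
experiment.config.tuner.class_args = {'optimize_mode': 'maximize'}
experiment.config.max_trial_number = 20
experiment.config.trial_concurrency = 1

experiment.run(8080)  # serves the monitoring web UI at http://localhost:8080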

Third, run the tuner.py file to start hyperparameter tuning.

python3 tuner.py
# This will open a web interface to monitor the training process

Find the best hyperparameters from the NNI web interface and update the config.py file.

Fourth, run the train.py file to train the model with the best hyperparameters.

python3 train.py

Finally, check the model performance on the test set by running the evaluate.py file.

python3 evaluate.py
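An evaluation step of this kind typically reports per-label and macro-averaged scores on the test set; the sketch below shows one common way to do that with scikit-learn, assuming arrays of gold and predicted labels, and is not the repository's actual evaluate.py.

# Hypothetical test-set scoring sketch; y_true/y_pred stand in for the gold
# labels and the fine-tuned model's predictions.
from sklearn.metrics import classification_report, f1_score

y_true = ["anger", "sadness", "guilt"]   # gold labels (placeholder)
y_pred = ["anger", "sadness", "shame"]   # model predictions (placeholder)

print(classification_report(y_true, y_pred, zero_division=0))
print("macro F1:", f1_score(y_true, y_pred, average="macro", zero_division=0))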
