
Sentiment-with-LLMs

Sentiment Classification through a pre-train and fine-tune BERT paradigm

About

Pre-trained language models lack task-specific knowledge for downstream tasks such as sentiment classification, so continuing masked language modeling while preferentially masking sentiment-related tokens can inject that knowledge. These sentiment tokens can be identified in various ways, e.g. topic modeling, TF-IDF, or manual curation. This code computes them with the log-odds ratio (a sketch of the token selection follows below), masks 15% of input tokens accordingly, and then fine-tunes the model on the training set. It uses the twitter_samples dataset provided by NLTK and achieves an F1 score of around 0.7.
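
A minimal sketch of the token-selection step, assuming simple whitespace tokenization and a smoothed log-odds ratio over NLTK's positive/negative tweet sets; the top-k cutoff (500) is an illustrative choice, not necessarily this repository's setting:

```python
import math
from collections import Counter

import nltk
from nltk.corpus import twitter_samples

nltk.download("twitter_samples")

# Lowercased whitespace tokens from the positive and negative tweet sets.
pos_tokens = [w.lower() for s in twitter_samples.strings("positive_tweets.json") for w in s.split()]
neg_tokens = [w.lower() for s in twitter_samples.strings("negative_tweets.json") for w in s.split()]

pos_counts, neg_counts = Counter(pos_tokens), Counter(neg_tokens)
pos_total, neg_total = sum(pos_counts.values()), sum(neg_counts.values())
vocab = set(pos_counts) | set(neg_counts)

def log_odds(word, alpha=0.01):
    # Smoothed log-odds ratio of a word appearing in positive vs. negative tweets.
    p = (pos_counts[word] + alpha) / (pos_total + alpha * len(vocab))
    n = (neg_counts[word] + alpha) / (neg_total + alpha * len(vocab))
    return math.log(p / n)

# Tokens with the largest |log-odds| are the most sentiment-related.
sentiment_tokens = sorted(vocab, key=lambda w: abs(log_odds(w)), reverse=True)[:500]
```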
This approach is inspired by the Knowledge Enhanced Masked Language Model paper.
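
Following that idea, the sentiment-guided masking could look like the sketch below, using Hugging Face transformers and reusing `sentiment_tokens` from the step above. `mask_inputs` is a hypothetical helper; falling back to random positions when a tweet contains few sentiment tokens, and matching whole-word pieces only, are assumptions of this sketch rather than the repository's exact logic:

```python
import random

import torch
from transformers import BertForMaskedLM, BertTokenizerFast

tokenizer = BertTokenizerFast.from_pretrained("bert-base-uncased")
model = BertForMaskedLM.from_pretrained("bert-base-uncased")

def mask_inputs(text, sentiment_vocab, mask_prob=0.15):
    # Mask ~15% of tokens, preferring positions whose wordpiece is sentiment-related.
    enc = tokenizer(text, return_tensors="pt", truncation=True)
    input_ids = enc["input_ids"][0]
    tokens = tokenizer.convert_ids_to_tokens(input_ids)
    candidates = [i for i, t in enumerate(tokens) if t in sentiment_vocab]
    n_mask = max(1, int(mask_prob * len(tokens)))
    if len(candidates) < n_mask:
        # Too few sentiment tokens in this example: fall back to random
        # positions, skipping [CLS] and [SEP].
        rest = [i for i in range(1, len(tokens) - 1) if i not in candidates]
        candidates += random.sample(rest, min(len(rest), n_mask - len(candidates)))
    chosen = random.sample(candidates, min(len(candidates), n_mask))
    labels = torch.full_like(input_ids, -100)  # loss only on masked positions
    for i in chosen:
        labels[i] = input_ids[i]               # predict the original token
        input_ids[i] = tokenizer.mask_token_id # replace it with [MASK] in place
    return enc, labels.unsqueeze(0)

enc, labels = mask_inputs("what a gorgeous , happy day :)", set(sentiment_tokens))
loss = model(**enc, labels=labels).loss  # one continued pre-training (MLM) step
loss.backward()
```

After this continued pre-training pass, the classifier itself is fine-tuned on the training set in the standard way (e.g. a BertForSequenceClassification head on the same checkpoint), which yields the F1 of around 0.7 reported above.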
