In this NLP project, we perform multiclass text classification using a pre-trained BERT model. The dataset consists of more than two million customer complaints about consumer financial products, with columns for the complaint text and the product label.
The goal is to leverage BERT (Bidirectional Encoder Representations from Transformers), an open-source, transformer-based language model for Natural Language Processing, to achieve state-of-the-art results in multiclass text classification.
The task is to predict the product category from the complaint text.
- Language: Python
- Libraries: pandas, torch, nltk, numpy, tqdm, scikit-learn, transformers (plus the standard-library pickle and re modules)
- Install the required packages listed in requirements.txt (including torch and transformers)
- Understanding of Multiclass Text Classification using Naive Bayes
- Familiarity with Skip Gram Model for Word Embeddings
- Knowledge of building Multi-Class Text Classification Models with RNN and LSTM
- Understanding of Text Classification Models with Attention Mechanisms in NLP
Data Processing
- Read the CSV file, handle null values, encode the product labels, and preprocess the complaint text (a sketch follows).
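A minimal sketch of this step is below. The column names (`narrative`, `product`) are assumptions, and the actual preprocessing in processing.py may differ (e.g., it may also use nltk for stopword removal).

```python
import pickle
import re

import pandas as pd
from sklearn.preprocessing import LabelEncoder

df = pd.read_csv("complaints.csv")
# Column names are assumed; adjust to the actual CSV schema.
df = df.dropna(subset=["narrative", "product"])  # drop rows missing text or label

def clean_text(text: str) -> str:
    """Lowercase and strip non-alphanumeric characters."""
    text = text.lower()
    text = re.sub(r"[^a-z0-9\s]", " ", text)
    return re.sub(r"\s+", " ", text).strip()

df["narrative"] = df["narrative"].apply(clean_text)

# Encode product labels as integers and persist the encoder for inference.
encoder = LabelEncoder()
df["label"] = encoder.fit_transform(df["product"])
with open("label_encoder.pkl", "wb") as f:
    pickle.dump(encoder, f)
```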
Model Building
- Create the BERT classification model, define the dataset class, and write the train and test functions (a minimal sketch of the model follows).
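Below is a minimal sketch of the classifier, assuming a `bert-base-uncased` backbone with a dropout layer and a linear head; the actual architecture in model.py may differ.

```python
import torch.nn as nn
from transformers import BertModel

class BertClassifier(nn.Module):
    """Pre-trained BERT encoder with a linear classification head (a sketch)."""

    def __init__(self, num_classes: int, dropout: float = 0.3):
        super().__init__()
        self.bert = BertModel.from_pretrained("bert-base-uncased")
        self.dropout = nn.Dropout(dropout)
        # The pooled [CLS] representation feeds the classification layer.
        self.fc = nn.Linear(self.bert.config.hidden_size, num_classes)

    def forward(self, input_ids, attention_mask):
        outputs = self.bert(input_ids=input_ids, attention_mask=attention_mask)
        return self.fc(self.dropout(outputs.pooler_output))
```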
Training
- Load the data, split it into train and test sets, and create datasets and loaders.
- Train the BERT model on GPU if available, otherwise on CPU (see the sketch after this list).
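The sketch below ties the pieces together, assuming the preprocessed DataFrame `df` and the `BertClassifier` from the earlier snippets; the hyperparameters (sequence length, batch size, learning rate) are illustrative, not the project's actual settings, and evaluation on the test loader is omitted for brevity.

```python
import torch
from sklearn.model_selection import train_test_split
from torch.utils.data import DataLoader, Dataset
from transformers import BertTokenizer

class ComplaintDataset(Dataset):
    """Tokenizes one complaint per item so the full corpus never sits in memory."""

    def __init__(self, texts, labels, tokenizer, max_len: int = 128):
        self.texts = list(texts)
        self.labels = list(labels)
        self.tokenizer = tokenizer
        self.max_len = max_len

    def __len__(self):
        return len(self.labels)

    def __getitem__(self, idx):
        enc = self.tokenizer(self.texts[idx], truncation=True,
                             padding="max_length", max_length=self.max_len,
                             return_tensors="pt")
        return (enc["input_ids"].squeeze(0),
                enc["attention_mask"].squeeze(0),
                torch.tensor(self.labels[idx]))

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")

train_texts, test_texts, train_labels, test_labels = train_test_split(
    df["narrative"], df["label"], test_size=0.2, random_state=42)

train_loader = DataLoader(ComplaintDataset(train_texts, train_labels, tokenizer),
                          batch_size=16, shuffle=True)
test_loader = DataLoader(ComplaintDataset(test_texts, test_labels, tokenizer),
                         batch_size=16)

model = BertClassifier(num_classes=df["label"].nunique()).to(device)
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)
criterion = torch.nn.CrossEntropyLoss()

# One training epoch; the real Engine.py likely loops over several epochs.
model.train()
for input_ids, attention_mask, labels in train_loader:
    optimizer.zero_grad()
    logits = model(input_ids.to(device), attention_mask.to(device))
    loss = criterion(logits, labels.to(device))
    loss.backward()
    optimizer.step()

torch.save(model.state_dict(), "bert_pre_trained.pth")
```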
Predictions
- Make predictions on new complaint text, as sketched below.
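A minimal inference sketch, reusing `clean_text`, `tokenizer`, `model`, `encoder`, and `device` from the snippets above; the actual interface in predict.py may differ.

```python
import torch

def predict(text: str) -> str:
    """Return the predicted product category for a raw complaint string."""
    enc = tokenizer(clean_text(text), truncation=True, padding="max_length",
                    max_length=128, return_tensors="pt")
    model.eval()
    with torch.no_grad():
        logits = model(enc["input_ids"].to(device),
                       enc["attention_mask"].to(device))
    pred = logits.argmax(dim=1).item()
    # Map the integer class back to the original product name.
    return encoder.inverse_transform([pred])[0]

print(predict("I was charged twice for the same credit card transaction."))
```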
- Input: complaints.csv
- Output: bert_pre_trained.pth, label_encoder.pkl, labels.pkl, tokens.pkl
- Source: model.py, data.py, utils.py
- Files: Engine.py, bert.ipynb, processing.py, predict.py, README.md, requirements.txt
- Solving business problems using pre-trained models.
- Leveraging BERT for text classification.
- Data preparation and model training.
- Making predictions on new data.