Breast Cancer Prediction Using Machine Learning

Overview

This project predicts breast cancer diagnoses (Malignant or Benign) from tumor characteristics using machine learning models. The workflow covers data preprocessing, training several classifiers, and AutoML-based model comparison and tuning with PyCaret.

Dataset

The dataset contains tumor features such as radius, texture, and perimeter, with the target variable being the diagnosis (Malignant or Benign). The dataset can be accessed here: https://www.kaggle.com/uciml/breast-cancer-wisconsin-data

Workflow

Data Preprocessing:
- Cleaned data by dropping missing values and encoding diagnosis labels (M → 1, B → 0).
- Standardized features using StandardScaler.
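
The preprocessing steps above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the three-column DataFrame is a hypothetical stand-in for the Kaggle CSV, and the `preprocess` helper is a name introduced here for the example.

```python
import numpy as np
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Tiny hypothetical stand-in for the Kaggle CSV (the real file has many more columns).
raw = pd.DataFrame({
    "diagnosis": ["M", "B", "B", "M", "B"],
    "radius_mean": [17.99, 13.54, None, 20.57, 12.45],
    "texture_mean": [10.38, 14.36, 15.71, 17.77, 15.70],
})


def preprocess(df):
    """Drop incomplete rows, encode the diagnosis, and standardize the features."""
    clean = df.dropna().copy()                         # remove rows with missing values
    y = clean["diagnosis"].map({"M": 1, "B": 0})       # M -> 1, B -> 0
    X = clean.drop(columns=["diagnosis"])
    scaled = StandardScaler().fit_transform(X)         # zero mean, unit variance per column
    X_scaled = pd.DataFrame(scaled, columns=X.columns, index=X.index)
    return X_scaled, y


X_scaled, y = preprocess(raw)
print(X_scaled.round(2))
print(y.tolist())
```

Two practical notes: if a loaded file contains a column that is empty throughout, a row-wise `dropna()` would discard every row, so such a column would need to be dropped first (for example with `dropna(axis=1, how="all")`). Also, when a train/test split is used, fitting the scaler on the training portion only avoids leaking test-set statistics.
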
Model Training: Trained multiple models, including:
- Ridge Classifier
- AdaBoost Classifier
- Extra Trees Classifier
- Decision Tree Classifier
- Random Forest Classifier
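
As a rough sketch of training the five classifiers listed above with scikit-learn: the example below uses scikit-learn's bundled copy of the Wisconsin diagnostic data instead of the Kaggle CSV, and the 80/20 split, default hyperparameters, and `random_state` values are assumptions made for illustration, not settings taken from the notebook.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import (
    AdaBoostClassifier,
    ExtraTreesClassifier,
    RandomForestClassifier,
)
from sklearn.linear_model import RidgeClassifier
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier

data = load_breast_cancer()
X = data.data
# scikit-learn encodes malignant as 0 and benign as 1; flip to match M -> 1, B -> 0.
y = 1 - data.target

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

models = {
    "Ridge Classifier": RidgeClassifier(),
    "AdaBoost Classifier": AdaBoostClassifier(random_state=42),
    "Extra Trees Classifier": ExtraTreesClassifier(random_state=42),
    "Decision Tree Classifier": DecisionTreeClassifier(random_state=42),
    "Random Forest Classifier": RandomForestClassifier(random_state=42),
}

scores = {}
for name, model in models.items():
    # The scaler is fitted inside the pipeline, i.e. on the training split only.
    pipe = make_pipeline(StandardScaler(), model)
    pipe.fit(X_train, y_train)
    scores[name] = pipe.score(X_test, y_test)  # accuracy on the held-out split

for name, acc in sorted(scores.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name}: {acc:.3f}")
```
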
AutoML with PyCaret:
- Compared models using PyCaret's setup() and compare_models() functions.
- Tuned the best-performing model (Random Forest) to maximize accuracy.
- Visualized results with confusion matrices and feature importance plots.
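
The PyCaret steps above might look something like the sketch below. The `run_automl` wrapper, the `diagnosis` target column name, and `session_id=42` are assumptions made for this example; the notebook's actual arguments may differ. The model identifiers are PyCaret's short names for the five classifiers listed earlier.

```python
import pandas as pd

# PyCaret's short identifiers for the five estimators listed in the workflow.
MODEL_IDS = ["ridge", "ada", "et", "dt", "rf"]


def run_automl(df: pd.DataFrame, target: str = "diagnosis"):
    """Compare the candidate models, tune the best one, and plot diagnostics."""
    # Imported lazily so this module can be loaded without PyCaret installed.
    from pycaret.classification import (
        compare_models,
        plot_model,
        setup,
        tune_model,
    )

    setup(data=df, target=target, session_id=42)            # initialize the experiment
    best = compare_models(include=MODEL_IDS, sort="Accuracy")
    tuned = tune_model(best, optimize="Accuracy")           # hyperparameter search
    plot_model(tuned, plot="confusion_matrix")
    plot_model(tuned, plot="feature")                       # feature importance plot
    return tuned
```

Calling `run_automl(df)` on the preprocessed DataFrame would return the tuned estimator; which model wins the comparison depends on the data and the random seed.
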
Model Evaluation: The Random Forest model achieved the best performance with high precision, recall, and F1-score. The model was saved for future predictions.
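
One way to compute the metrics named above and persist the model is sketched here with scikit-learn and joblib. This is an assumption about tooling, not a record of what the notebook does: the project may have saved the model through PyCaret instead, and the file name, split, and hyperparameters below are hypothetical. No scores are quoted because they depend on the split and seed.

```python
import os
import tempfile

import joblib
from sklearn.datasets import load_breast_cancer
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import confusion_matrix, precision_recall_fscore_support
from sklearn.model_selection import train_test_split

data = load_breast_cancer()
# Flip scikit-learn's encoding so that malignant = 1 and benign = 0.
X, y = data.data, 1 - data.target

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, stratify=y, random_state=42
)

model = RandomForestClassifier(n_estimators=200, random_state=42)
model.fit(X_train, y_train)
y_pred = model.predict(X_test)

# Precision, recall, and F1 for the malignant class (label 1).
precision, recall, f1, _ = precision_recall_fscore_support(
    y_test, y_pred, average="binary", pos_label=1
)
cm = confusion_matrix(y_test, y_pred)  # rows: true label, columns: predicted label
print(f"precision={precision:.3f} recall={recall:.3f} f1={f1:.3f}")
print(cm)

# Save the fitted model and load it back for later predictions.
model_path = os.path.join(tempfile.gettempdir(), "breast_cancer_rf.joblib")  # hypothetical name
joblib.dump(model, model_path)
reloaded = joblib.load(model_path)
```
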

Technologies Used

Python, Pandas, NumPy, Scikit-learn, PyCaret
Visualization: Matplotlib, Seaborn

Results

The tuned Random Forest model predicted breast cancer diagnoses with high accuracy on this dataset, illustrating how machine learning can support diagnostic tasks in healthcare.

For more information, contact me at:

Email: [email protected]
LinkedIn: https://www.linkedin.com/in/himani-s1001/
