GitHub - madhusudanladda/Cancer-Cell-Classifier-High-Acc: Machine Learning–based cancer cell classification using SVM with ~96% accuracy. Includes data preprocessing, feature scaling, and performance evaluation.

Cancer Cell Classifier – High Accuracy

A Machine Learning–based project for classifying cancerous and non-cancerous cells using Support Vector Machine (SVM). The model achieves high accuracy and is evaluated using standard performance metrics.

🎯 Motivation

Early detection of cancer plays a crucial role in saving lives.
This project aims to assist in identifying whether a cell is Benign or Malignant using Machine Learning techniques on medical data.

📁 Dataset Description

The dataset contains numerical features extracted from breast cell samples.

Features:

Clump Thickness
Uniformity of Cell Size
Uniformity of Cell Shape
Marginal Adhesion
Single Epithelial Cell Size
Bare Nuclei
Bland Chromatin
Normal Nucleoli
Mitoses

Target Class:

2 → Benign (Non-Cancerous)
4 → Malignant (Cancerous)

The ID column is removed during preprocessing as it does not contribute to prediction.

⚙️ Project Workflow

Load CSV dataset
Data cleaning and preprocessing
Feature scaling using StandardScaler
Train-test split
Model training using SVM (RBF kernel)
Model evaluation and visualization

🤖 Machine Learning Model

Algorithm: Support Vector Machine (SVM)
Kernel: RBF (Radial Basis Function)

Why SVM?

Performs well on high-dimensional data
Effective for medical classification tasks
Handles non-linear decision boundaries efficiently

📊 Results & Evaluation

Accuracy: ~96%
Performance evaluated using:
- Accuracy Score
- Precision, Recall, F1-score
- Confusion Matrix

📈 Model Performance Visualizations

🔹 Accuracy The following image shows the achieved accuracy of the model:

🔹 Confusion Matrix The confusion matrix visualizes correct and incorrect predictions:

🛠️ Tech Stack

Python
Pandas
NumPy
Scikit-learn
Matplotlib
Seaborn

🚀 How to Run the Project

1️⃣ Clone the repository

git clone https://github.com/madhusudanladda/Cancer-Cell-Classifier-High-Acc.git

cd Cancer-Cell-Classifier-High-Acc

2️⃣ Install dependencies

pip install -r requirements.txt

3️⃣ Run the notebook

Open main.ipynb in Jupyter Notebook or VS Code and execute the cells.

📌 Repository Structure text Copy code ├── main.ipynb

├── cancer_cell_dataset.csv

├── requirements.txt

├── accuracy.png

├── confusion_matrix.png

└── README.md

🔮 Future Scope

Deploy the model using Streamlit or Flask

Compare SVM with Random Forest and Neural Networks

Use larger real-world medical datasets

Cloud deployment for remote access

👨‍💻 Author

Madhusudan Ladda

BTech Computer Science

Interested in Machine Learning & Medical AI

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENSE		LICENSE
README.md		README.md
accuracy.png		accuracy.png
cancer_cell_dataset.csv		cancer_cell_dataset.csv
confusion_matrix.png		confusion_matrix.png
main.ipynb		main.ipynb
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages