Customer Segmentation Using Clustering

Project Overview

This project applies unsupervised machine learning to customer segmentation. Clustering algorithms group customers into distinct segments based on purchasing behavior and demographic features, which helps businesses develop targeted marketing strategies.

Analyzing the customer data reveals patterns that distinguish customer types such as high spenders, budget-conscious shoppers, and moderate consumers.

Objectives

Understand customer behavior through data exploration
Apply K-Means clustering to segment customers
Visualize the resulting clusters in 2D and 3D
Provide actionable insights for business decision-making

📂 Dataset

Name: Mall Customer Segmentation Data
Source: [UCI / Kaggle / Public Dataset]
Features:
- CustomerID
- Gender
- Age
- Annual Income (k$)
- Spending Score (1–100)

The dataset contains details of customers visiting a mall and their spending behavior.
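As a minimal sketch of loading the data with pandas: the helper name load_customers, the shortened column names, and the inline sample rows are all assumptions made for illustration, and the exact CSV header spelling may differ from the feature list above.

```python
import io

import pandas as pd

# Short, code-friendly names for the two longest columns (the new names are
# an assumption). Both dash spellings of the spending column are listed
# because the header in the actual file may use either one.
COLUMN_NAMES = {
    "Annual Income (k$)": "income_k",
    "Spending Score (1-100)": "spending_score",
    "Spending Score (1\u2013100)": "spending_score",
}


def load_customers(source):
    """Read the customer CSV from a path or file-like object and tidy column names."""
    df = pd.read_csv(source)
    # Keys that are not present in the file are silently ignored by rename().
    return df.rename(columns=COLUMN_NAMES)


# Tiny inline sample standing in for the real file (values are made up).
SAMPLE_CSV = io.StringIO(
    "CustomerID,Gender,Age,Annual Income (k$),Spending Score (1-100)\n"
    "1,Male,19,15,39\n"
    "2,Female,21,15,81\n"
    "3,Female,35,60,50\n"
)

customers = load_customers(SAMPLE_CSV)
print(customers.dtypes)
print(customers.describe())
```

With the repository's data file, the same helper would be called as load_customers("Mall_Customers.csv").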

Tools & Technologies

Python
- pandas, numpy – data manipulation
- matplotlib, seaborn, plotly – data visualization
- scikit-learn – machine learning (KMeans, Silhouette Score)
Jupyter Notebook – development environment

Workflow

Data Preprocessing
- Handle missing/null values (if any)
- Encode categorical variables
- Normalize/scale data for clustering
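The three preprocessing steps above could look roughly like the sketch below. The median imputation, the 0/1 gender mapping, and the use of StandardScaler are illustrative choices, not necessarily what the notebook does, and the sample rows are made up.

```python
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Made-up rows in the dataset's schema, with one missing Age value
# so the imputation step has something to do.
raw = pd.DataFrame({
    "Gender": ["Male", "Female", "Female", "Male"],
    "Age": [19, 21, None, 45],
    "Annual Income (k$)": [15, 15, 60, 120],
    "Spending Score (1-100)": [39, 81, 50, 10],
})


def preprocess(df):
    """Fill missing numeric values, encode Gender, and standardize every feature."""
    out = df.copy()
    numeric_cols = out.select_dtypes("number").columns
    # Median imputation is one simple option; dropping incomplete rows is another.
    out[numeric_cols] = out[numeric_cols].fillna(out[numeric_cols].median())
    # Binary encoding of the only categorical column (which value gets 0 is arbitrary).
    out["Gender"] = out["Gender"].map({"Male": 0, "Female": 1})
    # Zero mean and unit variance, so no single feature dominates Euclidean distance.
    scaled = StandardScaler().fit_transform(out)
    return pd.DataFrame(scaled, columns=out.columns, index=out.index)


features = preprocess(raw)
print(features.round(2))
```

Scaling matters for K-Means because the algorithm relies on Euclidean distance: without it, income in thousands of dollars and a 1 to 100 score would contribute unequally.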
Exploratory Data Analysis (EDA)
- Distribution plots by gender, age, income
- Pair plots and correlation heatmaps
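Before plotting, the numbers behind these charts can be computed directly with pandas. The sketch below uses made-up rows in the dataset's schema and shows per-gender averages plus the correlation matrix that a heatmap would display.

```python
import pandas as pd

# Made-up rows in the dataset's schema, for illustration only.
df = pd.DataFrame({
    "Gender": ["Male", "Female", "Female", "Male", "Female", "Male"],
    "Age": [19, 21, 35, 45, 52, 30],
    "Annual Income (k$)": [15, 16, 60, 120, 78, 54],
    "Spending Score (1-100)": [39, 81, 50, 10, 35, 60],
})
numeric_cols = ["Age", "Annual Income (k$)", "Spending Score (1-100)"]

# How many customers of each gender, and their average profile.
gender_counts = df["Gender"].value_counts()
by_gender = df.groupby("Gender")[numeric_cols].mean()

# Pairwise Pearson correlations: the matrix a correlation heatmap visualizes.
corr = df[numeric_cols].corr()

print(gender_counts)
print(by_gender.round(1))
print(corr.round(2))
```

In a notebook, seaborn can render the same information graphically, for example sns.heatmap(corr, annot=True) for the correlation matrix and sns.pairplot(df, hue="Gender") for the pair plots.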
Clustering (K-Means)
- Determine optimal number of clusters using the Elbow Method and Silhouette Score
- Train KMeans model and predict clusters
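A hedged sketch of the model-selection step follows. It runs on synthetic blobs rather than the mall data, and the helper name evaluate_k and the range of k values tried are assumptions. Inertia (within-cluster sum of squares) is the quantity plotted for the Elbow Method; the silhouette score measures how well separated the resulting clusters are.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

# Synthetic stand-in for the scaled customer features: five tight, well-separated groups.
CENTERS = [[-5, -5], [-5, 5], [5, -5], [5, 5], [0, 0]]
X, _ = make_blobs(n_samples=200, centers=CENTERS, cluster_std=0.6, random_state=42)


def evaluate_k(X, k_values):
    """Fit K-Means for each k and record inertia (elbow) and silhouette score."""
    inertias, silhouettes = {}, {}
    for k in k_values:
        model = KMeans(n_clusters=k, n_init=10, random_state=42).fit(X)
        inertias[k] = model.inertia_
        silhouettes[k] = silhouette_score(X, model.labels_)
    return inertias, silhouettes


inertias, silhouettes = evaluate_k(X, range(2, 9))

# The elbow is read off a plot of inertia against k; the silhouette
# criterion can be applied directly by taking the k with the highest score.
best_k = max(silhouettes, key=silhouettes.get)

# Train the final model and assign every sample to a cluster.
final_model = KMeans(n_clusters=best_k, n_init=10, random_state=42)
labels = final_model.fit_predict(X)

for k in sorted(inertias):
    print(f"k={k}  inertia={inertias[k]:.1f}  silhouette={silhouettes[k]:.3f}")
print("chosen k:", best_k)
```

The two criteria do not always agree on real data, so it is common to inspect both and favor the value of k that also yields interpretable segments.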
Visualization
- 2D scatter plot of clusters (e.g., Income vs. Spending)
- Interactive 3D visualization for deeper insight
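The 2D cluster plot might be produced along these lines with matplotlib. The data here is random noise in the income and spending ranges, used only to have something to draw, and the figure is written to an in-memory buffer instead of a file.

```python
import io

import matplotlib
matplotlib.use("Agg")  # headless backend; omit this line inside a notebook
import matplotlib.pyplot as plt
import numpy as np
from sklearn.cluster import KMeans

# Random stand-in data: column 0 plays annual income (k$), column 1 spending score.
rng = np.random.default_rng(0)
X = np.column_stack([rng.uniform(15, 140, size=150), rng.uniform(1, 100, size=150)])

model = KMeans(n_clusters=5, n_init=10, random_state=0).fit(X)

fig, ax = plt.subplots(figsize=(7, 5))
# One color per cluster label.
ax.scatter(X[:, 0], X[:, 1], c=model.labels_, cmap="viridis", s=30)
# Overlay the fitted centroids.
ax.scatter(
    model.cluster_centers_[:, 0],
    model.cluster_centers_[:, 1],
    c="red", marker="X", s=150, label="Centroids",
)
ax.set_xlabel("Annual Income (k$)")
ax.set_ylabel("Spending Score (1-100)")
ax.set_title("Customer segments (K-Means, k=5)")
ax.legend()

# Render to an in-memory PNG; in a notebook, plt.show() would be used instead.
buffer = io.BytesIO()
fig.savefig(buffer, format="png", dpi=120)
```

For the interactive 3D view, plotly's px.scatter_3d accepts a DataFrame together with x, y, z, and color column names; using Age as the third axis would be one natural choice, though that is an assumption about the notebook.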

Results & Insights

Optimal Clusters: 5 (based on Elbow and Silhouette methods)
Identified Customer Segments:
- High Income – High Spending
- High Income – Low Spending
- Low Income – High Spending
- Average Income – Average Spending
- Young Shoppers
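Segment names like these are typically derived by profiling each cluster after fitting. The sketch below shows one way to do that: compare each cluster's average income and spending with the overall averages. The data, the column names, and the helper name_segment are hypothetical, and the thresholding rule is an assumption rather than the notebook's method.

```python
import pandas as pd

# Made-up customers with cluster labels already assigned by K-Means.
df = pd.DataFrame({
    "cluster": [0, 0, 1, 1, 2, 2],
    "income_k": [120, 110, 115, 125, 20, 25],
    "spending_score": [85, 90, 12, 18, 80, 75],
})


def name_segment(income, spending, income_cut, spending_cut):
    """Label a cluster by comparing its averages with the overall averages."""
    income_part = "High Income" if income >= income_cut else "Low Income"
    spending_part = "High Spending" if spending >= spending_cut else "Low Spending"
    return f"{income_part} - {spending_part}"


# Average income and spending per cluster.
profile = df.groupby("cluster")[["income_k", "spending_score"]].mean()

# Overall averages serve as the high/low cut-offs.
income_cut = df["income_k"].mean()
spending_cut = df["spending_score"].mean()

profile["segment"] = [
    name_segment(row.income_k, row.spending_score, income_cut, spending_cut)
    for row in profile.itertuples()
]
print(profile)
```

A two-level rule like this cannot produce a middle category such as "Average Income - Average Spending"; that would need a third band around the overall averages.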

These clusters enable businesses to:

Focus loyalty programs on high-spending segments
Offer discounts to low-spending groups to increase retention
Design personalized marketing strategies

How to Run the Project

Clone the repository:

git clone https://github.com/yourusername/customer-segmentation.git
cd customer-segmentation
