Skip to content

Customer segmentation using machine learning technique to divide a customer into distinct groups based on similarities between customers. This helps businesses tailor their marketing strategies to specific customer segments.

Notifications You must be signed in to change notification settings

shubham5027/Customer-Segmentation-Machine-Learning-

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Customer Segmentation using Machine Learning

Run This App

https://customer-segmentation-machine-learning-webapp-vjvaefg8lhaf7zn2.streamlit.app/

This repository contains the code and resources for a customer segmentation project using KMeans clustering algorithms. The project includes:

  • Data preprocessing
  • KMeans clustering
  • Evaluation and visualization
  • Deployment of a web application using Streamlit

Data Preprocessing

newplot (4) newplot newplot (1) newplot (2) newplot (3)

Algorithms Analysis

algorithm

Deployment on Web

Screenshot 2024-01-09 225746 Screenshot 2024-01-09 224213 Screenshot 2024-01-09 225752 Screenshot 2024-01-09 232002

Prerequisites

  • Python
  • NumPy
  • Pandas
  • Scikit-learn
  • Matplotlib
  • Streamlit

Getting Started

  1. Clone this repository:
git clone https://github.com/shubham5027/customer-segmentation.git
  1. Navigate to the project directory:
cd customer-segmentation
  1. Install the required libraries:
pip install -r requirements.txt
  1. Run the Google Colab Customer_Segmentation.ipynb to perform data preprocessing, clustering, and evaluation.

  2. To deploy the web application, run:

streamlit run _app.py

Data Preprocessing

The data preprocessing step involves:

  • Loading the customer dataset
  • Cleaning the data
  • Feature engineering
  • Scaling the data

KMeans Clustering

KMeans clustering algorithm is used to segment the customers into different groups based on their features. The algorithm is run with different values of k to select the optimal number of clusters.

Evaluation and Visualization

The evaluation step involves:

  • Visualizing the clusters in 2D and 3D
  • Calculating the Within-Cluster Sum of Squares (WCSS) for each value of k
  • Selecting the optimal number of clusters based on the WCSS plot

References

License

This project is licensed under the MIT License. See the LICENSE file for details.

About

Customer segmentation using machine learning technique to divide a customer into distinct groups based on similarities between customers. This helps businesses tailor their marketing strategies to specific customer segments.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published