SitePoint
  • Premium
  • Library
  • Community
  • Jobs
  • Blog
LoginStart Free Trial

The home for developers

Track your course progress, manage projects, and find your dream job - all in one place.

Start Your Free Trial

7 Day Free Trial. Cancel Anytime.

SitePoint Premium

  • Access to 700+ courses
  • Publish articles on SitePoint
  • Daily curated jobs
  • Learning Paths
  • Discounts to dev tools
  • Library/
  • Text courses/
  • Cluster Analysis in Python
Cluster Analysis in Python

Cluster Analysis in Python

Understand your data with cluster analysis


Course details

0∙0 Ratings0 reviews

Published
November 2022
Publisher
SitePoint
Chapters
1

About the course

If you’ve browsed through Google News, you’ll most likely have noticed that news articles on similar topics are grouped together. Google scans the Internet for new articles, and then analyzes the text within each of them to decide which articles talk about the same topic! It’s essentially a result of an unsupervised machine learning algorithm. Cluster analysis is an example of an unsupervised learning algorithm. Another example of clustering is trending topics on Twitter. Twitter goes through new tweets in real time to determine words, hashtags, and a combination of them, to determine what’s causing a buzz.

Unsupervised learning involves the use of machine learning techniques to find patterns and make inferences on unlabelled data. Labelled data has some kind of a tag associated with it—such as a name or an attribute. This label can be categorical or numerical data. Unlabelled data contains no tags. Going back to our example on Google News, news articles are examples of unlabelled data. A cluster of news articles has similar words and word associations repeating in it.

Common unsupervised learning algorithms are cluster analysis, anomaly detection, and neural networks. Cluster analysis involves analyzing unlabelled data and grouping similar data points together. The data points are grouped on the basis of certain characteristics. These groups are called clusters. A common example of clustering is customer segmentation. Cluster analysis can group similar customers on the basis of their demographics and spending patterns, and help businesses cater their offerings differently to different clusters.

In this tutorial, we’ll look at the following:

  • what cluster analysis is
  • when cluster analysis should be used
  • how data can be prepared for cluster analysis
  • two techniques for performing cluster analysis
  • how to interpret the results of cluster analysis

Author

Shaumik Daityari

Shaumik Daityari

Shaumik is an optimist, but one who carries an umbrella. He is currently pursuing his MBA at IIM Lucknow, after completing his M.Tech at IIT Roorkee. A co-founder of The Blog Bowl, he loves writing.

Course Outline

Chapter 1: Cluster Analysis in Python

Reviews

0 total

0
0
0
0
0

Course details

0∙0 Ratings0 reviews

Published
November 2022
Publisher
SitePoint
Chapters
1

SitePoint

Stuff we do

  • Premium
  • Newsletters
  • Learning paths
  • Library
  • Forums

Contact

  • Contact us
  • FAQ
  • Publish your book
  • Write an article
  • Advertise

About

  • Our Story
  • Corporate Memberships
  • Start free trial
  • Login

Connect

  • RSS
  • Facebook
  • Twitter (X)
  • Instagram

Subscribe to our newsletter

Get the freshest news and resources for developers, designers and digital creators in your inbox each week

© 2000 – 2025 SitePoint Pty. Ltd.

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

Terms of usePrivacy policy