#️⃣ Unsupervised Learning

🛒 Example: Customer Segmentation in a Shopping Mall

🟦 1. 📖 Introduction

💡 Unsupervised Learning is a type of Machine Learning in which the computer learns from unlabeled data.

Unlike Supervised Learning, the data does not contain the correct output (labels). The algorithm automatically discovers hidden patterns, similarities, and relationships among the data.

🌟 Definition

✅ Unsupervised Learning is a machine learning technique in which the model is trained using unlabeled data. The algorithm groups similar data or discovers hidden patterns without being given the correct answers, although a person still chooses the algorithm and interprets the result.

🟩 2. 🛒 Real-Life Example

A shopping mall wants to understand the behavior of its customers.

The mall has customer information such as:

👤 Customer ID

🎂 Age

💰 Annual Income

🛍️ Amount Spent

🏙️ City

However, the customers are not already divided into groups.

The machine automatically creates customer groups based on similar shopping behavior.

🟨 3. 🔄 Step-by-Step Working

🟢 Step 1 : 📥 Collect Raw Data

The shopping mall collects customer information.

Information Collected

👤 Customer ID

🎂 Age

💰 Annual Income

🛍️ Shopping Amount

📍 City

This information is called Raw Data.

📌 Notice that there are NO labels like Premium Customer or Regular Customer.

🟢 Step 2 : ❓ No Labels Available

Unlike Supervised Learning,

❌ No "Correct Answer"

❌ No "Approved/Rejected"

❌ No "Pass/Fail"

The algorithm receives only customer information.

This is called Unlabeled Data.
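As a rough illustration of what "unlabeled" means here, the sketch below stores a few customer records with features only. The field names, the values, and the set of label field names are all made up for this example.

```python
# Hypothetical customer records: features only, no target/label field.
customers = [
    {"customer_id": 1, "age": 23, "annual_income": 30000, "amount_spent": 12000, "city": "City A"},
    {"customer_id": 2, "age": 41, "annual_income": 85000, "amount_spent": 40000, "city": "City B"},
    {"customer_id": 3, "age": 67, "annual_income": 50000, "amount_spent": 9000, "city": "City A"},
]

# A supervised dataset would carry an extra field such as "segment": "Premium".
# These field names are assumptions chosen for the illustration.
LABEL_FIELDS = {"segment", "label", "target"}


def is_unlabeled(records):
    """Return True when no record carries any of the assumed label fields."""
    return all(LABEL_FIELDS.isdisjoint(record) for record in records)


def feature_matrix(records, fields=("age", "annual_income", "amount_spent")):
    """Keep only the numeric fields; the ID and the city are dropped here."""
    return [[record[field] for field in fields] for record in records]


print(is_unlabeled(customers))
print(feature_matrix(customers))
```

Only the feature matrix is passed to the algorithm. The Customer ID is left out because it identifies a person but says nothing about behavior.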

🟢 Step 3 : 🔍 Data Interpretation

The Machine Learning Algorithm studies the customer records.

It may observe patterns such as the following (the product-related ones assume that purchase categories are also recorded):

✔ Customers with high income spend more.

✔ Young customers buy electronics.

✔ Families purchase groceries.

✔ Senior citizens buy healthcare products.

The machine begins identifying similarities automatically.

🟢 Step 4 : 🤖 Model Training

The algorithm analyzes every customer record.

It compares:

📊 Income

🛍️ Shopping Amount

🎂 Age

📍 Location

and finds customers with similar behavior.

No teacher or supervisor is involved.
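One common way to make "similar behavior" concrete is to treat each customer as a point and measure the distance between points. The sketch below assumes Euclidean distance and min-max scaling, which the text does not prescribe; the three rows are invented.

```python
import math


def min_max_scale(rows):
    """Rescale every column to [0, 1] so that income does not dominate age."""
    columns = list(zip(*rows))
    lows = [min(column) for column in columns]
    highs = [max(column) for column in columns]
    return [
        [(value - low) / (high - low) if high > low else 0.0
         for value, low, high in zip(row, lows, highs)]
        for row in rows
    ]


def euclidean(a, b):
    """Straight-line distance between two feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


# Hypothetical rows: [age, annual_income, amount_spent]
rows = [
    [25, 30000, 5000],
    [27, 32000, 5500],
    [55, 90000, 40000],
]
scaled = min_max_scale(rows)
d01 = euclidean(scaled[0], scaled[1])  # two similar customers
d02 = euclidean(scaled[0], scaled[2])  # two very different customers
print(d01 < d02)  # smaller distance means more similar
```

Without scaling, the income column (tens of thousands) would swamp the age column (tens), so the comparison would effectively ignore age.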

🟢 Step 5 : ⚙️ Processing

The algorithm processes all customer records repeatedly.

Gradually it forms groups based on similarities.

Example:

🟢 Group A → High Income Customers

🔵 Group B → Frequent Buyers

🟡 Group C → Budget Customers

🟣 Group D → Occasional Shoppers
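The repeated processing described above can be sketched with K-Means, one of the clustering algorithms named later on this page. This is a minimal version under simplifying assumptions: the first k points serve as starting centroids, and the data (income and spending, both in thousands) is invented.

```python
import math


def kmeans(points, k, iterations=100):
    """Plain k-means: assign each point to its nearest centroid, then move
    each centroid to the mean of its points, until assignments stop changing."""
    centroids = [list(point) for point in points[:k]]  # simplistic initialisation
    assignments = [None] * len(points)
    for _ in range(iterations):
        new_assignments = [
            min(range(k), key=lambda c: math.dist(point, centroids[c]))
            for point in points
        ]
        if new_assignments == assignments:
            break  # converged: no point changed its cluster
        assignments = new_assignments
        for c in range(k):
            members = [p for p, a in zip(points, assignments) if a == c]
            if members:  # keep the old centroid if a cluster ends up empty
                centroids[c] = [sum(col) / len(members) for col in zip(*members)]
    return assignments, centroids


# Hypothetical customers: (annual income, amount spent), both in thousands.
points = [(15, 5), (90, 60), (16, 6), (88, 62), (14, 4), (92, 58)]
assignments, centroids = kmeans(points, k=2)
print(assignments)  # [0, 1, 0, 1, 0, 1]
print(centroids)    # [[15.0, 5.0], [90.0, 60.0]]
```

Real implementations usually pick starting centroids more carefully and run several restarts, because a poor starting choice can lead to poor groups.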

🟢 Step 6 : 📊 Generate Output

Finally, the machine automatically creates customer groups.

Example Output

👑 Premium Customers

🛒 Regular Customers

💰 Budget Customers

🎯 Frequent Buyers

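These groups were not provided by humans. The machine discovered them automatically.

The algorithm itself returns only unnamed groups (for example Cluster 0, Cluster 1, Cluster 2). Names such as Premium or Budget are given afterwards by a person who inspects each group.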
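A clustering algorithm typically outputs only a cluster number for each customer. The sketch below shows one possible way to turn those numbers into readable group names: average the features per cluster and let an analyst apply a naming rule. The data, the cluster numbers, and the spending threshold of 30 are assumptions for illustration.

```python
from collections import defaultdict


def summarize_clusters(rows, cluster_ids):
    """Average each numeric column per cluster so a person can interpret it."""
    grouped = defaultdict(list)
    for row, cluster_id in zip(rows, cluster_ids):
        grouped[cluster_id].append(row)
    return {
        cluster_id: [sum(col) / len(members) for col in zip(*members)]
        for cluster_id, members in grouped.items()
    }


# Hypothetical rows [annual_income, amount_spent] in thousands,
# with cluster numbers as produced by some earlier clustering run.
rows = [[15, 5], [90, 60], [16, 6], [88, 62]]
cluster_ids = [0, 1, 0, 1]

profiles = summarize_clusters(rows, cluster_ids)

# The naming rule is chosen by an analyst, not learned by the algorithm.
names = {
    cluster_id: ("Premium" if spend > 30 else "Budget")
    for cluster_id, (income, spend) in profiles.items()
}
print(profiles)  # {0: [15.5, 5.5], 1: [89.0, 61.0]}
print(names)     # {0: 'Budget', 1: 'Premium'}
```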

🟥 4. 🔄 Workflow of Unsupervised Learning

📥 Raw Customer Data
            │
            ▼
❓ No Labels Available
            │
            ▼
🔍 Data Interpretation
            │
            ▼
🤖 Machine Learning Algorithm
            │
            ▼
⚙️ Processing
            │
            ▼
📊 Customer Groups (Clusters)

🟪 5. 📋 Important Components

🧩 Component	📖 Description
📥 Input Data	Customer Information
🏷️ Labels	❌ Not Available
👨‍🏫 Supervisor	❌ Not Required
📚 Training Dataset	Raw Unlabeled Data
🤖 Algorithm	Finds Hidden Patterns
🎯 Output	Customer Groups (Clusters)

🟦 6. 📂 Categories of Unsupervised Learning

🟢 1. Clustering

Groups similar data together.

Examples

🛒 Customer Segmentation

👨‍🎓 Student Grouping

🏥 Disease Pattern Analysis

🟡 2. Association Rule Mining

Finds relationships between different items.

Example

Customers who buy

🥛 Milk

often buy

🍞 Bread

This is widely used in supermarkets.
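The milk and bread observation comes down to counting how often two items appear in the same basket. Here is a small sketch with invented baskets; it only counts pairs and does not yet build rules.

```python
from collections import Counter
from itertools import combinations

# Hypothetical supermarket baskets (one set of items per purchase).
baskets = [
    {"milk", "bread", "eggs"},
    {"milk", "bread"},
    {"bread", "butter"},
    {"milk", "bread", "butter"},
    {"eggs", "tea"},
]

pair_counts = Counter()
for basket in baskets:
    # Sorting makes ("bread", "milk") and ("milk", "bread") the same key.
    for pair in combinations(sorted(basket), 2):
        pair_counts[pair] += 1

most_common_pair, count = pair_counts.most_common(1)[0]
print(most_common_pair, count)  # ('bread', 'milk') 3
```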

🟣 3. Dimensionality Reduction

Reduces the number of features while keeping the important information.

Example

Compressing a dataset from 100 features to 20 features.

Benefits:

✔ Faster Training

✔ Less Memory

✔ Better Visualization
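To make the "100 features to 20 features" example concrete, the sketch below keeps the 20 columns with the highest variance. This is just one simple selection rule, chosen here as an assumption; high variance does not always mean a feature is important, and the data is synthetic.

```python
from statistics import pvariance


def top_k_by_variance(rows, k):
    """Return the indices of the k highest-variance columns and the reduced rows."""
    columns = list(zip(*rows))
    ranked = sorted(range(len(columns)),
                    key=lambda j: pvariance(columns[j]),
                    reverse=True)
    selected = sorted(ranked[:k])  # keep the original column order
    reduced = [[row[j] for j in selected] for row in rows]
    return selected, reduced


# Synthetic dataset: 5 rows x 100 features, where feature j takes the values
# 0, j, 2j, 3j, 4j, so higher-numbered features vary more.
rows = [[i * j for j in range(100)] for i in range(5)]

selected, reduced = top_k_by_variance(rows, k=20)
print(len(rows[0]), "->", len(reduced[0]))  # 100 -> 20
```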

🟩 7. 🌍 Applications

🛒 Customer Segmentation

🎬 Movie Recommendation

🛍️ Market Basket Analysis

🏥 Disease Pattern Detection

📱 Image Compression

📈 Stock Market Pattern Analysis

🌐 Social Network Analysis

🟦 8. ✅ Advantages

✔ No Labeled Data Required

✔ Finds Hidden Patterns

✔ Discovers Unknown Groups

✔ Useful for Large Datasets

✔ Helps in Business Decision Making

🟥 9. ❌ Limitations

❌ Results are Difficult to Evaluate

❌ Groups may not always be meaningful

❌ Accuracy cannot be measured directly, because there are no correct labels to compare against

❌ Sensitive to poor-quality data

🟨 10. ⭐ Key Differences from Supervised Learning

🟢 Supervised Learning	🔵 Unsupervised Learning
Uses Labeled Data	Uses Unlabeled Data
Correct Output Available	No Correct Output
Supervisor Required	No Supervisor
Predicts Results	Finds Hidden Patterns
Classification & Regression	Clustering, Association & Dimensionality Reduction

🟥 11. 📝 Examination Definition

💡 Unsupervised Learning is a machine learning technique in which the computer learns from unlabeled data. It automatically discovers hidden patterns, similarities, and relationships without using predefined output labels.

🌟 🎯 Exam Tip

🔑 Remember This Sequence

📥 Raw Data

⬇️

❓ No Labels

⬇️

🔍 Pattern Identification

⬇️

🤖 Algorithm Learning

⬇️

⚙️ Processing

⬇️

📊 Grouping (Clusters)

⭐ One-Line Revision

📚 Unsupervised Learning = Unlabeled Data + Hidden Pattern Discovery + Automatic Grouping (Clustering)

Unsupervised Learning algorithms are mainly divided into three categories, depending on the task they perform.

🟢 1. Clustering

📖 Definition

Clustering is a technique that automatically groups similar data objects together based on their characteristics. Data points within the same cluster are more similar to each other than to those in other clusters.

The algorithm decides how to form the groups without any predefined labels.

🎯 Objective

To organize similar data into meaningful groups or clusters.

⚙️ How Clustering Works

1️⃣ The algorithm receives unlabeled data.

2️⃣ It measures the similarity between different data points.

3️⃣ Similar data points are placed into the same cluster.

4️⃣ Different clusters represent different categories of similar data.
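The four steps above can be sketched with a deliberately simple threshold rule: a point joins the first cluster whose first member is close enough, otherwise it starts a new cluster. This is an illustration only, not one of the named algorithms listed further down, and the points and the distance limit are assumptions.

```python
import math


def threshold_clusters(points, max_distance):
    """Group points by distance to the first member ("leader") of each cluster."""
    clusters = []
    for point in points:                      # step 1: unlabeled input
        for cluster in clusters:
            # step 2: measure similarity as distance to the cluster's leader
            if math.dist(point, cluster[0]) <= max_distance:
                cluster.append(point)         # step 3: similar points share a cluster
                break
        else:
            clusters.append([point])          # no similar cluster yet: start a new one
    return clusters                           # step 4: each list is one group


# Hypothetical two-feature points, already scaled to the range 0..1.
points = [(0.30, 0.20), (0.32, 0.25), (0.85, 0.90), (0.88, 0.95), (0.31, 0.22)]
clusters = threshold_clusters(points, max_distance=0.2)
print(len(clusters))  # 2
```

The result depends on the order of the points and on the chosen limit, which is one reason practical work relies on algorithms such as K-Means or DBSCAN instead.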

🌍 Real-Life Example

🎵 Music Streaming Application

A music streaming platform has thousands of songs that have not yet been sorted into groups or playlists.

The algorithm analyzes song features such as:

🎼 Genre

🎤 Singer

🎸 Instruments

⚡ Tempo

😊 Mood

It automatically creates groups like:

🎶 Romantic Songs

🎶 Classical Songs

🎶 Rock Songs

🎶 Party Songs

🎶 Devotional Songs

The platform can then recommend similar songs to users.

🛠 Popular Clustering Algorithms

K-Means Clustering
Hierarchical Clustering
DBSCAN
Mean Shift

🟡 2. Association Rule Mining

📖 Definition

Association Rule Mining is a technique used to discover relationships or associations between different items in a dataset.

It identifies which items frequently occur together and generates useful rules based on those relationships.

🎯 Objective

To find frequent item combinations and discover useful relationships between them.

⚙️ How Association Rule Mining Works

1️⃣ The algorithm analyzes transaction records or datasets.

2️⃣ It identifies items that frequently appear together.

3️⃣ It generates association rules.

4️⃣ These rules help organizations make better business decisions.
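Rules are usually judged by two standard measures: support (how often the items appear together across all transactions) and confidence (how often the consequent appears when the antecedent does). The sketch below computes both for invented transactions; it does not implement Apriori or any other full mining algorithm.

```python
def count(transactions, items):
    """Number of transactions that contain every item in `items`."""
    items = set(items)
    return sum(1 for transaction in transactions if items <= transaction)


def support(transactions, items):
    """Fraction of all transactions that contain the item set."""
    return count(transactions, items) / len(transactions)


def confidence(transactions, antecedent, consequent):
    """Of the transactions with the antecedent, the fraction that also
    contain the consequent."""
    n_antecedent = count(transactions, antecedent)
    if n_antecedent == 0:
        return 0.0
    n_both = count(transactions, set(antecedent) | set(consequent))
    return n_both / n_antecedent


# Hypothetical purchase records.
transactions = [
    {"smartphone", "mobile cover", "earbuds"},
    {"smartphone", "mobile cover"},
    {"smartphone", "power bank"},
    {"laptop", "mouse"},
    {"smartphone", "mobile cover", "power bank"},
]

rule_support = support(transactions, {"smartphone", "mobile cover"})
rule_confidence = confidence(transactions, {"smartphone"}, {"mobile cover"})
print(rule_support)     # 0.6  (3 of 5 transactions)
print(rule_confidence)  # 0.75 (3 of the 4 smartphone transactions)
```

A rule such as "Smartphone → Mobile Cover" would normally be kept only if both values exceed thresholds chosen by the analyst.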

🌍 Real-Life Example

🛒 Online Shopping Website

An e-commerce company studies customer purchase history.

It observes:

📱 Customers who buy a Smartphone

often also buy

🎧 Wireless Earbuds

📱 Mobile Cover

🔋 Power Bank

The company uses these relationships to recommend products during online shopping.

Example Rule:

If a customer buys a Smartphone, they are also likely to purchase a Mobile Cover and Earbuds.

🛠 Popular Association Rule Algorithms

Apriori Algorithm
FP-Growth Algorithm
ECLAT Algorithm

🟣 3. Dimensionality Reduction

📖 Definition

Dimensionality Reduction is a technique used to reduce the number of input features (variables) while preserving the most important information.

Many datasets contain unnecessary or duplicate features that increase complexity. This technique removes irrelevant information, making the model simpler and faster.

🎯 Objective

To simplify large datasets while retaining essential information.

⚙️ How Dimensionality Reduction Works

1️⃣ The algorithm analyzes all features.

2️⃣ It identifies important and less important features.

3️⃣ Redundant or unnecessary features are removed, or (as in PCA) the original features are combined into a smaller set of new features.

4️⃣ The reduced dataset is used for faster analysis and better visualization.
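As a sketch of these steps, the code below drops any feature that is almost perfectly correlated with a feature already kept. This is a simple feature-selection filter chosen for illustration; PCA works differently, by building new combined features. The column names, the values, and the 0.95 threshold are assumptions.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient; returns 0.0 for a constant column."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def drop_redundant(columns, threshold=0.95):
    """Keep a feature only if it is not highly correlated with one already kept."""
    kept = []
    for name, values in columns.items():
        if all(abs(pearson(values, columns[other])) < threshold for other in kept):
            kept.append(name)
    return kept


# Hypothetical features: the second column repeats the first in other units.
columns = {
    "height_cm": [150, 160, 170, 180, 190],
    "height_in": [59.1, 63.0, 66.9, 70.9, 74.8],
    "weight_kg": [70, 52, 88, 60, 75],
}
kept = drop_redundant(columns)
print(kept)  # ['height_cm', 'weight_kg']
```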

🌍 Real-Life Example

📸 Face Recognition System

A face recognition system collects many facial features such as:

👀 Eye Shape

👃 Nose Shape

👄 Lip Shape

😊 Facial Expression

🎨 Skin Texture

Some of these features may contain duplicate or less useful information.

The algorithm keeps only the most important facial features required for accurate identification.

This reduces computation time while maintaining recognition accuracy.

🛠 Popular Dimensionality Reduction Algorithms

Principal Component Analysis (PCA)
Linear Discriminant Analysis (LDA) (note: it needs class labels, so it is strictly a supervised method)
t-SNE
Autoencoders

🟥 4. Comparison of the Three Categories

📌 Feature	🟢 Clustering	🟡 Association Rule Mining	🟣 Dimensionality Reduction
🎯 Purpose	Group similar data	Discover relationships between items	Reduce the number of features
📤 Output	Clusters	Association Rules	Reduced Dataset
🌍 Example	Music Recommendation	Online Shopping Recommendations	Face Recognition
🛠 Popular Algorithm	K-Means	Apriori	PCA

Monday, June 29, 2026