Topic Overview

Hello and welcome! In this lesson, we are going to learn how to convert categorical data into ordered types using the Diamonds dataset from the seaborn library. The goal of this lesson is to enable you to transform categorical data into ordered categorical types effectively. Understanding this process is crucial for improving data analysis and visualization.

Introduction to Categorical Data

Categorical data is data that can be divided into groups or categories. For example, the grades students receive (A, B, C, etc.), types of cars (SUV, Sedan, Truck), and the levels of satisfaction in a survey (Poor, Fair, Good, Very Good, Excellent) are all examples of categorical data.

In the Diamonds dataset, we have categorical columns such as cut, color, and clarity:

  • cut describes the quality of the diamond cut (e.g., Fair, Good, Very Good, Premium, Ideal).
  • color indicates the color grading of a diamond (e.g., D, E, F, G, H, I, J).
  • clarity represents the clarity of the diamond (e.g., I1, SI2, SI1, VS2, VS1, VVS2, VVS1, IF).
Understanding Categorical Data Conversion

Converting categorical data to ordered types is essential for several reasons:

Sign up
Join the 1M+ learners on CodeSignal
Be a part of our community of 1M+ users who develop and demonstrate their skills on CodeSignal