Section 1 - Instruction

Welcome to Data Cleaning! Real-world data is often messy. Imagine a survey where some people didn't answer every question. Those blank spots are "missing values," a common problem we need to fix before any analysis.

Engagement Message

Why do you think missing data could be a problem for analysis?

Section 2 - Instruction

In Pandas, missing data is often represented by NaN, which stands for "Not a Number". When you see NaN in a DataFrame, it's a placeholder for a value that doesn't exist or is unknown. It's our primary target.

Engagement Message

Why do you think Pandas needs a special symbol like NaN for missing data?

Section 3 - Instruction

So, how do we find these NaN values systematically? Pandas gives us a powerful tool: the .isnull() method. You can apply it to your entire DataFrame to check every single cell for missing data all at once.

Engagement Message

What do you think this method returns as its output?

Section 4 - Instruction

When you run .isnull(), it doesn't show your original data. Instead, it returns a DataFrame of the same shape, but filled with only True or False values. This is often called a "boolean mask."

Engagement Message

Sign up
Join the 1M+ learners on CodeSignal
Be a part of our community of 1M+ users who develop and demonstrate their skills on CodeSignal