Hi there! Today, we're going to learn how to apply Linear Regression to a real dataset. Working with real data shows how machine learning solves practical problems. We'll use the California Housing Dataset. By the end of this lesson, you'll know how to fit Linear Regression on a real dataset and interpret the results.
Before diving into the code, let's understand the dataset we'll be working with. The California Housing Dataset is based on data from the 1990 California census. It contains information about various factors affecting housing prices in different districts of California.
Here's a quick overview of the feature columns in the dataset (each row describes one census block group):
MedInc: Median income in block group
HouseAge: Median house age in block group
AveRooms: Average number of rooms per household
AveBedrms: Average number of bedrooms per household
Population: Block group population
AveOccup: Average household size
Latitude: Block group latitude
Longitude: Block group longitude
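To make the column list concrete, here is a minimal sketch of how the data could be loaded and inspected. It assumes the copy of the dataset bundled with scikit-learn (`fetch_california_housing`) and that pandas is installed; the lesson itself may load the data differently. The names `FEATURE_COLUMNS`, `TARGET_COLUMN`, and `load_housing_frame` are hypothetical helpers introduced only for this illustration.

```python
# Feature columns described above, in the order scikit-learn provides them.
FEATURE_COLUMNS = [
    "MedInc", "HouseAge", "AveRooms", "AveBedrms",
    "Population", "AveOccup", "Latitude", "Longitude",
]

# In scikit-learn's copy of the dataset, the prediction target is the
# median house value per block group.
TARGET_COLUMN = "MedHouseVal"


def load_housing_frame():
    """Return the dataset as one pandas DataFrame (features plus target).

    Assumption: scikit-learn and pandas are installed. The first call
    downloads the data and caches it locally.
    """
    # Imported here so the column constants above can be used on their own.
    from sklearn.datasets import fetch_california_housing

    housing = fetch_california_housing(as_frame=True)
    return housing.frame
```

Calling `load_housing_frame()[FEATURE_COLUMNS].describe()` would then print summary statistics for each feature, which is a quick way to check that the columns match the descriptions above before fitting a model.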
