Welcome to Analyzing Data with Histograms, the third course in our learning path! In the first two courses, we built a solid toolkit of numerical summaries — from measures of center like the mean and median to measures of spread like the range and IQR. Now it's time to bring data to life visually. In this first lesson, we'll learn how to read a histogram by identifying its parts, understanding what each bar represents, and pulling specific information directly from the display.
A single number like the mean or IQR is great for summarizing a dataset, but it can't show us everything at once. A histogram gives us a visual snapshot of an entire distribution, making it easy to see where data points concentrate and how spread out they are. Think of it as a bridge between raw data and the bigger story that data tells.
Before we can interpret that story, though, we first need to understand how a histogram is put together. Let's start with the core building blocks.
A histogram organizes data into equal-width intervals and displays how many data points fall into each one. Here are the three main components:
- Horizontal axis (x-axis): Shows the range of data values divided into consecutive, equal-width intervals called bins. For example, if we are measuring commute times in minutes, the bins might be 0–10, 10–20, 20–30, and so on.
- Vertical axis (y-axis): Shows frequency — the count of data points that land in each bin.
- Bars: Each bar sits directly over a bin, and its height equals the frequency for that interval. Unlike a typical bar chart, the bars in a histogram touch one another because the data is continuous, with no gaps between intervals.
One small but important detail: when a value falls exactly on a bin boundary, it is typically counted in the bin that starts at that value. For example, a commute of exactly minutes would belong to the 20–30 bin, not the 10–20 bin.
Let's practice with a concrete scenario. Imagine a company surveyed employees about their one-way commute times (in minutes) and displayed the results in a histogram. The table below represents what we would read from the bars:
Each row corresponds to one bar in the histogram. The bar over the 20–30 bin is the tallest, reaching a height of . This tells us that employees have a commute between and minutes, making it the most common commute range. Meanwhile, the bar over 0–10 is the shortest at , meaning very few employees commute under minutes.
Often we want to know how many data points fall within a range that spans more than one bin. For instance, suppose we want to find how many employees commute less than 20 minutes. We identify the bins that cover that range (0–10 and 10–20) and add their frequencies:
So out of employees commute less than minutes. Similarly, if we want to know how many commute 30 minutes or more, we add the 30–40 and 40–50 bins:
In this lesson, we covered the building blocks of reading a histogram. We learned that the horizontal axis shows value intervals (bins), the vertical axis shows frequency, and each bar's height tells us how many data points fall in that interval. We also practiced extracting specific counts from individual bars and combining multiple bars to answer questions about broader ranges.
These foundational skills will carry us through the rest of the course as we move on to describing shapes, spotting patterns, and drawing real-world conclusions. Up next is a set of hands-on exercises where you will read histograms, locate key intervals, and calculate totals on your own — let's put these skills to work!

