Welcome to a new lesson! Today, we'll learn about basic statistical operations using Python's NumPy library. These operations, including mean, median, mode, variance, and standard deviation, are vital tools for understanding and interpreting data. After learning each operation, we'll apply our understanding to a real-world dataset.
The mean, or average, is the sum of all values divided by the number of values. In Python, we use np.mean(array) to calculate the mean. The median is the middle number in a sorted list (for an even number of values, it is the average of the two middle numbers), which can be calculated using np.median(array). The mode is the most frequent value in your data set, which can be calculated using the mode() function from scipy's stats module.
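To make the definitions of mean and median concrete, here is a minimal sketch. The array name scores and its values are hypothetical, chosen only so that the arithmetic is easy to check by hand.

```python
import numpy as np

# Hypothetical sample data, used only for illustration.
scores = np.array([2, 4, 4, 5, 7, 9])

# Mean: sum of all values divided by the number of values, (2+4+4+5+7+9) / 6.
mean_value = np.mean(scores)

# Median: the data has an even number of values, so NumPy averages
# the two middle values of the sorted data (4 and 5).
median_value = np.median(scores)

print("Mean:", mean_value)
print("Median:", median_value)  # 4.5
```

Note that np.median does not require the input to be sorted beforehand; the sorting is handled internally.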
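Note that stats.mode returns a result object rather than a bare number. This object has two fields: mode, the most frequent value, and count, the number of times it occurs. In case of a tie, only one of the tied values is reported (in practice, the smallest). To obtain the actual mode value, we read the mode field of the result (in older SciPy releases this field is a one-element array, so we take its first element) like this: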
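A minimal sketch of reading the mode out of the result object, assuming SciPy is installed. The array data is hypothetical and deliberately contains a tie. Because the shape of the mode field differs between SciPy releases (a scalar in recent ones, a one-element array in older ones), the sketch wraps it with np.atleast_1d so that taking the first element works either way. The np.unique step at the end is an addition for illustration, not part of stats.mode.

```python
import numpy as np
from scipy import stats

# Hypothetical data with a tie: 1 and 2 both appear twice.
data = np.array([1, 1, 2, 2, 3])

result = stats.mode(data)

# np.atleast_1d makes this work whether result.mode is a scalar
# (recent SciPy) or a one-element array (older SciPy).
mode_value = np.atleast_1d(result.mode)[0]
mode_count = np.atleast_1d(result.count)[0]

print("Mode:", mode_value)
print("Count:", mode_count)

# If every tied value is needed, count occurrences with NumPy directly.
values, counts = np.unique(data, return_counts=True)
all_modes = values[counts == counts.max()]
print("All tied modes:", all_modes)
```

With tied data like this, stats.mode reports a single value, so the np.unique approach is the one to reach for when ties matter.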
