What Are Quartiles?
The median is like a ruler that divides data into two equal parts, right in the middle (). Well, there's another friend of the median, called quartiles.
If the median divides data into two, quartiles are even better; they divide the sorted data into four equal parts! Imagine you have a chocolate bar, and you break it into four equal pieces. Quartiles are the breaking points.
There are three quartile breaking points:
- Lower Quartile (): This is the first break. It separates the smallest of the data from the rest. Like the first quarter of the chocolate.
- Middle Quartile (): This is the median! It's exactly in the middle, dividing the data in half ( left, right). Like the break in the middle of the chocolate.
- Upper Quartile (): This is the last break. It separates the smallest of the data from the largest . Like the boundary after three-quarters of the chocolate.
So, , , and divide our data into four small groups with the same number of data points ( each).
How to Find the Position of Quartiles
Okay, now how do we know the position (rank) of , , and in our ordered data?
Assume we have data points that we have sorted from smallest to largest.
Lower Quartile
The formula is simple:
- If the result is a whole number, for example , then is the value of the data point.
- If the result has a decimal, for example , then lies between the and data points. (There's a way to calculate its value later, but for now, we're just finding the position).
Simple Example:
Suppose we have data points ().
The position of is the data point at , which is data point or .
This means is between the and data points.
Middle Quartile
This is the median, so the formula is:
The rules are the same as for :
- If the result is a whole number, say , is the value of data point .
- If the result has a decimal, say , is between data points and .
Simple Example ():
The position of is the data point at , which is data point or .
This means (the median) is between data points and .
Upper Quartile
The formula is similar again:
The rules are exactly the same:
- If the result is a whole number, say , is the value of data point .
- If the result has a decimal, say , is between data points and .
Simple Example ():
The position of is the data point at , which is data point or .
This means is between data points and .
Exercise
Try to find the position of , , and from the math test scores of these children:
Scores:
Step : Sort the data first!
Sorted data:
Number of data points:
Step : Find the quartile positions using the formulas
-
Position of :
The result is a whole number (), so is the nd data point.
-
Position of (Median):
The result is a whole number (), so is the th data point.
-
Position of :
The result is a whole number (), so is the th data point.
Step : Determine the quartile values
Look at the sorted data:
- is nd data point =
- is th data point =
- is th data point =
The Fourth Quartile
You might be wondering, "If there's , , and , is there a ?"
Technically, the concept of quartiles divides the data into four parts. is the boundary for the first , (the median) is the boundary, and is the boundary. The final boundary, which encompasses of the data, is actually the maximum value of the dataset.
So, while we could refer to the maximum value as , in statistical analysis, we don't typically use the term explicitly. The main focus is on , , and because they provide important information about the spread and center of the data in the lower, middle, and upper sections. The minimum value is sometimes called , but like , it's less commonly used than , , and .