# Nakafa Framework: LLM URL: /en/subject/high-school/10/mathematics/statistics/percentile-data-group Source: https://raw.githubusercontent.com/nakafaai/nakafa.com/refs/heads/main/packages/contents/subject/high-school/10/mathematics/statistics/percentile-data-group/en.mdx Output docs content for large language models. --- export const metadata = { title: "Percentiles for Grouped Data", description: "Calculate percentiles in grouped data using interpolation formulas. Learn to find data positions and interpret percentile rankings with examples.", authors: [{ name: "Nabil Akbarazzima Fatih" }], date: "04/22/2025", subject: "Statistics", }; ## What Are Percentiles? You're already familiar with [quartiles](/subject/high-school/10/mathematics/statistics/quartile-data-group), which divide data into 4 equal parts, right? Well, **percentiles** are like quartiles' sibling, but they're even more detailed! If quartiles divide data into 4 chunks, percentiles divide ordered data into **100 equal chunks**. That's a lot, huh? Like dividing a chocolate bar into 100 tiny squares. Each chunk is separated by a percentile value. There are 99 percentile values, starting from , , , ..., up to . - (10th Percentile) means this value separates the smallest 10% of the data from the remaining 90%. - (50th Percentile) is exactly the same as the **Median** or the **Second Quartile ( )**, because it divides the data right in the middle (50% below, 50% above). - (85th Percentile) means this value separates the smallest 85% of the data from the largest 15%. Percentiles are very useful for seeing the position of a specific value relative to the entire dataset, like class rankings for test scores or a child's growth compared to peers of the same age. ## How to Find Percentile Values for Grouped Data Just like finding quartiles for grouped data, we also use **interpolation** to find the value of a percentile () when the data is grouped. The steps are very similar: ### Find the Percentile Class Position First, we determine which data point corresponds to the i-th percentile. The formula is: - = Which percentile are we looking for? (e.g., 10, 50, 85) - = Total number of data points Once we have the position, we look at the cumulative frequency table () to find out which class interval this percentile falls into. ### Calculate the Percentile Value using the Interpolation Formula Once we know the class, we use this magic interpolation formula: Where: - = Value of the i-th Percentile (what we're looking for) - = Lower boundary of the i-th percentile class - = Which percentile (e.g., 10, 85) - = Total frequency - = Cumulative frequency **BEFORE** the i-th percentile class - = Frequency of the i-th percentile class - = Class width Notice, the formula is very similar to the quartile formula, the only difference is the part (quartiles use ). ## Finding Math Test Scores For example, let's say we have the math test scores of 40 students: | Test Score | Frequency () | Cumulative Frequency () | Lower Boundary () | Class Width () | | :--------: | :---------------------------------: | :----------------------------------------------: | :----------------------------------------: | :-----------------------------------: | | 61-70 | 4 | 4 | | 10 | | 71-80 | 10 | 14 | | 10 | | 81-90 | 16 | 30 | | 10 | | 91-100 | 10 | 40 | | 10 | | **Total** | **40** | | | | We want to find the value of the 85th Percentile (). 1. **Find the Position of :** Position of = the -th data point. 2. **Determine the Class of :** Look at the column. Where is the 34th data point? The 81-90 class has (not enough). The 91-100 class has (data points 31 through 40 are here). So, the class is **91-100**. 3. **Gather Ingredients for the Formula:** - (Lower boundary of class 91-100) = 90.5 - = 85 - = 40 - (Cumulative frequency before class 91-100) = 30 - (Frequency of class 91-100) = 10 - (Class width) = 10 4. **Calculate :**
So, the 85th Percentile value is 94.5. This means 85% of the students scored 94.5 or less, and 15% scored above 94.5. ## Exercise Try calculating the value of the 20th Percentile () from the math test score data above! ### Answer Key 1. **Position of :** Position of = the -th data point. 2. **Class of :** Look at . The 8th data point is in the **71-80** class (because the previous class's is 4, and this class's is 14). 3. **Formula Ingredients:** - = 70.5 - = 20 - = 40 - = 4 - = 10 - = 10 4. **Calculate :**
The 20th Percentile value is 74.5.