# Nakafa Framework: LLM URL: /en/subject/high-school/10/mathematics/statistics/variance-standard-deviation-data-group Source: https://raw.githubusercontent.com/nakafaai/nakafa.com/refs/heads/main/packages/contents/subject/high-school/10/mathematics/statistics/variance-standard-deviation-data-group/en.mdx Output docs content for large language models. --- export const metadata = { title: "Variance and Standard Deviation for Grouped Data", description: "Learn to calculate variance and standard deviation for grouped frequency data with step-by-step examples. Master statistical formulas for class intervals.", authors: [{ name: "Nabil Akbarazzima Fatih" }], date: "04/22/2025", subject: "Statistics", }; ## Calculating Spread for Grouped Data How do we measure the spread of data presented in a grouped frequency table? For example, data on phone battery duration grouped into hour intervals (6-10 hours, 11-15 hours, etc.). Since we don't know the exact value of each data point within a class interval (e.g., in the 11-15 hours class, we don't know if the duration was exactly 11 hours, 12 hours, or something else), we need to make an assumption. The most common assumption is that all data within a class interval are evenly distributed. Therefore, we can **represent** all data in that class using the **midpoint** () of that class. ## Formulas for Variance and Standard Deviation of Grouped Data Using the midpoint () and frequency () of each class, the formulas are slightly different: 1. **Variance ()** The commonly used (and easier to compute) formula is the computational formula adapted for grouped data: This formula essentially calculates the average of the squared midpoints weighted by frequency, minus the square of the average midpoint weighted by frequency (the mean of the grouped data). 2. **Standard Deviation ()** Just like with ungrouped data, the standard deviation is the square root of the variance: ## Calculating Variance and Standard Deviation of Phone Battery Duration Suppose a study on phone battery duration yielded the following data: | Battery duration (hours) | Frequency () | | :----------------------: | :---------------------------------: | | 6-10 | 2 | | 11-15 | 10 | | 16-20 | 18 | | 21-25 | 45 | | 26-30 | 5 | Let's determine the variance and standard deviation for this battery duration data. ### Create a Helper Table We need to calculate the midpoint () for each class, then compute and . | Battery duration (hours) | Midpoint, | Frequency, | | | | :----------------------: | :--------------------------------------: | :-----------------------------------: | :----------------------------------------: | :-------------------------------------------: | | 6-10 | | 2 | | | | 11-15 | | 10 | | | | 16-20 | | 18 | | | | 21-25 | | 45 | | | | 26-30 | | 5 | | | | **Total** | | **** | **** | **** | ### Calculate Variance Plug the total values from the table into the variance formula:
So, the variance of the battery duration data is approximately 19.37 (in units of hours squared). ### Calculate Standard Deviation Take the square root of the variance: The standard deviation of the phone battery duration is approximately 4.4 hours. This gives us an idea that the average deviation of battery duration from the mean (which can be calculated as hours) is about 4.4 hours. The smaller the standard deviation, the more uniform the phone battery durations were in the study.