Before diving into cumulative frequency, let's revisit some fundamental concepts. When we collect data, especially large sets, it is often grouped into class intervals. For example, if we record the heights of students, instead of listing every height, we group them into ranges like 150-155 cm, 155-160 cm, and so on. This grouping helps us understand the data distribution more clearly.
Along with class intervals, we use a frequency distribution table to show how many data points fall into each class interval. Frequency tells us the count of observations in each group.
Now, cumulative frequency builds on this idea by giving us a running total of frequencies up to a certain class interval. This running total helps us understand how data accumulates across intervals, which is especially useful in finding medians, percentiles, and other statistical measures.
For example, if you want to know how many students are shorter than 160 cm, cumulative frequency tells you exactly that by adding frequencies of all classes up to 160 cm.
Understanding cumulative frequency is vital for data analysis, as it provides insights into the distribution and helps in interpreting the data effectively.
Cumulative Frequency is defined as the sum of frequencies of all class intervals up to a certain point in the data set. It tells us how many observations fall below (or above) a particular class boundary.
There are two main types of cumulative frequency:
Both types are useful depending on the problem. Less than type is commonly used for finding medians and percentiles, while more than type is often used in reliability and survival analysis.
| Class Interval (cm) | Frequency (f) | Less than Cumulative Frequency | More than Cumulative Frequency |
|---|---|---|---|
| 150 - 155 | 5 | 5 | 50 |
| 155 - 160 | 10 | 15 | 45 |
| 160 - 165 | 15 | 30 | 35 |
| 165 - 170 | 20 | 50 | 20 |
| 170 - 175 | 20 | 70 | 0 |
Let's understand how to build a cumulative frequency table step-by-step from a frequency distribution:
graph TD A[Start with frequency distribution table] --> B[Identify first class interval frequency] B --> C[Set less than cumulative frequency of first class = its frequency] C --> D[Add frequency of next class to previous cumulative frequency] D --> E{More classes left?} E -- Yes --> D E -- No --> F[Complete less than cumulative frequency column] F --> G[For more than cumulative frequency, start from last class frequency] G --> H[Add frequency of previous class cumulatively upwards] H --> I{More classes left?} I -- Yes --> H I -- No --> J[Complete more than cumulative frequency column]This process ensures accurate running totals for both less than and more than cumulative frequencies.
An ogive is a graph that represents cumulative frequency distribution. It helps visualize how data accumulates across class intervals.
There are two types of ogives:
Ogives are useful for quickly estimating medians, percentiles, and understanding data spread.
| Height (cm) | Frequency |
|---|---|
| 150 - 155 | 4 |
| 155 - 160 | 8 |
| 160 - 165 | 12 |
| 165 - 170 | 6 |
| 170 - 175 | 10 |
Step 1: Write down the frequencies in order.
Step 2: Calculate the less than cumulative frequency by adding frequencies cumulatively:
Answer: The less than cumulative frequencies are 4, 12, 24, 30, and 40 respectively.
| Consumption (kWh) | Frequency |
|---|---|
| 0 - 50 | 7 |
| 50 - 100 | 15 |
| 100 - 150 | 20 |
| 150 - 200 | 8 |
| 200 - 250 | 10 |
Step 1: Start from the last class interval frequency.
Step 2: Calculate more than cumulative frequency by adding frequencies cumulatively upwards:
Answer: The more than cumulative frequencies are 60, 53, 38, 18, and 10 respectively for the classes starting at 0, 50, 100, 150, and 200 kWh.
| Rainfall (mm) | Frequency | Less than Cumulative Frequency |
|---|---|---|
| 0 - 5 | 3 | 3 |
| 5 - 10 | 7 | 10 |
| 10 - 15 | 12 | 22 |
| 15 - 20 | 8 | 30 |
| 20 - 25 | 5 | 35 |
Step 1: Identify the upper class boundaries: 5, 10, 15, 20, 25 mm.
Step 2: Plot points with x-axis as upper class boundaries and y-axis as less than cumulative frequency:
Step 3: Join these points with a smooth curve to form the ogive.
Interpretation: The ogive shows how rainfall accumulates. For example, about 22 days had rainfall less than 15 mm. The steepness between points indicates frequency density.
| Income (INR 000) | Frequency | Less than Cumulative Frequency |
|---|---|---|
| 0 - 50 | 10 | 10 |
| 50 - 100 | 20 | 30 |
| 100 - 150 | 30 | 60 |
| 150 - 200 | 25 | 85 |
| 200 - 250 | 15 | 100 |
Step 1: Total frequency \( N = 100 \). Median position is at \( \frac{N}{2} = 50 \).
Step 2: Identify the median class where cumulative frequency just exceeds 50. Here, it is 100 - 150 (cumulative frequency 60).
Step 3: Use the median formula:
\[ \text{Median} = l + \left(\frac{\frac{N}{2} - F}{f}\right) \times h \]
where:
Step 4: Substitute values:
\[ \text{Median} = 100 + \left(\frac{50 - 30}{30}\right) \times 50 = 100 + \frac{20}{30} \times 50 = 100 + \frac{2}{3} \times 50 = 100 + 33.33 = 133.33 \]
Answer: The median income is approximately INR 133,330.
| Score Range | Frequency | Less than Cumulative Frequency |
|---|---|---|
| 0 - 10 | 5 | 5 |
| 10 - 20 | 15 | 20 |
| 20 - 30 | 30 | 50 |
| 30 - 40 | 20 | 70 |
| 40 - 50 | 10 | 80 |
Step 1: Total frequency \( N = 80 \).
Step 2: Find 25th percentile position: \( P_{25} = \frac{25}{100} \times 80 = 20 \).
Step 3: Locate the class where cumulative frequency ≥ 20. It is 10 - 20 (cumulative frequency 20).
Step 4: Use percentile formula (similar to median):
\[ P_k = l + \left(\frac{kN/100 - F}{f}\right) \times h \]
where:
Step 5: Calculate 25th percentile:
\[ P_{25} = 10 + \left(\frac{20 - 5}{15}\right) \times 10 = 10 + \frac{15}{15} \times 10 = 10 + 10 = 20 \]
Step 6: Find 75th percentile position: \( P_{75} = \frac{75}{100} \times 80 = 60 \).
Step 7: Locate class where cumulative frequency ≥ 60. It is 30 - 40 (cumulative frequency 70).
Step 8: Apply formula for 75th percentile:
\[ P_{75} = 30 + \left(\frac{60 - 50}{20}\right) \times 10 = 30 + \frac{10}{20} \times 10 = 30 + 5 = 35 \]
Answer: The 25th percentile is 20 and the 75th percentile is 35.
When to use: When constructing cumulative frequency tables to avoid errors.
When to use: During time-constrained exams for faster data interpretation.
When to use: After completing cumulative frequency tables to ensure accuracy.
When to use: While drawing cumulative frequency graphs.
When to use: When finding median from grouped frequency data.
Progress tracking is paywalled — subscribe to mark subtopics as understood and save your streak.
Go to practice →