👁 Preview — Study, Practice and Revise are open; mock tests and the rest of the syllabus unlock on subscription. Unlock all · ₹4,999
← Back to Collection and Classification of Data
Study mode

Cumulative Frequency

Introduction to Cumulative Frequency

Before diving into cumulative frequency, let's revisit some fundamental concepts. When we collect data, especially large sets, it is often grouped into class intervals. For example, if we record the heights of students, instead of listing every height, we group them into ranges like 150-155 cm, 155-160 cm, and so on. This grouping helps us understand the data distribution more clearly.

Along with class intervals, we use a frequency distribution table to show how many data points fall into each class interval. Frequency tells us the count of observations in each group.

Now, cumulative frequency builds on this idea by giving us a running total of frequencies up to a certain class interval. This running total helps us understand how data accumulates across intervals, which is especially useful in finding medians, percentiles, and other statistical measures.

For example, if you want to know how many students are shorter than 160 cm, cumulative frequency tells you exactly that by adding frequencies of all classes up to 160 cm.

Understanding cumulative frequency is vital for data analysis, as it provides insights into the distribution and helps in interpreting the data effectively.

Definition and Types of Cumulative Frequency

Cumulative Frequency is defined as the sum of frequencies of all class intervals up to a certain point in the data set. It tells us how many observations fall below (or above) a particular class boundary.

There are two main types of cumulative frequency:

  • Less than type cumulative frequency: This is the running total of frequencies from the first class up to a given class interval. It answers questions like, "How many data points are less than a certain value?"
  • More than type cumulative frequency: This is the running total of frequencies from a given class interval to the last class. It answers questions like, "How many data points are more than a certain value?"

Both types are useful depending on the problem. Less than type is commonly used for finding medians and percentiles, while more than type is often used in reliability and survival analysis.

Example Frequency Distribution with Cumulative Frequencies
Class Interval (cm) Frequency (f) Less than Cumulative Frequency More than Cumulative Frequency
150 - 155 5 5 50
155 - 160 10 15 45
160 - 165 15 30 35
165 - 170 20 50 20
170 - 175 20 70 0

Constructing a Cumulative Frequency Table

Let's understand how to build a cumulative frequency table step-by-step from a frequency distribution:

graph TD    A[Start with frequency distribution table] --> B[Identify first class interval frequency]    B --> C[Set less than cumulative frequency of first class = its frequency]    C --> D[Add frequency of next class to previous cumulative frequency]    D --> E{More classes left?}    E -- Yes --> D    E -- No --> F[Complete less than cumulative frequency column]    F --> G[For more than cumulative frequency, start from last class frequency]    G --> H[Add frequency of previous class cumulatively upwards]    H --> I{More classes left?}    I -- Yes --> H    I -- No --> J[Complete more than cumulative frequency column]

This process ensures accurate running totals for both less than and more than cumulative frequencies.

Graphical Representation: Ogives

An ogive is a graph that represents cumulative frequency distribution. It helps visualize how data accumulates across class intervals.

There are two types of ogives:

  • Less than ogive: Plots cumulative frequency against the upper class boundaries.
  • More than ogive: Plots cumulative frequency against the lower class boundaries.

Ogives are useful for quickly estimating medians, percentiles, and understanding data spread.

Class Boundaries (cm) Cumulative Frequency Less than Ogive More than Ogive 155 160 165 170 175 10 20 40 60 80

Worked Examples

Example 1: Calculating Less Than Cumulative Frequency Easy
Given the following frequency distribution of students' heights (in cm), calculate the less than cumulative frequency.
Height (cm)Frequency
150 - 1554
155 - 1608
160 - 16512
165 - 1706
170 - 17510

Step 1: Write down the frequencies in order.

Step 2: Calculate the less than cumulative frequency by adding frequencies cumulatively:

  • Up to 155 cm: 4
  • Up to 160 cm: 4 + 8 = 12
  • Up to 165 cm: 12 + 12 = 24
  • Up to 170 cm: 24 + 6 = 30
  • Up to 175 cm: 30 + 10 = 40

Answer: The less than cumulative frequencies are 4, 12, 24, 30, and 40 respectively.

Example 2: Constructing More Than Cumulative Frequency Table Medium
From the following frequency distribution of monthly household electricity consumption (in kWh), compute the more than cumulative frequencies.
Consumption (kWh)Frequency
0 - 507
50 - 10015
100 - 15020
150 - 2008
200 - 25010

Step 1: Start from the last class interval frequency.

Step 2: Calculate more than cumulative frequency by adding frequencies cumulatively upwards:

  • More than 200 kWh: 10
  • More than 150 kWh: 10 + 8 = 18
  • More than 100 kWh: 18 + 20 = 38
  • More than 50 kWh: 38 + 15 = 53
  • More than 0 kWh: 53 + 7 = 60

Answer: The more than cumulative frequencies are 60, 53, 38, 18, and 10 respectively for the classes starting at 0, 50, 100, 150, and 200 kWh.

Example 3: Plotting an Ogive Medium
Using the less than cumulative frequency data below for daily rainfall (in mm), plot the ogive and interpret the graph.
Rainfall (mm)FrequencyLess than Cumulative Frequency
0 - 533
5 - 10710
10 - 151222
15 - 20830
20 - 25535

Step 1: Identify the upper class boundaries: 5, 10, 15, 20, 25 mm.

Step 2: Plot points with x-axis as upper class boundaries and y-axis as less than cumulative frequency:

  • (5, 3), (10, 10), (15, 22), (20, 30), (25, 35)

Step 3: Join these points with a smooth curve to form the ogive.

Interpretation: The ogive shows how rainfall accumulates. For example, about 22 days had rainfall less than 15 mm. The steepness between points indicates frequency density.

Example 4: Finding Median Using Cumulative Frequency Hard
The following table shows the income distribution (in INR thousands) of 100 households. Find the median income.
Income (INR 000)FrequencyLess than Cumulative Frequency
0 - 501010
50 - 1002030
100 - 1503060
150 - 2002585
200 - 25015100

Step 1: Total frequency \( N = 100 \). Median position is at \( \frac{N}{2} = 50 \).

Step 2: Identify the median class where cumulative frequency just exceeds 50. Here, it is 100 - 150 (cumulative frequency 60).

Step 3: Use the median formula:

\[ \text{Median} = l + \left(\frac{\frac{N}{2} - F}{f}\right) \times h \]

where:

  • \( l = 100 \) (lower boundary of median class)
  • \( N = 100 \)
  • \( F = 30 \) (cumulative frequency before median class)
  • \( f = 30 \) (frequency of median class)
  • \( h = 50 \) (class width)

Step 4: Substitute values:

\[ \text{Median} = 100 + \left(\frac{50 - 30}{30}\right) \times 50 = 100 + \frac{20}{30} \times 50 = 100 + \frac{2}{3} \times 50 = 100 + 33.33 = 133.33 \]

Answer: The median income is approximately INR 133,330.

Example 5: Calculating Percentiles Hard
Using the following cumulative frequency distribution of test scores, find the 25th and 75th percentiles.
Score RangeFrequencyLess than Cumulative Frequency
0 - 1055
10 - 201520
20 - 303050
30 - 402070
40 - 501080

Step 1: Total frequency \( N = 80 \).

Step 2: Find 25th percentile position: \( P_{25} = \frac{25}{100} \times 80 = 20 \).

Step 3: Locate the class where cumulative frequency ≥ 20. It is 10 - 20 (cumulative frequency 20).

Step 4: Use percentile formula (similar to median):

\[ P_k = l + \left(\frac{kN/100 - F}{f}\right) \times h \]

where:

  • \( l = 10 \) (lower boundary of percentile class)
  • \( k = 25 \)
  • \( N = 80 \)
  • \( F = 5 \) (cumulative frequency before class)
  • \( f = 15 \) (frequency of class)
  • \( h = 10 \) (class width)

Step 5: Calculate 25th percentile:

\[ P_{25} = 10 + \left(\frac{20 - 5}{15}\right) \times 10 = 10 + \frac{15}{15} \times 10 = 10 + 10 = 20 \]

Step 6: Find 75th percentile position: \( P_{75} = \frac{75}{100} \times 80 = 60 \).

Step 7: Locate class where cumulative frequency ≥ 60. It is 30 - 40 (cumulative frequency 70).

Step 8: Apply formula for 75th percentile:

\[ P_{75} = 30 + \left(\frac{60 - 50}{20}\right) \times 10 = 30 + \frac{10}{20} \times 10 = 30 + 5 = 35 \]

Answer: The 25th percentile is 20 and the 75th percentile is 35.

Formula Bank

Less Than Cumulative Frequency
\[ CF_i = \sum_{j=1}^i f_j \]
where: \( CF_i \) = cumulative frequency up to class \( i \), \( f_j \) = frequency of \( j^{th} \) class
More Than Cumulative Frequency
\[ CF_i = \sum_{j=i}^n f_j \]
where: \( CF_i \) = cumulative frequency from class \( i \) onwards, \( f_j \) = frequency of \( j^{th} \) class, \( n \) = total number of classes
Median (Grouped Data)
\[ \text{Median} = l + \left(\frac{\frac{N}{2} - F}{f}\right) \times h \]
where: \( l \) = lower boundary of median class, \( N \) = total frequency, \( F \) = cumulative frequency before median class, \( f \) = frequency of median class, \( h \) = class width

Tips & Tricks

Tip: Always start cumulative frequency calculation from the first class interval for 'less than' type and from the last for 'more than' type.

When to use: When constructing cumulative frequency tables to avoid errors.

Tip: Use cumulative frequency graphs (ogives) to quickly estimate median and percentiles without detailed calculations.

When to use: During time-constrained exams for faster data interpretation.

Tip: Check that the final cumulative frequency equals the total number of observations to verify calculations.

When to use: After completing cumulative frequency tables to ensure accuracy.

Tip: Remember that class boundaries are used on the x-axis for plotting ogives, not class marks.

When to use: While drawing cumulative frequency graphs.

Tip: For grouped data median calculation, carefully identify the median class by locating the class where cumulative frequency crosses \( \frac{N}{2} \).

When to use: When finding median from grouped frequency data.

Common Mistakes to Avoid

❌ Adding frequencies incorrectly by mixing 'less than' and 'more than' cumulative frequency methods.
✓ Use one method consistently: sum frequencies from start for 'less than' and from end for 'more than'.
Why: Confusion arises because both methods look similar but have opposite summation directions.
❌ Using class marks instead of class boundaries when plotting ogives.
✓ Always use upper class boundaries for less than ogives and lower class boundaries for more than ogives.
Why: Class marks do not represent the exact limits needed for cumulative frequency graphs.
❌ Forgetting to include the cumulative frequency of the first class interval as the starting point.
✓ Start cumulative frequency with the frequency of the first class interval.
Why: Missing the first frequency leads to incorrect cumulative totals.
❌ Incorrectly identifying the median class by not locating where cumulative frequency crosses \( \frac{N}{2} \).
✓ Find the class interval where cumulative frequency is just greater than or equal to \( \frac{N}{2} \).
Why: Median depends on the correct median class; misidentification leads to wrong median calculation.
❌ Not verifying that the final cumulative frequency matches total observations.
✓ Always cross-check final cumulative frequency equals total frequency \( N \).
Why: Ensures no frequency is missed or double-counted.
Curated videos per subtopic
Top YouTube explainers, AI-ranked for your exam and language. Unlocks with subscription.
Unlock

Try Practice next.

Progress tracking is paywalled — subscribe to mark subtopics as understood and save your streak.

Go to practice →
Ask a doubt
Cumulative Frequency · 10 free messages
Ask me anything about this subtopic. You have 10 free messages this session — chat history isn't saved in preview.