Web Lesson 13: Measures of Spread I

Measures of Spread

Finding the Range

Finding the Inter-quartile Range

Hand in your workings and answers

1. If you have a list of data:

2. If you have a table of ungrouped data:

3. If you have a table of grouped data:

There are three different measures of spread:

Range (often used together with the mode)

Inter-quartile Range (often used together with the median)

Standard Deviation (often used together with the mean)

Each one is worked out in a different way:

1. If you have a list of data:

2. If you have a table of ungrouped data:

3. If you have a table of grouped data:

The pass mark (to avoid additional homework on this topic) is 8/10

Range
(often used together with the mode)

Re-write all of the values, in numerical order and count how many there are:

The lower quartile (Q₁) is the (¼n+½)th value; so work out (¼n+½) and then count across your list to find that value:

The upper quartile (Q₃) is the (¾n+½)th value; so work out (¾n+½) and then count across your list to find that value:

I.Q.R. = Q₃ - Q₁

Write a Cumulative Frequency Table. The last number in the ‘F’ row is called ‘n’

The lower quartile (Q₁) is the (¼n+½)th value; so work out (¼n+½) and then look along the cumulative frequencies for this number (or above)
Read across to the value of ‘x’. This is Q₁

The upper quartile (Q₃) is the (¾n+½)th value; so work out (¾n+½) and then look along the cumulative frequencies for this number (or above)
Read across to the value of ‘x’. This is Q₃

I.Q.R. = Q₃ - Q₁

Rather than use a "Cumulative Frequency Curve" (as we did at G.C.S.E. level) we can use the same Interpolation method that we used to find the median:

Write a Cumulative Frequency Table (remember to use the upper class boundary of each class). The last number in this row is called ‘n’

The lower quartile (Q₁) is the (¼n+½)th value; so we work out what (¼n+½) gives:

Squeeze an extra column into our table to help us find this value:

Find Q₃ in a similar way

I.Q.R. = Q₃ - Q₁

Find the highest value in the data

Find the lowest value in the data

The range is found by subtracting these

Label the rows as ‘x’ (for the values) and ‘f’ (for the frequencies)

Subtract the lowest value of 'x' from the highest value of 'x' (Note: First cross out any classes where the frequency is zero)

Once data is grouped we cannot find the range any more!

Sometimes, we need to find a value that ends in '½', such as the 12½th value
In this case, use: ½(12th value + 13th value)
But if we need to find a value that ends in '¼', such as the 3¼th value
In that case, we round DOWN and find the 3rd value instead

Sometimes, we need to find a value that ends in '½', such as the 12½th value
In this case, use: ½(12th value + 13th value)
But if we need to find a value that ends in '¾', such as the 8¾th value
In that case, we round UP and find the 9th value instead

Note: Strictly speaking, if n = 79, then Q₁ is the (¼(79)+½)th value (as we did above) - but in practice, when n is large (bigger than 30), the difference between using ¼n+½ and just ¼n isn't really worth bothering with...
Similarly, for Q₃, if n is bigger than 30, then just use: ¾n
● We need to find some differences using our table: 'Δ₁', 'Δ₂', 'D₁' & 'D₂' need to be found

● Among these differences, only D₁ is unknown - but it can be found using: D₁/D₂ = Δ₁/Δ₂

● D₁ tells us what to add to the class boundary to the left of the quartile to estimate the quartile (just as we did for the median)

● If the data is discrete, then round the answer

The range tells us the difference between the highest and lowest values in the data

The inter-quartile range tells us the range for the central 50% of the data. This is more useful as it excludes the extreme values which make up the range

The standard deviation is more complicated. It is used a lot in 'A' level statistics. As a rough guide, for bell-shaped (normally distributed) data, about 68% of the values lie within one standard deviation of the mean

e.g. Find the range of this data: 2, 4, 6, 5, 3, 5, 3, 2, 7, 3, 8, 4, 3, 6, 3, 4, 2, 3

e.g. Find the range of this data:

e.g. Find the range of this data:

e.g. Find the quartiles of this data and find the I.Q.R: 1.3, 1.2, 1.4, 1.5, 1.2, 1.6, 1.5, 1.8, 1.5, 2.0, 1.7

e.g. Find the quartiles and the inter-quartile range of this data:

e.g. What with all this extra tuition, the amount of stuff a student has to carry around is too much!
To investigate this, the weights of the school bags of a class of students were measured:

Estimate the I.Q.R. of this data

Question 1: A class of 12 Math'scool students was asked how many questions they think should be set for homework:

Find the range for these data:

Question 2: The number of 'sick days' taken by 200 employees in 2001 and in 2002 is shown in the table:

(a): Find the range for the number of sick days in 2001
(b): Find the range for the number of sick days in 2002

Question 3: The salaries of the 200 workers were also recorded:

Explain why it is not possible to find the range for these data

Question 4: Find the quartiles and I.Q.R for these data: 23, 30, 29, 20, 22, 29, 27, 23, 27, 24

Question 5: A class of 12 Math'scool students was asked how many questions they think should be set for homework:

Find the quartiles and the I.Q.R. for these data:

Question 6: A group of old people were asked to count how many grey hairs they have:

Find the quartiles and the I.Q.R. for these data:

Question 7: I'm fed up that some students don't hand in all their corrections. I decided to investigate the number of outstanding corrections for each statistics student before deciding upon a plan of action!

(a): Determine the quartiles and the I.Q.R. of these data
(b): I decided that students whose number of outstanding corrections EXCEEDS the 3rd quartile will be expelled. How many students will be expelled?

Question 8: The number of 'sick days' taken by 200 employees in 2001 and in 2002 is shown in the table:

(a): Find the I.Q.R. for the number of sick days in 2001
(b): Find the I.Q.R. for the number of sick days in 2002

Question 9: I asked last year's students how long it took them to do this question

(a): Determine the interquartile range of the time taken to do this question
(b): How long did it take you to do that?

Question 10: The salaries of the 200 workers were also recorded:

Find the median and quartiles of the salaries

Find the highest value in the data
Find the lowest value in the data
The range is found by subtracting these

      Here we have a "list of data"
 
      We can see that the highest value is '8'
      And the lowest value is '2'
      So the range is 8 - 2  =  6
 
      Note: The range can also be written as '2 - 8', meaning the values lie between 2 & 8
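
If you want to check a range by computer, the three steps above can be sketched in Python. The function name `data_range` is just an illustrative choice.

```python
def data_range(values):
    """Return (lowest, highest, range) for a non-empty list of numbers."""
    lowest = min(values)    # find the lowest value in the data
    highest = max(values)   # find the highest value in the data
    return lowest, highest, highest - lowest  # the range is the difference


data = [2, 4, 6, 5, 3, 5, 3, 2, 7, 3, 8, 4, 3, 6, 3, 4, 2, 3]
lowest, highest, spread = data_range(data)
print(f"lowest = {lowest}, highest = {highest}, range = {spread}")
```

For the list in the example this gives lowest = 2, highest = 8 and range = 6, agreeing with the working above.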

Label the rows as ‘x’ (for the values) and ‘f’ (for the frequencies)
Subtract the lowest value of 'x' from the highest value of 'x'
(Note: First cross out any classes where the frequency is zero)

No of Wins	0	1	2	3	4
frequency	12	27	25	13	2

      We have a "table of ungrouped data"
 
      The highest value is '4'
      The lowest value is '0'
      So, the range is 4 - 0  =  4
 
      Note: The range can also be written as '0 - 4', meaning the values lie between 0 & 4
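
A possible Python sketch of the same idea for a frequency table, including the "cross out any classes where the frequency is zero" step. The function name `table_range` and the extra zero-frequency class used in the check are assumptions made for this illustration only.

```python
def table_range(table):
    """Range of ungrouped data given as a dict {x: frequency}.

    Values of x whose frequency is zero are crossed out first,
    because nobody actually scored them.
    """
    observed = [x for x, f in table.items() if f > 0]
    return max(observed) - min(observed)


wins = {0: 12, 1: 27, 2: 25, 3: 13, 4: 2}
print(table_range(wins))

# Hypothetical extra class with zero frequency: it must not change the range
wins_with_empty_class = {**wins, 5: 0}
print(table_range(wins_with_empty_class))
```

Both calls give 4, because the class x = 5 has frequency zero and is ignored.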

Once data is grouped we cannot find the range any more!

Voltage	5.6 – 5.8	5.8 – 5.9	5.9 – 6.0	6.0 – 6.1	6.1 – 6.4
frequency	20	20	80	50	30

      We have a "table of grouped data"
 
      It is not possible to find the range because, in the 1st class which is '5.6-5.8',
      we don't know what the exact value of the smallest number was…
 
      We might instead use the 10th percentile (P₁₀) and the 90th percentile (P₉₀)
      as alternatives:

      The 10th percentile is the (n/10)th value
      The 90th percentile is the (9n/10)th value

      These can be found using the same interpolation method as you learnt to find the median
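
As a rough sketch only, here is how the two percentiles could be estimated by interpolation in Python for the voltage table. It assumes the class limits shown (5.6, 5.8, 5.9, 6.0, 6.1, 6.4) can be used directly as class boundaries; the function name `interpolate` is invented for the illustration, and the resulting figures come from this sketch, not from the lesson.

```python
from fractions import Fraction


def interpolate(boundaries, freqs, target):
    """Estimate the target-th value of grouped data by linear interpolation.

    boundaries: lowest class boundary, then each upper class boundary
    freqs:      frequency of each class
    """
    cumulative = 0
    for lower, upper, f in zip(boundaries, boundaries[1:], freqs):
        if f > 0 and cumulative + f >= target:
            # share of the class width needed to reach the target
            return lower + (upper - lower) * (target - cumulative) / f
        cumulative += f
    raise ValueError("target is larger than n")


# Fractions built from strings avoid floating-point rounding noise
boundaries = [Fraction(b) for b in ("5.6", "5.8", "5.9", "6.0", "6.1", "6.4")]
freqs = [20, 20, 80, 50, 30]
n = sum(freqs)

p10 = interpolate(boundaries, freqs, Fraction(n, 10))      # the (n/10)th value
p90 = interpolate(boundaries, freqs, Fraction(9 * n, 10))  # the (9n/10)th value
print(float(p10), float(p90))
```

Under those assumptions the sketch gives P₁₀ = 5.8 and P₉₀ = 6.2.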

	  1.2,  1.2,  1.3,  1.4,  1.5,  1.5,  1.5,  1.6,  1.7,  1.8,  2.0
	 └———————————————————— 11 values in the data ————————————————————┘
	                                n=11

   
  So Q₁ is the (¼(11)+½)th value = 3¼th value (we round this down to the 3rd value):

	  1.2,  1.2,  1.3,  1.4,  1.5,  1.5,  1.5,  1.6,  1.7,  1.8,  2.0
	               ▲
	               3rd value, so Q₁ is 1.3

  So Q₃ is the (¾(11)+½)th value = 8¾th value (we round this up to the 9th value):

	  1.2,  1.2,  1.3,  1.4,  1.5,  1.5,  1.5,  1.6,  1.7,  1.8,  2.0
	                                                   ▲
	                                                   9th value, so Q₃ is 1.7

  So the Inter-quartile range: I.Q.R = 1.7 - 1.3 = 0.4
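
The counting and rounding rules used here (round DOWN for '¼', average the two neighbours for '½', round UP for '¾') can be sketched in Python as follows. The names `value_at` and `quartiles` are illustrative choices, and other textbooks and software use slightly different quartile rules, so treat this as a sketch of this lesson's method only.

```python
from fractions import Fraction


def value_at(ordered, position):
    """Value at a 1-based position that may end in 1/4, 1/2 or 3/4."""
    whole = int(position)        # e.g. 3 for the 3 1/4 th value
    part = position - whole      # e.g. 1/4
    if part == Fraction(1, 2):   # ends in 1/2: average the two neighbours
        return (ordered[whole - 1] + ordered[whole]) / 2
    if part > Fraction(1, 2):    # ends in 3/4: round UP
        return ordered[whole]
    return ordered[whole - 1]    # ends in 1/4 (or is exact): round DOWN


def quartiles(values):
    """Return (Q1, Q3) using the (n/4 + 1/2) and (3n/4 + 1/2) positions."""
    ordered = sorted(values)     # re-write the values in numerical order
    n = len(ordered)             # count how many there are
    q1 = value_at(ordered, Fraction(n, 4) + Fraction(1, 2))
    q3 = value_at(ordered, Fraction(3 * n, 4) + Fraction(1, 2))
    return q1, q3


data = [1.3, 1.2, 1.4, 1.5, 1.2, 1.6, 1.5, 1.8, 1.5, 2.0, 1.7]
q1, q3 = quartiles(data)
iqr = round(q3 - q1, 10)         # rounding hides floating-point noise
print(q1, q3, iqr)
```

For the 11 values in the example this gives Q₁ = 1.3, Q₃ = 1.7 and I.Q.R. = 0.4, matching the working above.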

No of Wins	0	1	2	3	4
frequency	12	27	25	13	2

Upper Boundary	up to 0	up to 1	up to 2	up to 3	up to 4
Cumulative Frequency	12	12+27= 39	12+27 +25= 64	12+27 +25+13= 77	12+27 +25+13 +2= 79

Since we know n = 79, Q₁ is the (¼(79)+½)th value = 20¼th value ≈ 20th value
So, to find Q₁, we look up the 20th value in the cumulative frequencies

Since 20 is NOT THERE, we must look up the next number ABOVE 20 in the
cumulative frequencies (which is 39, in the 'up to 1' column)

So, Q₁ is '1'

Upper Boundary	up to 0	up to 1	up to 2	up to 3	up to 4
Cumulative Frequency	12	12+27= 39	12+27 +25= 64	12+27 +25+13= 77	12+27 +25+13 +2= 79

Since we know n = 79, Q₃ is the (¾(79)+½)th value = 59¾th value ≈ 60th value
So, to find Q₃, we look up the 60th value in the cumulative frequencies

Since 60 is NOT THERE, we must look up the next number ABOVE 60 in the
cumulative frequencies (which is 64, in the 'up to 2' column)

So, Q₃ is '2'

  So the Inter-quartile range: I.Q.R = 2 - 1 = 1
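
The cumulative frequency look-up can be sketched in Python too. This is a minimal illustration of the method used in this example; the names `nth_value` and `table_quartile` are hypothetical.

```python
from fractions import Fraction
from itertools import accumulate


def nth_value(xs, cumulative, k):
    """x for the first cumulative frequency that is k or above."""
    for x, big_f in zip(xs, cumulative):
        if big_f >= k:
            return x
    raise ValueError("k is larger than n")


def table_quartile(xs, freqs, fraction):
    """Quartile of ungrouped table data; fraction is 1/4 for Q1, 3/4 for Q3."""
    cumulative = list(accumulate(freqs))   # the cumulative frequency row
    n = cumulative[-1]                     # the last number in the row is n
    position = fraction * n + Fraction(1, 2)
    whole = int(position)
    part = position - whole
    if part == Fraction(1, 2):             # ends in 1/2: average two values
        low = nth_value(xs, cumulative, whole)
        high = nth_value(xs, cumulative, whole + 1)
        return (low + high) / 2
    k = whole + 1 if part > Fraction(1, 2) else whole  # round UP or DOWN
    return nth_value(xs, cumulative, k)


wins = [0, 1, 2, 3, 4]
freqs = [12, 27, 25, 13, 2]
q1 = table_quartile(wins, freqs, Fraction(1, 4))
q3 = table_quartile(wins, freqs, Fraction(3, 4))
print(q1, q3, q3 - q1)
```

For the 'No of Wins' table this gives Q₁ = 1, Q₃ = 2 and I.Q.R. = 1, as found above.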

Mass	5 – 15	15 – 25	25 – 30	30 – 40	40 – 50
frequency	8	10	9	10	3

Mass (U.C.B.)	15	25	30	40	50
cumulative frequency	8	18	27	37	40

  n = 40

  If n is larger than 30 then we can choose whether we want to use
  the (¼n + ½)th value or just the ¼n th value.
  In this case, it was easier to use ¼n:

  ¼(n)  =  ¼(40)  =  10th value

Mass (U.C.B.)	15	Q₁	25	30	40	50
cumulative frequency	8	10	18	27	37	40

  From the table: Δ₁ = 10 - 8 = 2, Δ₂ = 18 - 8 = 10 and D₂ = 25 - 15 = 10

	D₁/D₂ = Δ₁/Δ₂

    =>  D₁/10 = 2/10

    =>  D₁ = 2

  So: Q₁(estimate) = 15 + 2 = 17

Mass (U.C.B.)	15	17	25	30	40	50
cumulative frequency	8	10	18	27	37	40

  To find Q₃, we need to locate the ¾n th value = ¾(40) = 30th value

  Start by inserting a column at F = 30:

Mass (U.C.B.)	15	25	30	Q₃	40	50
cumulative frequency	8	18	27	30	37	40

 
	Δ₁ = 30 - 27 = 3
	Δ₂ = 37 - 27 = 10
	D₁ = ???
	D₂ = 10

	D₁/D₂ = Δ₁/Δ₂   =>   D₁/10 = 3/10   =>   D₁ = 3

  So: Q₃(estimate) = 30 + 3 = 33

  So the Inter-quartile range: I.Q.R = 33 - 17 = 16
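
The interpolation above can be sketched in Python. The variable names mirror the lesson's Δ₁, Δ₂, D₁ and D₂; the function name `interpolated_value` is an assumption for this illustration, and the sketch uses the ¼n and ¾n shortcut because n is bigger than 30.

```python
def interpolated_value(boundaries, freqs, target):
    """Estimate the target-th value of grouped data by interpolation.

    boundaries: lowest class boundary, then each upper class boundary
    freqs:      frequency of each class
    """
    cumulative = 0
    for lower, upper, f in zip(boundaries, boundaries[1:], freqs):
        if f > 0 and cumulative + f >= target:
            delta_1 = target - cumulative  # how far into the class we count
            delta_2 = f                    # all the values in the class
            d_2 = upper - lower            # the width of the class
            d_1 = d_2 * delta_1 / delta_2  # from D1/D2 = delta1/delta2
            return lower + d_1
        cumulative += f
    raise ValueError("target is larger than n")


boundaries = [5, 15, 25, 30, 40, 50]   # masses, using class boundaries
freqs = [8, 10, 9, 10, 3]
n = sum(freqs)                         # n = 40

q1 = interpolated_value(boundaries, freqs, n / 4)       # the 10th value
q3 = interpolated_value(boundaries, freqs, 3 * n / 4)   # the 30th value
print(q1, q3, q3 - q1)
```

For the school bag data this gives Q₁ = 17, Q₃ = 33 and I.Q.R. = 16, agreeing with the estimates above.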

[Diagram: the Q₁ table annotated with D₁ = ??? and D₂ = 10 above the Mass (U.C.B.) row, and Δ₁ = 2 and Δ₂ = 10 below the cumulative frequency row]