Interquartile range - AbsoluteAstronomy.com

Descriptive statistics

Descriptive statistics quantitatively describe the main features of a collection of data. Descriptive statistics are distinguished from inferential statistics , in that descriptive statistics aim to summarize a data set, rather than use the data to learn about the population that the data are...

, the interquartile range (IQR), also called the midspread or middle fifty, is a measure of statistical dispersion

Statistical dispersion

In statistics, statistical dispersion is variability or spread in a variable or a probability distribution...

, being equal to the difference between the upper and lower quartile

Quartile

In descriptive statistics, the quartiles of a set of values are the three points that divide the data set into four equal groups, each representing a fourth of the population being sampled...

s. IQR = Q₃ − Q₁

Use

Unlike (total) range

Range (statistics)

In the descriptive statistics, the range is the length of the smallest interval which contains all the data. It is calculated by subtracting the smallest observation from the greatest and provides an indication of statistical dispersion.It is measured in the same units as the data...

, the interquartile range is a robust statistic, having a breakdown point of 25%, and is thus often preferred to the total range.

The IQR is used to build box plot

Box plot

In descriptive statistics, a box plot or boxplot is a convenient way of graphically depicting groups of numerical data through their five-number summaries: the smallest observation , lower quartile , median , upper quartile , and largest observation...

s, simple graphical representations of a probability distribution

Probability distribution

In probability theory, a probability mass, probability density, or probability distribution is a function that describes the probability of a random variable taking certain values....

.

For a symmetric distribution (so the median equals the midhinge

Midhinge

In statistics, the midhinge is the average of the first and third quartiles and is thus a measure of location.Equivalently, it is the 25% trimmed mid-range; it is an L-estimator....

, the average of the first and third quartiles), half the IQR equals the median absolute deviation

Median absolute deviation

In statistics, the median absolute deviation is a robust measure of the variability of a univariate sample of quantitative data. It can also refer to the population parameter that is estimated by the MAD calculated from a sample....

(MAD).

The median

Median

In probability theory and statistics, a median is described as the numerical value separating the higher half of a sample, a population, or a probability distribution, from the lower half. The median of a finite list of numbers can be found by arranging all the observations from lowest value to...

is the corresponding measure of central tendency

Central tendency

In statistics, the term central tendency relates to the way in which quantitative data is clustered around some value. A measure of central tendency is a way of specifying - central value...

Examples

Data set in a table

i	x[i]	Quartile
1	102
2	104
3	105	Q₁
4	107
5	108
6	109	Q₂ (median)
7	110
8	112
9	115	Q₃
10	116
11	118

For the data in this table the interquartile range is IQR = 115 − 105 = 10.

Data set in a plain-text box plot



                 

                             +-----+-+  

   o           *     |-------|     | |---|

                             +-----+-+ 

                 

 +---+---+---+---+---+---+---+---+---+---+---+---+   number line

 0   1   2   3   4   5   6   7   8   9   10  11  12

For the data set in this box plot

Box plot

lower (first) quartile (

where CDF⁻¹ is the quantile function

Quantile function

In probability and statistics, the quantile function of the probability distribution of a random variable specifies, for a given probability, the value which the random variable will be at, or below, with that probability...

.

The interquartile range and median of some common distributions are shown below

Distribution	Median	IQR
Normal	μ	2 Φ⁻¹(0.75) ≈ 1.349
Laplace	μ	2b ln(2)
Cauchy Cauchy distribution The Cauchy–Lorentz distribution, named after Augustin Cauchy and Hendrik Lorentz, is a continuous probability distribution. As a probability distribution, it is known as the Cauchy distribution, while among physicists, it is known as the Lorentz distribution, Lorentz function, or Breit–Wigner...	μ

Interquartile range test for normality of distribution

The IQR, mean

Mean

In statistics, mean has two related meanings:* the arithmetic mean .* the expected value of a random variable, which is also called the population mean....

, and standard deviation

Standard deviation

Standard deviation is a widely used measure of variability or diversity used in statistics and probability theory. It shows how much variation or "dispersion" there is from the average...

of a population P can be used in a simply test of whether or not P is normally distributed, or Gaussian. If P is normally distributed, then the standard score

Standard score

In statistics, a standard score indicates how many standard deviations an observation or datum is above or below the mean. It is a dimensionless quantity derived by subtracting the population mean from an individual raw score and then dividing the difference by the population standard deviation...

of the first quartile, z₁, is -0.67, and the standard score of the third quartile, z₃, is +0.67. Given mean = X and standard deviation = σ for P, if P is normally distributed, the first quartile

and the third quartile

If the actual values of the first or third quartiles differ substantially from the calculated values, P is not normally distributed.

Use

Examples

Data set in a table

Data set in a plain-text box plot

Interquartile range test for normality of distribution

See also