All Topics  
Mode (statistics)

 

   Email Print
   Bookmark   Link






 

Mode (statistics)



 
 
In statistics
Statistics

Statistics is a Mathematics pertaining to the collection, analysis, interpretation or explanation, and presentation of data. It also provides tools for prediction and forecasting based on data....
, the mode is the value that occurs the most frequently in a data set
Data set

A data set is a collection of data, usually presented in tabular form. Each column represents a particular variable. Each row corresponds to a given member of the data set in question....
 or a probability distribution
Probability distribution

In probability theory and statistics, a probability distribution identifies either the probability of each value of an unidentified random variable , or the probability of the value falling within a particular interval ....
. In some fields, notably education, sample data are often called scores, and the sample mode is known as the modal score.

Like the statistical mean
Mean

In statistics, mean has two related meanings:* the arithmetic mean .* the expected value of a random variable, which is also called the population mean....
 and the median
Median

In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
, the mode is a way of capturing important information
Summary statistics

File:Michelsonmorley-boxplot.svgIn descriptive statistics, summary statistics are used to summarize a set of observations, in order to communicate the largest amount as simply as possible....
 about a random variable
Random variable

In mathematics, random variables are used in the study of Randomness and probability. They were developed to assist in the analysis of Game of chance, stochastic events, and the results of experiment by capturing only the mathematical properties necessary to answer probability questions....
 or a population in a single quantity. The mode is in general different from mean and median, and may be very different for strongly skewed distributions.

The mode is not necessarily unique, since the same maximum frequency may be attained at different values.






Discussion
Ask a question about 'Mode (statistics)'
Start a new discussion about 'Mode (statistics)'
Answer questions from other users
Full Discussion Forum



Encyclopedia


In statistics
Statistics

Statistics is a Mathematics pertaining to the collection, analysis, interpretation or explanation, and presentation of data. It also provides tools for prediction and forecasting based on data....
, the mode is the value that occurs the most frequently in a data set
Data set

A data set is a collection of data, usually presented in tabular form. Each column represents a particular variable. Each row corresponds to a given member of the data set in question....
 or a probability distribution
Probability distribution

In probability theory and statistics, a probability distribution identifies either the probability of each value of an unidentified random variable , or the probability of the value falling within a particular interval ....
. In some fields, notably education, sample data are often called scores, and the sample mode is known as the modal score.

Like the statistical mean
Mean

In statistics, mean has two related meanings:* the arithmetic mean .* the expected value of a random variable, which is also called the population mean....
 and the median
Median

In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
, the mode is a way of capturing important information
Summary statistics

File:Michelsonmorley-boxplot.svgIn descriptive statistics, summary statistics are used to summarize a set of observations, in order to communicate the largest amount as simply as possible....
 about a random variable
Random variable

In mathematics, random variables are used in the study of Randomness and probability. They were developed to assist in the analysis of Game of chance, stochastic events, and the results of experiment by capturing only the mathematical properties necessary to answer probability questions....
 or a population in a single quantity. The mode is in general different from mean and median, and may be very different for strongly skewed distributions.

The mode is not necessarily unique, since the same maximum frequency may be attained at different values. The most ambiguous case occurs in uniform distribution
Uniform distribution

Uniform distribution can refer to:...
s, wherein all values are equally likely.

Mode of a probability distribution


The mode of a discrete probability distribution
Discrete probability distribution

Discrete probability distributions arise in the mathematical description of probability theory and statistical analysis in which the values that might be observed are restricted to being within a pre-defined list of possible values....
 is the value x at which its probability mass function
Probability mass function

In probability theory, a probability mass function is a function that gives the probability that a discrete random variable random variable is exactly equal to some value....
 takes its maximum value. In other words, it is the value that is most likely to be sampled.

The mode of a continuous probability distribution
Continuous probability distribution

In probability theory, a probability distribution is called continuous if its cumulative distribution function is continuous function. This is equivalent to saying that for random variables X with the distribution in question, Pr[X = a] = 0 for all real numbers a, i.e.: the probability that X attains the value a is zer...
 is the value x at which its probability density function
Probability density function

In mathematics, a probability density function is a function that represents a probability distribution in terms of integrals.Formally, a probability distribution has density ƒ, if ƒ is a non-negative Lebesgue integration function such that the probability of the interval [ab] is given by...
 attains its maximum value, so, informally speaking, the mode is at the peak.

As noted above, the mode is not necessarily unique, since the probability mass function or probability density function may achieve its maximum value at several points x1, x2, etc.. When a probability density function has multiple local maxima, it is common to refer to all of the local maxima as modes of the distribution (even though the above definition implies that only global maxima are modes). Such a continuous distribution is called multimodal (as opposed to unimodal).

In symmetric
Reflection symmetry

The triangles with this symmetry are isosceles. The quadrilaterals with this symmetry are the kite s and the isosceles trapezoids.For each line or plane of reflection, the symmetry group is isomorphic with Cs , one of the three types of order two , hence algebraically C2....
 unimodal distributions, such as the normal (or Gaussian) distribution
Normal distribution

The normal distribution, also called the Gaussian distribution, is an important family of continuous probability distributions, applicable in many fields....
 (the distribution whose density function, when graphed, gives the famous "bell curve"), the mean (if defined), median and mode all coincide. For samples, if it is known that they are drawn from a symmetric distribution, the sample mean can be used as an estimate of the population mode.

Mode of a sample


The mode of a data sample is the element that occurs most often in the collection. For example, the mode of the sample [1, 3, 6, 6, 6, 6, 7, 7, 12, 12, 17] is 6. Given the list of data [1, 1, 2, 4, 4] the mode is not unique, unlike the arithmetic mean
Arithmetic mean

In mathematics and statistics, the arithmetic mean of a list of numbers is the sum of all of the list divided by the number of items in the list....
.

For a sample from a continuous distribution, such as [0.935..., 1.211..., 2.430..., 3.668..., 3.874...], the concept is unusable in its raw form, since each value will occur precisely once. The usual practice is to discretize the data by assigning the values to equidistant interval
Interval (mathematics)

In mathematics, a interval is a set of real numbers with the property that any number that lies between two numbers in the set is also included in the set....
s, as for making a histogram
Histogram

In statistics, a histogram is a graphical display of tabulated frequency , shown as bars. It shows what proportion of cases fall into each of several Categorization....
, effectively replacing the values by the midpoints of the intervals they are assigned to. The mode is then the value where the histogram reaches its peak. For small or middle-sized samples the outcome of this procedure is sensitive to the choice of interval width if chosen too narrow or too wide; typically one should have a sizable fraction of the data concentrated in a relatively small number of intervals (5 to 10), while the fraction of the data falling outside these intervals is also sizable. An alternate approach is kernel density estimation
Kernel density estimation

In statistics, kernel density estimation is a Non-parametric statistics way of Density estimation the probability density function of a random variable....
, which essentially blurs point samples to produce a continuous estimate of the probability density function which can provide an estimate of the mode.

Comparison of mean, median and mode


For a probability distribution, the mean is also called the expected value
Expected value

In probability theory and statistics, the expected value of a random variable is the Lebesgue integral of the random variable with respect to its probability measure....
 of the random variable. For a data sample, the mean is also called the average
Average

In mathematics, an average, or central tendency of a data set refers to a measure of the "middle" or "Expected value" value of the data set....
.

When do these measures make sense?

Unlike mean and median, the concept of mode also makes sense for "nominal data" (i.e., not consisting of numerical values). For example, taking a sample of Korean family name
Korean name

A Korean name consists of a family name followed by a given name, as used by the Korean people in both North Korea and South Korea. In the Korean language, 'ireum' usually refers to the family name and given name together....
s, one might find that "Kim
Kim (Korean name)

Kim is the most common Korean name in Korea. The name is common in both modern-day North Korea and South Korea. The Chinese character used for the name means "gold," and although the character is usually pronounced "geum" in Korea, it is pronounced "gim" when used for the family name and names of some cities, e.g., Gimhae and Gimpo...
" occurs more often than any other name. Then "Kim" might be called the mode of the sample. However, this use is not common.

Unlike median, the concept of mean makes sense for any random variable assuming values from a vector space
Vector space

File:Vector addition ans scaling.pngA vector space is a mathematical structure formed by a collection of vectors: objects that may be Vector addition together and Scalar multiplication by numbers, called scalar s in this context....
, including the real number
Real number

In mathematics, the real numbers may be described informally in several different ways. The real numbers include both rational numbers, such as 42 and −23/129, and irrational numbers, such as pi and the square root of two; or, a real number can be given by an infinite decimal representation, such as 2.4871773339...., where the digits co...
s (a one-dimension
Dimension

In mathematics, the dimension of a space is roughly defined as the minimum number of coordinates needed to specify every point within it. For example: a point on the unit circle in the plane can be specified by two Cartesian coordinates but one can make do with a single coordinate , so the circle is 1-dimensional even though it exists in...
al vector space) and the integer
Integer

The integers are natural numbers including 0 and their negative and non-negative numberss . They are numbers that can be written without a fractional or decimal component, and fall within the set ....
s (which can be considered embedded in the reals). For example, a distribution of points in the plane
Plane (mathematics)

In mathematics, a plane is a curvature surface. Planes can arise as subspaces of some higher dimensional space, as with the walls of a room, or they may enjoy an independent existence in their own right, as in the setting of Euclidean geometry....
 will typically have a mean and a mode, but the concept of median does not apply. The median makes sense when there is a linear order on the possible values.

Uniqueness and definedness

For the remainder, the assumption is that we have (a sample of) a real-valued random variable.

For some probability distributions, the expected value may be infinite or undefined, but if defined, it is unique. The average of a (finite) sample is always defined. The median is the value such that the fractions not exceeding it and not falling below it are both at least 1/2. It is not necessarily unique, but never infinite or totally undefined. For a data sample it is the "halfway" value when the list of values is ordered in increasing value, where usually for a list of even length the numerical average is taken of the two values closest to "halfway". Finally, as said before, the mode is not necessarily unique. Certain pathological distributions (for example, the Cantor distribution
Cantor distribution

The Cantor distribution is the probability distribution whose cumulative distribution function is the Cantor function.This distribution has neither a probability density function nor a probability mass function, as it is not absolute continuity with respect to Lebesgue measure, nor has it any point-masses....
) have no defined mode at all. For a finite data sample, the mode is one (or more) of the values in the sample.

Properties

Assuming definedness, and for simplicity uniqueness, the following are some of the most interesting properties.
  • All three measures have the following property: If the random variable (or each value from the sample) is subjected to the linear or affine transformation
    Affine transformation

    In geometry, an affine transformation or affine map or an affinity between two vector spaces consists of a linear transformation followed by a translation :...
     which replaces X by aX+b, so are the mean, median and mode.
  • However, if there is an arbitrary monotonic transformation, only the median follows; for example, if X is replaced by exp(X), the median changes from m to exp(m) but the mean and mode won't.
  • Except for extremely small samples, the mode is insensitive to "outliers" (such as occasional, rare, false experimental readings). The median is also very robust in the presence of outliers, while the mean is rather sensitive.
  • In continuous unimodal distributions the median lies, as a rule of thumb, between the mean and the mode, about one third of the way going from mean to mode. In a formula, median ˜ (2 × mean + mode)/3. This rule, due to Karl Pearson
    Karl Pearson

    Karl Pearson Fellow of the Royal Society established the disciplineof mathematical statistics.In 1911 he founded the world's first university statistics department at University College London....
    , is however not always true and the three statistics can appear in any order. It often applies to slightly non-symmetric distributions that resemble a normal distribution.


Example for a skewed distribution


A well-known example of a skewed
Skewness

In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real number-valued random variable....
 distribution is personal wealth
Distribution of wealth

Distribution of wealth is a comparison of the wealth of various members or groups in a society. It differs from the distribution of income in a manner analogous to the difference between position and speed....
: Few people are very rich, but among those some are extremely rich. However, many are rather poor.

A well-known class of distributions that can be arbitrarily skewed is given by the log-normal distribution
Log-normal distribution

In probability and statistics, the log-normal distribution is the single-tailed probability distribution of any random variable whose logarithm is normal distribution....
. It is obtained by transforming a random variable X having a normal distribution into random variable Y = exp(X). Then the logarithm of random variable Y is normally distributed, hence the name.

Taking the mean µ of X to be 0, the median of Y will be 1, independent of the standard deviation
Standard deviation

In statistics, standard deviation is a simple measure of the variability or statistical dispersion of a data set. A low standard deviation indicates that all of the data points are very close to the same value , while high standard deviation indicates that the data are ?spread out? over a large range of values....
 s of X. This is so because X has a symmetric distribution, so its median is also 0. The transformation from X to Y is monotonic, and so we find the median exp(0) = 1 for Y.

When X has standard deviation s = 0.2, the distribution of Y is not very skewed. We find (see under Log-normal distribution
Log-normal distribution

In probability and statistics, the log-normal distribution is the single-tailed probability distribution of any random variable whose logarithm is normal distribution....
), with values rounded to four digits:
  • mean = 1.0202
  • mode = 0.9608
Indeed, the median is about one third on the way from mean to mode.

When X has a much larger standard deviation, s = 5, the distribution of Y is strongly skewed. Now
  • mean = 7.3891
  • mode = 0.0183
Here, Pearson's rule of thumb fails.

See also

  • unimodal function
    Unimodal function

    In mathematics, a function f between two ordered sets is unimodal if for some value m , it is monotonically increasing for x = m and monotonically decreasing for x = m....
  • summary statistics
    Summary statistics

    File:Michelsonmorley-boxplot.svgIn descriptive statistics, summary statistics are used to summarize a set of observations, in order to communicate the largest amount as simply as possible....
  • descriptive statistics
    Descriptive statistics

    Descriptive Statistics are used to describe the basic features of the data gathered from an experimental study in various ways. A descriptive Statistics is distinguished from inductive statistics....
  • central tendency
    • mean
      Mean

      In statistics, mean has two related meanings:* the arithmetic mean .* the expected value of a random variable, which is also called the population mean....
    • median
      Median

      In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
  • arg max
    Arg max

    In mathematics, arg max stands for the argument of the maximum, that is to say, the value of the given parameter for which the value of the given expression attains its maximum value:...
  • moment (mathematics)
    Moment (mathematics)

    The concept of moment in mathematics evolved from the concept of moment in physics. The nth moment of a real-valued function f of a real variable about a value c is...


External links