All Topics  
Mean

 

   Email Print
   Bookmark   Link






 

Mean



 
 
In statistics
Statistics

Statistics is a Mathematics pertaining to the collection, analysis, interpretation or explanation, and presentation of data. It also provides tools for prediction and forecasting based on data....
, mean has two related meanings: It is sometimes stated that the 'mean' means average. This is incorrect if "mean" is taken in the specific sense of "arithmetic mean" as there are different types of averages: the mean, median
Median

In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
, and mode
Mode (statistics)

In statistics, the mode is the value that occurs the most frequently in a data set or a probability distribution. In some fields, notably education, sample data are often called scores, and the sample mode is known as the modal score....
. For instance, average house prices almost always use the median value for the average. These three types of averages are all measures of locations. Other simple statistical analyses use measures of spread, such as range
Range (mathematics)

In mathematics, the range of a function is the Set of all "output" values produced by that function. Sometimes it is called the , or more precisely, the image of the domain of the function....
, interquartile range
Interquartile range

In descriptive statistics, the interquartile range , also called the midspread, middle fifty and middle of the #s, is a measure of statistical dispersion, being equal to the difference between the third and first quartiles....
, or standard deviation
Standard deviation

In statistics, standard deviation is a simple measure of the variability or statistical dispersion of a data set. A low standard deviation indicates that all of the data points are very close to the same value , while high standard deviation indicates that the data are ?spread out? over a large range of values....
. For a real-valued random variable
Random variable

In mathematics, random variables are used in the study of Randomness and probability. They were developed to assist in the analysis of Game of chance, stochastic events, and the results of experiment by capturing only the mathematical properties necessary to answer probability questions....
 X, the mean is the expectation of X. Note that not every probability distribution
Probability distribution

In probability theory and statistics, a probability distribution identifies either the probability of each value of an unidentified random variable , or the probability of the value falling within a particular interval ....
 has a defined mean (or variance
Variance

In probability theory and statistics, the variance of a random variable, probability distribution, or sample is one measure of statistical dispersion, averaging the squared distance of its possible values from the expected value ....
); see the Cauchy distribution
Cauchy distribution

The Cauchy?Lorentz distribution, named after Augustin Cauchy and Hendrik Lorentz,  is a continuous probability distribution. As a probability distribution, it is known as the Cauchy distribution, while among physicists, it is known as a Lorentz distribution, or a Lorentz function or the Breit?Wigner dis...
 for an example.

For a data set
Data set

A data set is a collection of data, usually presented in tabular form. Each column represents a particular variable. Each row corresponds to a given member of the data set in question....
, the mean is the sum of the observations divided by the number of observations.






Discussion
Ask a question about 'Mean'
Start a new discussion about 'Mean'
Answer questions from other users
Full Discussion Forum



Encyclopedia


In statistics
Statistics

Statistics is a Mathematics pertaining to the collection, analysis, interpretation or explanation, and presentation of data. It also provides tools for prediction and forecasting based on data....
, mean has two related meanings:
  • the arithmetic mean
    Arithmetic mean

    In mathematics and statistics, the arithmetic mean of a list of numbers is the sum of all of the list divided by the number of items in the list....
     (and is distinguished from the geometric mean
    Geometric mean

    The geometric mean, in mathematics, is a type of mean or average, which indicates the central tendency or typical value of a set of numbers. It is similar to the arithmetic mean, which is what most people think of with the word "average," except that instead of adding the set of numbers and then dividing the sum by the count of numbers in the...
     or harmonic mean
    Harmonic mean

    In mathematics, the harmonic mean is one of several kinds of average. Typically, it is appropriate for situations when the average of Rate s is desired....
    ).
  • the expected value
    Expected value

    In probability theory and statistics, the expected value of a random variable is the Lebesgue integral of the random variable with respect to its probability measure....
     of a random variable
    Random variable

    In mathematics, random variables are used in the study of Randomness and probability. They were developed to assist in the analysis of Game of chance, stochastic events, and the results of experiment by capturing only the mathematical properties necessary to answer probability questions....
    , which is also called the population mean.
It is sometimes stated that the 'mean' means average. This is incorrect if "mean" is taken in the specific sense of "arithmetic mean" as there are different types of averages: the mean, median
Median

In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
, and mode
Mode (statistics)

In statistics, the mode is the value that occurs the most frequently in a data set or a probability distribution. In some fields, notably education, sample data are often called scores, and the sample mode is known as the modal score....
. For instance, average house prices almost always use the median value for the average. These three types of averages are all measures of locations. Other simple statistical analyses use measures of spread, such as range
Range (mathematics)

In mathematics, the range of a function is the Set of all "output" values produced by that function. Sometimes it is called the , or more precisely, the image of the domain of the function....
, interquartile range
Interquartile range

In descriptive statistics, the interquartile range , also called the midspread, middle fifty and middle of the #s, is a measure of statistical dispersion, being equal to the difference between the third and first quartiles....
, or standard deviation
Standard deviation

In statistics, standard deviation is a simple measure of the variability or statistical dispersion of a data set. A low standard deviation indicates that all of the data points are very close to the same value , while high standard deviation indicates that the data are ?spread out? over a large range of values....
. For a real-valued random variable
Random variable

In mathematics, random variables are used in the study of Randomness and probability. They were developed to assist in the analysis of Game of chance, stochastic events, and the results of experiment by capturing only the mathematical properties necessary to answer probability questions....
 X, the mean is the expectation of X. Note that not every probability distribution
Probability distribution

In probability theory and statistics, a probability distribution identifies either the probability of each value of an unidentified random variable , or the probability of the value falling within a particular interval ....
 has a defined mean (or variance
Variance

In probability theory and statistics, the variance of a random variable, probability distribution, or sample is one measure of statistical dispersion, averaging the squared distance of its possible values from the expected value ....
); see the Cauchy distribution
Cauchy distribution

The Cauchy?Lorentz distribution, named after Augustin Cauchy and Hendrik Lorentz,  is a continuous probability distribution. As a probability distribution, it is known as the Cauchy distribution, while among physicists, it is known as a Lorentz distribution, or a Lorentz function or the Breit?Wigner dis...
 for an example.

For a data set
Data set

A data set is a collection of data, usually presented in tabular form. Each column represents a particular variable. Each row corresponds to a given member of the data set in question....
, the mean is the sum of the observations divided by the number of observations. The mean is often quoted along with the standard deviation
Standard deviation

In statistics, standard deviation is a simple measure of the variability or statistical dispersion of a data set. A low standard deviation indicates that all of the data points are very close to the same value , while high standard deviation indicates that the data are ?spread out? over a large range of values....
: the mean describes the central location of the data, and the standard deviation describes the spread.

An alternative measure of dispersion is the mean deviation, equivalent to the average absolute deviation
Absolute deviation

In statistics, the absolute deviation of an element of a data set is the absolute difference between that element and a given point. Typically the point from which the deviation is measured is a measure of central tendency, most often the median or sometimes the mean of the data set....
 from the mean. It is less sensitive to outliers, but less mathematically tractable.

As well as statistics, means are often used in geometry and analysis; a wide range of means have been developed for these purposes, which are not much used in statistics. These are listed below.

Examples of means


Arithmetic mean

The arithmetic mean is the "standard" average, often simply called the "mean".



The mean may often be confused with the median
Median

In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
, mode
Mode (statistics)

In statistics, the mode is the value that occurs the most frequently in a data set or a probability distribution. In some fields, notably education, sample data are often called scores, and the sample mode is known as the modal score....
 or range. The mean is the arithmetic average of a set of values, or distribution; however, for skewed
Skewness

In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real number-valued random variable....
 distributions, the mean is not necessarily the same as the middle value (median), or the most likely (mode). For example, mean income is skewed upwards by a small number of people with very large incomes, so that the majority have an income lower than the mean. By contrast, the median income is the level at which half the population is below and half is above. The mode income is the most likely income, and favors the larger number of people with lower incomes. The median or mode are often more intuitive measures of such data.

Nevertheless, many skewed distributions are best described by their mean - such as the Exponential
Exponential distribution

In probability theory and statistics, the exponential distributions are a class of continuous probability distributions. They describe the times between events in a Poisson process, i.e....
 and Poisson
Poisson distribution

In probability theory and statistics, the Poisson distribution is a discrete probability distribution that expresses the probability of a number of events occurring in a fixed period of time if these events occur with a known average rate and Statistical independence of the time since the last event....
 distributions.

For example, the arithmetic mean of six values: 34, 27, 45, 55, 22, 34 is:

Geometric mean

The geometric mean
Geometric mean

The geometric mean, in mathematics, is a type of mean or average, which indicates the central tendency or typical value of a set of numbers. It is similar to the arithmetic mean, which is what most people think of with the word "average," except that instead of adding the set of numbers and then dividing the sum by the count of numbers in the...
 is an average that is useful for sets of positive numbers that are interpreted according to their product and not their sum (as is the case with the arithmetic mean) e.g. rates of growth.

For example, the geometric mean of six values: 34, 27, 45, 55, 22, 34 is:

Harmonic mean

The harmonic mean
Harmonic mean

In mathematics, the harmonic mean is one of several kinds of average. Typically, it is appropriate for situations when the average of Rate s is desired....
 is an average which is useful for sets of numbers which are defined in relation to some unit, for example speed
Speed

Speed is the rate of Motion , or equivalently the rate of change of distance.Speed is a Scalar quantity with dimensions length/time; the equivalent Vector quantity to speed is velocity....
 (distance per unit of time).

For example, the harmonic mean of the six values: 34, 27, 45, 55, 22, and 34 is

Generalized means


Power mean
The generalized mean
Generalized mean

A generalized mean, also known as power mean or H?lder mean, is an abstraction of the Pythagorean means including arithmetic mean, geometric mean, and harmonic mean means....
, also known as the power mean or Hölder mean, is an abstraction of the quadratic, arithmetic, geometric and harmonic means. It is defined for a set of n positive numbers xi by

By choosing the appropriate value for the parameter m we get
maximum
quadratic mean,
arithmetic mean
Arithmetic mean

In mathematics and statistics, the arithmetic mean of a list of numbers is the sum of all of the list divided by the number of items in the list....
,
geometric mean
Geometric mean

The geometric mean, in mathematics, is a type of mean or average, which indicates the central tendency or typical value of a set of numbers. It is similar to the arithmetic mean, which is what most people think of with the word "average," except that instead of adding the set of numbers and then dividing the sum by the count of numbers in the...
,
harmonic mean
Harmonic mean

In mathematics, the harmonic mean is one of several kinds of average. Typically, it is appropriate for situations when the average of Rate s is desired....
,
minimum.


f-mean
This can be generalized further as the generalized f-mean

and again a suitable choice of an invertible will give
arithmetic mean
Arithmetic mean

In mathematics and statistics, the arithmetic mean of a list of numbers is the sum of all of the list divided by the number of items in the list....
,
harmonic mean
Harmonic mean

In mathematics, the harmonic mean is one of several kinds of average. Typically, it is appropriate for situations when the average of Rate s is desired....
,
power mean,
geometric mean
Geometric mean

The geometric mean, in mathematics, is a type of mean or average, which indicates the central tendency or typical value of a set of numbers. It is similar to the arithmetic mean, which is what most people think of with the word "average," except that instead of adding the set of numbers and then dividing the sum by the count of numbers in the...
.


Weighted arithmetic mean

The weighted arithmetic mean
Weighted mean

The weighted mean is similar to an arithmetic mean , where instead of each of the data points contributing equally to the final average, some data points contribute more than others....
 is used, if one wants to combine average values from samples of the same population with different sample sizes:

The weights represent the bounds of the partial sample. In other applications they represent a measure for the reliability of the influence upon the mean by respective values.

Truncated mean

Sometimes a set of numbers might contain outliers, i.e. a datum
Datum

A geodetic datum is a reference from which measurements are made. In surveying and geodesy,a datum is a set of reference points on the earth's surface against which position measurements are made, and an associated model of the shape of the earth to define a geographic coordinate system....
 which is much lower or much higher than the others. Often, outliers are erroneous data caused by artifacts
Artifact (observational)

In natural science and signal processing, an artifact is any perceived distortion or other data error caused by the instrument of observation....
. In this case one can use a truncated mean
Truncated mean

A truncated mean or trimmed mean is a statistical measure of central tendency, much like the mean and median. It involves the calculation of the mean after discarding given parts of a probability distribution or Sampling at the high and low end, and typically discarding an equal amount of both....
. It involves discarding given parts of the data at the top or the bottom end, typically an equal amount at each end, and then taking the arithmetic mean of the remaining data. The number of values removed is indicated as a percentage of total number of values.

Interquartile mean

The interquartile mean
Interquartile mean

The 'interquartile mean ' is a statistics measures of central tendency, much like the mean , the median, and the mode .The IQM is a truncated mean and so is very similar to the scoring method used in sports that are evaluated by a panel of judges: discard the lowest and the highest scores; calculate the mean value of the remaining score...
 is a specific example of a truncated mean. It is simply the arithmetic mean after removing the lowest and the highest quarter of values. assuming the values have been ordered.

Mean of a function

In calculus
Calculus

Calculus is a branch of mathematics that includes the study of limit , derivatives, integrals, and infinite series, and constitutes a major part of modern university education....
, and especially multivariable calculus
Multivariable calculus

Multivariable calculus is the extension of calculus in one variable to calculus in several variables: the functions which are differentiated and integrated involve several variables rather than one variable....
, the mean of a function is loosely defined as the average value of the function over its domain
Domain (mathematics)

In mathematics, the domain of a given function is the set of "input" values for which the function is defined. For instance, the domain of cosine would be all real numbers, while the domain of the square root would be only numbers greater than or equal to 0 ....
. In one variable, the mean of a function f(x) over the interval (a,b) is defined by



(See also mean value theorem
Mean value theorem

In calculus, the mean value theorem states, roughly, that given a section of a Smooth function curve, there is at least one point on that section at which the derivative of the curve is equal to the "average" derivative of the section....
.) In several variables, the mean over a relatively compact domain U in a Euclidean space
Euclidean space

Around 300 Before Christ, the Ancient Greece mathematician Euclid undertook a study of relationships among distances and angles, first in a plane and then in space....
 is defined by

This generalizes the arithmetic mean. On the other hand, it is also possible to generalize the geometric mean to functions by defining the geometric mean of f to be

More generally, in measure theory and probability theory
Probability theory

Probability theory is the branch of mathematics concerned with analysis of Statistical randomness phenomena. The central objects of probability theory are random variables, stochastic processes, and event s: mathematical abstractions of determinism events or measured quantities that may either be single occurrences or evolve over time in an a...
 either sort of mean plays an important role. In this context, Jensen's inequality
Jensen's inequality

In mathematics, Jensen's inequality, named after the Danish mathematician Johan Jensen, relates the value of a convex function of an integral to the integral of the convex function....
 places sharp estimates on the relationship between these two different notions of the mean of a function.

Mean of angles


Most of the usual means fail on circular quantities, like angle
Angle

In geometry and trigonometry, an angle is the figure formed by two Ray sharing a common endpoint, called the vertex of the angle . The magnitude of the angle is the "amount of rotation" that separates the two rays, and can be measured by considering the length of circular arc swept out when one ray is rotated about the vertex to coincide...
s, daytime
DAYTIME

The DAYTIME service is an Internet Protocol defined in RFC 867. It is intended for testing and measurement purposes in computer networks.A host may connect to a server that supports the DAYTIME protocol, on either Transmission Control Protocol or User Datagram Protocol port 13....
s, fractional parts of real number
Real number

In mathematics, the real numbers may be described informally in several different ways. The real numbers include both rational numbers, such as 42 and −23/129, and irrational numbers, such as pi and the square root of two; or, a real number can be given by an infinite decimal representation, such as 2.4871773339...., where the digits co...
s. For those quantities you need a mean of circular quantities
Mean of circular quantities

In mathematics, a mean of circular quantities is a mean which is suited for quantities like angles, daytimes, and fractional parts of real numbers....
.

Other means


Properties


All means share some properties and additional properties are shared by the most common means. Some of these properties are collected here.

Weighted mean


A weighted mean is a function which maps tuples of positive numbers to a positive number .

  • "Fixed point
    Fixed point (mathematics)

    In mathematics, a fixed point of a function is a point that is mapped to itself by the function. That is to say, x is a fixed point of the function f if and only if f = x....
    ":
  • Homogeneity
    Homogeneous function

    In mathematics, a homogeneous function is a function with multiplicative scaling behaviour: if the argument is multiplied by a factor, then the result is multiplied by some power of this factor....
    :
(using vector
Coordinate vector

In linear algebra, a coordinate vector is an explicit representation of a vector in an Real_coordinate_space#Intuitive_overview as an ordered list of numbers or, equivalently, as an element of the coordinate space Fn....
 notation: )
  • Monotony
    Monotonic function

    In mathematics, a monotonic function is a function which preserves the given order. This concept first arose in calculus, and was later generalized to the more abstract setting of order theory....
    :


It follows
  • Boundedness
    Upper bound

    In mathematics, especially in order theory, an upper bound of a subset S of some partially ordered set is an element of P which is greater than or equal to every element of S....
    :
  • Continuity
    Continuous function

    In mathematics, a continuous function is a function for which, intuitively, small changes in the input result in small changes in the output. Otherwise, a function is said to be discontinuous....
    :
Sketch of a proof: Because it follows .
  • There are means which are not differentiable
    Derivative

    In calculus, a branch of mathematics, the derivative is a measure of how a function changes as its input changes. Loosely speaking, a derivative can be thought of as how much a quantity is changing at a given point....
    . For instance, the maximum number of a tuple is considered a mean (as an extreme case of the power mean, or as a special case of a median
    Median

    In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
    ), but is not differentiable.
  • All means listed above, with the exception of most of the Generalized f-means, satisfy the presented properties.
    • If is bijective, then the generalized f-mean satisfies the fixed point property.
    • If is strictly monotonic, then the generalized f-mean satisfy also the monotony property.
    • In general a generalized f-mean will miss homogeneity.


The above properties imply techniques to construct more complex means:

If are weighted means, is a positive real number
Real number

In mathematics, the real numbers may be described informally in several different ways. The real numbers include both rational numbers, such as 42 and −23/129, and irrational numbers, such as pi and the square root of two; or, a real number can be given by an infinite decimal representation, such as 2.4871773339...., where the digits co...
, then with
are also a weighted mean.

Unweighted mean


Intuitively spoken, an unweighted mean is a weighted mean with equal weights. Since our definition of weighted mean above does not expose particular weights, equal weights must be asserted by a different way. A different view on homogeneous weighting is, that the inputs can be swapped without altering the result.

Thus we define being an unweighted mean if it is a weighted mean and for each permutation
Permutation

In several fields of mathematics the term permutation is used with different but closely related meanings. They all relate to the notion of mapping the element s of a set to other elements of the same set, i.e., exchanging elements of a set....
  of inputs, the result is the same. Let be the set of permutations of -tuples.
Symmetry
Symmetric function

In mathematics, the term "symmetric function" can mean two different things. A symmetric function of n variables is one whose value at any n-tuple of arguments is the same as its value at any permutation of that n-tuple....
:


Analogously to the weighted means, if is a weighted mean and are unweighted means, is a positive real number
Real number

In mathematics, the real numbers may be described informally in several different ways. The real numbers include both rational numbers, such as 42 and −23/129, and irrational numbers, such as pi and the square root of two; or, a real number can be given by an infinite decimal representation, such as 2.4871773339...., where the digits co...
, then with
are also unweighted means.

Convert unweighted mean to weighted mean


An unweighted mean can be turned into a weighted mean by repeating elements. This connection can also be used to state that a mean is the weighted version of an unweighted mean. Say you have the unweighted mean and weight the numbers by natural numbers . (If the numbers are rational
Rational number

In mathematics, a rational number is a number which can be expressed as a quotient of two integers. Non-integer rational numbers are usually written as the vulgar fraction , where b is not 0 ....
, then multiply them with the least common denominator.) Then the corresponding weighted mean is obtained by

Means of tuples of different sizes


If a mean is defined for tuples of several sizes, then one also expects that the mean of a tuple is bounded by the means of partitions. More precisely
  • Given an arbitrary tuple , which is partitioned
    Partition of a set

    In mathematics, a partition of a Set X is a division of X into non-overlapping "parts" or "blocks" or "cells" that cover all of X....
     into , then it holds . (See Convex hull
    Convex hull

    In mathematics, the convex hull or convex envelope for a Set of points X in a real vector space V is the minimal convex set containing X....
    )


Population and sample means

The mean of a population
Statistical population

In statistics, a statistical population is a Set of entities concerning which statistical inferences are to be drawn, often based on a random sample taken from the population....
 has an expected value of µ, known as the population mean. The sample mean makes a good estimator
Estimator

In statistics, an estimator is a function of the observable sample data that is used to estimate an unknown population parameter ; an estimate is the result from the actual application of the function to a particular Sampling_ of data....
 of the population mean, as its expected value is the same as the population mean. The sample mean of a population is a random variable
Random variable

In mathematics, random variables are used in the study of Randomness and probability. They were developed to assist in the analysis of Game of chance, stochastic events, and the results of experiment by capturing only the mathematical properties necessary to answer probability questions....
, not a constant, and consequently it will have its own distribution. For a random sample of n observations from a normally distributed population, the sample mean distribution is



Often, since the population variance is an unknown parameter, it is estimated by the mean sum of squares
Sum of squares

Sum of squares is a concept that permeates much of inferential statistics and descriptive statistics. More properly, it is "the sum of the squared deviations"....
, which changes the distribution of the sample mean from a normal distribution to a Student's t distribution with n − 1 degrees of freedom
Degrees of freedom

Degrees of freedom can mean:* Degrees of freedom * Degrees of freedom * Degrees of freedom ...
.

Mathematics education

In many state and government curriculum standards, students are traditionally expected to learn either the meaning or formula for computing the mean by the fourth grade. However, in many standards-based mathematics curricula, students are encouraged to invent their own methods, and may not be taught the traditional method. Reform based texts such as TERC
Investigations in Numbers, Data, and Space

Investigations in Number, Data, and Space is a K-5 mathematics curriculum, developed at in Cambridge, Massachusetts, USA. The curriculum is often referred to as Investigations or simply TERC....
 in fact discourage teaching the traditional "add the numbers and divide by the number of items" method in favor of spending more time on the concept of median
Median

In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
, which does not require division. However, mean can be computed with a simple four-function calculator, while median requires an abacus. The same teacher guide devotes several pages on how to find the median of a set, which is judged to be simpler than finding the mean.

See also

  • Average
    Average

    In mathematics, an average, or central tendency of a data set refers to a measure of the "middle" or "Expected value" value of the data set....
    , same as central tendency
  • Descriptive statistics
    Descriptive statistics

    Descriptive Statistics are used to describe the basic features of the data gathered from an experimental study in various ways. A descriptive Statistics is distinguished from inductive statistics....
  • Kurtosis
    Kurtosis

    In probability theory and statistics, kurtosis is a measure of the "peakedness" of the probability distribution of a real number-valued random variable....
  • Median
    Median

    In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
  • Mode (statistics)
    Mode (statistics)

    In statistics, the mode is the value that occurs the most frequently in a data set or a probability distribution. In some fields, notably education, sample data are often called scores, and the sample mode is known as the modal score....
  • Summary statistics
    Summary statistics

    File:Michelsonmorley-boxplot.svgIn descriptive statistics, summary statistics are used to summarize a set of observations, in order to communicate the largest amount as simply as possible....
  • Law of averages
    Law of averages

    The law of averages is a Layman's terms used to express a belief that outcomes of a random event shall "even out" within a small sample.As invoked in everyday life, the "law" usually reflects bad statistics or wishful thinking rather than any mathematical principle....
  • Spherical mean
    Spherical mean

    In mathematics, the spherical mean of a function around a point is the average of all values of that function on a sphere of given radius centered at that point....
  • For an independent identical distribution from the reals, the mean of a sample is an unbiased
    Bias of an estimator

    In statistics, the difference between an estimator's expected value and the true value of the parameter being estimated is called the bias. An estimator or decision rule having nonzero bias is said to be biased....
     estimator for the mean of the population.


External links