All Topics  
Average

 

   Email Print
   Bookmark   Link






 

Average



 
 
In mathematics
Mathematics

Mathematics is the study of quantity, structure, space, change, and related topics of pattern and form. Mathematicians seek out patterns whether found in numbers, space, natural science, computers, imaginary abstractions, or elsewhere....
, an average, or central tendency of a data set
Data set

A data set is a collection of data, usually presented in tabular form. Each column represents a particular variable. Each row corresponds to a given member of the data set in question....
 refers to a measure of the "middle" or "expected
Expected value

In probability theory and statistics, the expected value of a random variable is the Lebesgue integral of the random variable with respect to its probability measure....
" value of the data set. There are many different descriptive statistics
Descriptive statistics

Descriptive Statistics are used to describe the basic features of the data gathered from an experimental study in various ways. A descriptive Statistics is distinguished from inductive statistics....
 that can be chosen as a measurement of the central tendency of the data items.

An average is a single value that is meant to typify a list of values. If all the numbers in the list are the same, then this number should be used. If the numbers are not all the same, an easy way to get a representative value from a list is to randomly pick any number from the list.






Discussion
Ask a question about 'Average'
Start a new discussion about 'Average'
Answer questions from other users
Full Discussion Forum



Encyclopedia


In mathematics
Mathematics

Mathematics is the study of quantity, structure, space, change, and related topics of pattern and form. Mathematicians seek out patterns whether found in numbers, space, natural science, computers, imaginary abstractions, or elsewhere....
, an average, or central tendency of a data set
Data set

A data set is a collection of data, usually presented in tabular form. Each column represents a particular variable. Each row corresponds to a given member of the data set in question....
 refers to a measure of the "middle" or "expected
Expected value

In probability theory and statistics, the expected value of a random variable is the Lebesgue integral of the random variable with respect to its probability measure....
" value of the data set. There are many different descriptive statistics
Descriptive statistics

Descriptive Statistics are used to describe the basic features of the data gathered from an experimental study in various ways. A descriptive Statistics is distinguished from inductive statistics....
 that can be chosen as a measurement of the central tendency of the data items.

An average is a single value that is meant to typify a list of values. If all the numbers in the list are the same, then this number should be used. If the numbers are not all the same, an easy way to get a representative value from a list is to randomly pick any number from the list. However, the word 'average' is usually reserved for more sophisticated methods that are generally found to be more useful.

The most common method is the arithmetic mean
Arithmetic mean

In mathematics and statistics, the arithmetic mean of a list of numbers is the sum of all of the list divided by the number of items in the list....
. There are many other types of averages, such as median
Median

In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
 (used most often to describe house prices and incomes). The average is calculated by combining the measurements related to a set and to compute a number as being the average of the set.

Calculation


Arithmetic mean

If numbers are given, each number denoted by ai, where , the arithmetic mean is the [sum] of the ai's divided by or .

The arithmetic mean, often simply called the mean, of two numbers, such as 2 and 8, is obtained by finding a value A such that 2 + 8 = A + A. One may find that A = (2 + 8)/2 = 5. Switching the order of 2 and 8 to read 8 and 2 does not change the resulting value obtained for A. The mean 5 is not less than the minimum 2 nor greater than the maximum 8. If we increase the number of terms in the list for which we want an average, we get, for example, that the arithmetic mean of 2, 8, and 11 is found by solving for the value of A in the equation 2 + 8 + 11 = A + A + A. One finds that A = (2 + 8 + 11)/3 = 7.

Changing the order of the three members of the list does not change the result: A = (8 + 11 + 2)/3 = 7 and that 7 is between 2 and 11. This summation method is easily generalized for lists with any number of elements. However, the mean of a list of integers is not necessarily an integer. "The average family has 1.7 children" is a jarring way of making a statement that is more appropriately expressed by "the average number of children in the collection of families examined is 1.7".

Geometric mean


The geometric mean of n numbers is obtained by multiplying them all together and then taking the nth root. In algebraic terms, the geometric mean of is defined as

Geometric mean can be thought of as the antilog of the arithmetic mean of the logs
Logarithm

In mathematics, the logarithm of a number to a given base is the Power or exponent to which the base must be raised in order to produce the number....
 of the numbers.

Example: Geometric mean of 2 and 8 is .

Harmonic mean

Harmonic mean for a set of numbers is defined as the reciprocal of the arithmetic mean of the reciprocals of 's:

One example where it is useful is calculating the average speed. For example, if the speed for going from point A to B was 60km/h, and the speed for returning from B to A was 40km/h, then the average speed is given by .

Inequality Concerning AM, GM & HM

A well known inequality concerning Arithmetic, Geometric, and Harmonic means for any set of positive numbers is

It is easy to remember noting that the alphabetical order of the letters A, G and H is preserved in the inequality. See Inequality of arithmetic and geometric means
Inequality of arithmetic and geometric means

In mathematics, the inequality of arithmetic and geometric means, or more briefly the AM-GM inequality, states that the arithmetic mean of a list of non-negative real numbers is greater than or equal to the geometric mean of the same list; and further, that the two means are equal if and only if every number in the list is the same....
.

Mode and median

The most frequently occurring number in a list of numbers is called the mode. The mode of the list (1, 2, 2, 3, 3, 3, 4) is 3. The mode is not necessarily well defined, the list (1, 2, 2, 3, 3, 5) has the two modes 2 and 3. The mode can be subsumed under the general method of defining averages by understanding it as taking the list and setting each member of the list equal to the most common value in the list if there is a most common value. This list is then equated to the resulting list with all values replaced by the same value. Since they are already all the same, this does not require any change. The mode is more meaningful and potentially useful if there are many numbers in the list, and the frequency of the numbers progresses smoothly (eg if out of a group of 1000 people, 30 people weigh 61 kg, 32 weigh 62 kg, 29 weigh 63 kg, and all the other possible weights occur less frequently, then 62 kg is the mode).

The median is the middle number of the group when they are ranked in order. (If there are an even number of numbers, the average of the middle two is taken.)

Thus to find the median, order the list according to its elements' magnitude and then repeatedly remove the pair consisting of the highest and lowest values until either one or two values are left. If exactly one value is left, it is the median; if two values, the median is the arithmetic mean of these two. This method takes the list 1, 7, 3, 13 and orders it to read 1, 3, 7, 13. Then the 1 and 13 are removed to obtain the list 3, 7. Since there are two elements in this remaining list, the median is their arithmetic mean, (3 + 7)/2 = 5.

Average Percentage Return

The average percentage return is a type of average used in finance. It is an example of a geometric mean. For example, if we are considering a period of two years, and the investment return in the first year is -10% and the return in the second year is +60%, then the average percentage return, R, can be obtained by solving the equation: . The value of R that makes this equation true is 0.2, or 20%. Note that changing the order to find the average percentage returns of +60% and -10% gives the same result as the average percentage returns of -10% and +60%.

This method can be generalized to examples in which the periods are not all of one-year duration. Average percentage of a set of returns is a variation on the geometric average that provides the intensive property of a return per year corresponding to a list of percentage returns. For example, consider a period of a half of a year for which the return is -23% and a period of two and one half years for which the return is +13%. The average percentage return for the combined period is the single year return, R, that is the solution of the following equation: , giving an average percentage return R of 0.0600 or 6.00%.

Types

The table of mathematical symbols
Table of mathematical symbols

This is a listing of common symbols found within all branches of the science of mathematics....
 explains the symbols used below.
Name Equation or description
Arithmetic mean
Arithmetic mean

In mathematics and statistics, the arithmetic mean of a list of numbers is the sum of all of the list divided by the number of items in the list....
 
Median
Median

In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
 
The middle value that separates the higher half from the lower half of the data set
Geometric median
Geometric median

The geometric median of a discrete set of sample points in a Euclidean space is the point minimizing the sum of distances to the sample points. This generalizes the median, which has the property of minimizing the sum of distances for one-dimensional data, and provides a central tendency in higher dimensions....
 
A rotation
Rotation (mathematics)

In geometry and linear algebra, a rotation is a transformation in a plane or in space that describes the motion of a rigid body around a fixed point....
 invariant
Invariant (mathematics)

In mathematics, an invariant is something that does not change under a set of Transformation s. The property of being an invariant is invariance....
 extension of the median
Median

In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
 for points in Rn
Mode
Mode (statistics)

In statistics, the mode is the value that occurs the most frequently in a data set or a probability distribution. In some fields, notably education, sample data are often called scores, and the sample mode is known as the modal score....
 
The most frequent value in the data set
Geometric mean
Geometric mean

The geometric mean, in mathematics, is a type of mean or average, which indicates the central tendency or typical value of a set of numbers. It is similar to the arithmetic mean, which is what most people think of with the word "average," except that instead of adding the set of numbers and then dividing the sum by the count of numbers in the...
 
Harmonic mean
Harmonic mean

In mathematics, the harmonic mean is one of several kinds of average. Typically, it is appropriate for situations when the average of Rate s is desired....
 
Quadratic mean
(or RMS)
Generalized mean
Generalized mean

A generalized mean, also known as power mean or H?lder mean, is an abstraction of the Pythagorean means including arithmetic mean, geometric mean, and harmonic mean means....
 
Weighted mean
Weighted mean

The weighted mean is similar to an arithmetic mean , where instead of each of the data points contributing equally to the final average, some data points contribute more than others....
 
Truncated mean
Truncated mean

A truncated mean or trimmed mean is a statistical measure of central tendency, much like the mean and median. It involves the calculation of the mean after discarding given parts of a probability distribution or Sampling at the high and low end, and typically discarding an equal amount of both....
 
The arithmetic mean of data values after a certain number or proportion of the highest and lowest data values have been discarded
Interquartile mean
Interquartile mean

The 'interquartile mean ' is a statistics measures of central tendency, much like the mean , the median, and the mode .The IQM is a truncated mean and so is very similar to the scoring method used in sports that are evaluated by a panel of judges: discard the lowest and the highest scores; calculate the mean value of the remaining score...
 
A special case of the truncated mean, using the interquartile range
Interquartile range

In descriptive statistics, the interquartile range , also called the midspread, middle fifty and middle of the #s, is a measure of statistical dispersion, being equal to the difference between the third and first quartiles....
Midrange
Midrange

In statistics, the mid-range or mid-extreme of a set of statistical data values is the arithmetic mean of the maximum and minimum values in a data set, or:...
 
Winsorized mean
Winsorized mean

A Winsorized mean is a Winsorising statistical measure of central tendency, much like the mean and median, and even more similar to the truncated mean....
 
Similar to the truncated mean, but, rather than deleting the extreme values, they are set equal to the largest and smallest values that remain
Annualization
Compound annual growth rate

Compound Annual Growth Rate is a business and investing specific term for the geometric mean growth rate on an annualized basis. Itrepresents the smoothed annualized gain earned over the investment time horizon....
 


Solutions to variational problems

Several measures of central tendency can be characterized as solving a variational problem, in the sense of the calculus of variations
Calculus of variations

Calculus of variations is a field of mathematics that deals with functional , as opposed to ordinary calculus which deals with function . Such functionals can for example be formed as integrals involving an unknown function and its derivatives....
, namely minimizing variation from the center. That is, given a measure of statistical dispersion
Statistical dispersion

In statistics, statistical dispersion is variability or spread in a variable or a probability distribution. Common examples of measures of statistical dispersion are the variance, standard deviation and interquartile range....
, one asks for a measure of central tendency that minimizes variation: such that variation from the center is minimal among all choices of center. In a quip, "dispersion precedes location". In the sense of spaces
Lp space

In mathematics, the Lp and lp spaces are spaces of p-integrable function, and corresponding sequence spaces....
, the correspondence is:
dispersion central tendency
average absolute deviation median
Median

In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
standard deviation
Standard deviation

In statistics, standard deviation is a simple measure of the variability or statistical dispersion of a data set. A low standard deviation indicates that all of the data points are very close to the same value , while high standard deviation indicates that the data are ?spread out? over a large range of values....
mean
Mean

In statistics, mean has two related meanings:* the arithmetic mean .* the expected value of a random variable, which is also called the population mean....
maximum deviation midrange
Midrange

In statistics, the mid-range or mid-extreme of a set of statistical data values is the arithmetic mean of the maximum and minimum values in a data set, or:...


Thus standard deviation about the mean is lower than standard deviation about any other point; the uniqueness of this characterization of mean and midrange follows from convex optimization
Convex optimization

Convex optimization is a subfield of optimization . Given a real number vector space together with a convex function, real-valued function defined on a convex set of , the problem is to find the point in for which the number is smallest, i.e., the point such that for all ....
, as the and norms are convex functions. Note that the median in this sense is not in general unique, and in fact any point between the two central points of a discrete distribution minimizes average absolute deviation.

Similarly, the mode
Mode (statistics)

In statistics, the mode is the value that occurs the most frequently in a data set or a probability distribution. In some fields, notably education, sample data are often called scores, and the sample mode is known as the modal score....
 minimizes qualitative variation
Qualitative variation

An index of qualitative variation is a measure of statistical dispersion in nominal distributions. There are a variety of these, but they have been relatively little-studied in the statistics literature....
.

Miscellaneous types


Other more sophisticated averages are: trimean
Trimean

In statistics, the trimean is a measure of a probability distribution's average, defined as a weighted average of the distribution's median and its two quartiles:...
, trimedian, and normalized mean. These are usually more representative of the whole data set.

One can create one's own average metric using generalized f-mean:



where f is any invertible function. The harmonic mean is an example of this using f(x) = 1/x, and the geometric mean is another, using f(x) = log x. Another example, expmean (exponential mean) is a mean using the function f(x) = ex, and it is inherently biased towards the higher values. However, this method for generating means is not general enough to capture all averages. A more general method for defining an average, y, takes any function of a list g(x1, x2, ..., xn), which is symmetric under permutation of the members of the list, and equates it to the same function with the value of the average replacing each member of the list: g(x1, x2, ..., xn) = g(y, y, ..., y). This most general definition still captures the important property of all averages that the average of a list of identical elements is that element itself. The function g(x1, x2, ..., xn) =x1+x2+ ...+ xn provides the arithmetic mean. The function g(x1, x2, ..., xn) =x1·x2· ...· xn provides the geometric mean. The function g(x1, x2, ..., xn) =x1−1+x2−1+ ...+ xn−1 provides the harmonic mean. (See John Bibby (1974) “Axiomatisations of the average and a further generalisation of monotonic sequences,” Glasgow Mathematical Journal, vol. 15, pp. 63–65.)

In data streams

The concept of an average can be applied to a stream of data as well as a bounded set, the goal being to find a value about which recent data is in some way clustered. The stream may be distributed in time, as in samples taken by some data acquisition system from which we want to remove noise, or in space, as in pixels in an image from which we want to extract some property. An easy-to-understand and widely used application of average to a stream is the simple moving average in which we compute the arithmetic mean of the most recent N data items in the stream. To advance one position in the stream, we add 1/N times the new data item and subtract 1/N times the data item N places back in the stream.

Etymology


An early meaning (c. 1500) of the word average is "damage sustained at sea". The root is found in Arabic as awar, in Italian as avaria and in French as avarie. Hence an average adjuster is a person who assesses an insurable loss.

Marine damage is either particular average, which is borne only by the owner of the damaged property, or general average
General average

The law of general average is a legal principle of maritime law according to which all parties in a sea venture proportionally share any losses resulting from a voluntary sacrifice of part of the ship or cargo to save the whole in an emergency....
, where the owner can claim a proportional contribution from all the parties to the marine venture. The type of calculations used in adjusting general average gave rise to the use of "average" to mean "arithmetic mean".

However, according to the Oxford English Dictionary, the earliest usage in English (1489 or earlier) appears to be an old legal term for a tenant's day labour obligation to a sheriff, probably anglicised from "avera" found in the English Domesday Book (1085).

See also

  • Statistics
    Statistics

    Statistics is a Mathematics pertaining to the collection, analysis, interpretation or explanation, and presentation of data. It also provides tools for prediction and forecasting based on data....


External links