All Topics  
Random variable

 

   Email Print
   Bookmark   Link






 

Random variable



 
 
In mathematics
Mathematics

Mathematics is the study of quantity, structure, space, change, and related topics of pattern and form. Mathematicians seek out patterns whether found in numbers, space, natural science, computers, imaginary abstractions, or elsewhere....
, random variables are used in the study of chance
Randomness

Randomness is a lack of order, purpose, Causality, or predictability. Randomness as defined by Aristotle is the situation, when a choice is to be made which has no logical component by which to determine or make the choice ....
 and probability
Probability

Probability, or wikt:chance, is a way of expressing knowledge or belief that an Event will occur or has occurred. In mathematics the concept has been given an exact meaning in probability theory, that is used extensively in such areas of study as mathematics, statistics, finance, gambling, science, and philosophy to draw conclusions about t...
. They were developed to assist in the analysis of games of chance
Game of chance

A game of chance is a game whose outcome is strongly influenced by some randomness device, and upon which contestants frequently wager money. Common devices used include dice, spinning tops, playing cards, roulette wheels or numbered balls drawn from a container....
, stochastic
Stochastic

Stochastic means random.A stochastic process is one whose behavior is non-Deterministic system in that a system's subsequent state is determined both by the process's predictable actions and by a random element....
 events, and the results of scientific experiments
Experiment

In scientific inquiry, an experiment is a method of investigating causal relationships among variables. An experiment is a cornerstone of the empiricism approach to acquiring data about the world and is used in both natural sciences and social sciences....
 by capturing only the mathematical properties necessary to answer probabilistic
Probability

Probability, or wikt:chance, is a way of expressing knowledge or belief that an Event will occur or has occurred. In mathematics the concept has been given an exact meaning in probability theory, that is used extensively in such areas of study as mathematics, statistics, finance, gambling, science, and philosophy to draw conclusions about t...
 questions. Further formalizations have firmly grounded the entity in the theoretical domains of mathematics by making use of measure theory.

Fortunately, the language and structure of random variables can be grasped at various levels of mathematical fluency.






Discussion
Ask a question about 'Random variable'
Start a new discussion about 'Random variable'
Answer questions from other users
Full Discussion Forum



Encyclopedia


In mathematics
Mathematics

Mathematics is the study of quantity, structure, space, change, and related topics of pattern and form. Mathematicians seek out patterns whether found in numbers, space, natural science, computers, imaginary abstractions, or elsewhere....
, random variables are used in the study of chance
Randomness

Randomness is a lack of order, purpose, Causality, or predictability. Randomness as defined by Aristotle is the situation, when a choice is to be made which has no logical component by which to determine or make the choice ....
 and probability
Probability

Probability, or wikt:chance, is a way of expressing knowledge or belief that an Event will occur or has occurred. In mathematics the concept has been given an exact meaning in probability theory, that is used extensively in such areas of study as mathematics, statistics, finance, gambling, science, and philosophy to draw conclusions about t...
. They were developed to assist in the analysis of games of chance
Game of chance

A game of chance is a game whose outcome is strongly influenced by some randomness device, and upon which contestants frequently wager money. Common devices used include dice, spinning tops, playing cards, roulette wheels or numbered balls drawn from a container....
, stochastic
Stochastic

Stochastic means random.A stochastic process is one whose behavior is non-Deterministic system in that a system's subsequent state is determined both by the process's predictable actions and by a random element....
 events, and the results of scientific experiments
Experiment

In scientific inquiry, an experiment is a method of investigating causal relationships among variables. An experiment is a cornerstone of the empiricism approach to acquiring data about the world and is used in both natural sciences and social sciences....
 by capturing only the mathematical properties necessary to answer probabilistic
Probability

Probability, or wikt:chance, is a way of expressing knowledge or belief that an Event will occur or has occurred. In mathematics the concept has been given an exact meaning in probability theory, that is used extensively in such areas of study as mathematics, statistics, finance, gambling, science, and philosophy to draw conclusions about t...
 questions. Further formalizations have firmly grounded the entity in the theoretical domains of mathematics by making use of measure theory.

Fortunately, the language and structure of random variables can be grasped at various levels of mathematical fluency. Set theory
Set theory

Set theory is the branch of mathematics that studies Set , which are collections of objects. Although any type of object can be collected into a set, set theory is applied most often to objects that are relevant to mathematics....
 and calculus
Calculus

Calculus is a branch of mathematics that includes the study of limit , derivatives, integrals, and infinite series, and constitutes a major part of modern university education....
 are fundamental.

Broadly, there are two types of random variables — discrete and continuous. Discrete
Discrete probability distribution

Discrete probability distributions arise in the mathematical description of probability theory and statistical analysis in which the values that might be observed are restricted to being within a pre-defined list of possible values....
 random variables take on one of a set of specific values, each with some probability greater than zero. Continuous
Continuous probability distribution

In probability theory, a probability distribution is called continuous if its cumulative distribution function is continuous function. This is equivalent to saying that for random variables X with the distribution in question, Pr[X = a] = 0 for all real numbers a, i.e.: the probability that X attains the value a is zer...
 random variables can be realized with any of a range of values (e.g., a real number between zero and one), and so there are several ranges (e.g. 0 to one half) that have a probability greater than zero of occurring.

A random variable has either an associated probability distribution (discrete random variable) or probability density function (continuous random variable).

Intuitive definition

Intuitively, a random variable is thought of as a function mapping the sample space
Sample space

In probability theory, the sample space or universal sample space, often denoted S, O, or U , of an experiment or random trial and error is the set of all possible outcomes....
 of a random process to the real numbers. A few examples will highlight this.

Examples

For a coin toss, the possible events are heads or tails. The number of heads appearing in one fair coin toss can be described using the following random variable:

with probability mass function
Probability mass function

In probability theory, a probability mass function is a function that gives the probability that a discrete random variable random variable is exactly equal to some value....
 given by:

A random variable can also be used to describe the process of rolling a fair die
Dice

A die is a small polyhedron object, usually cubic, used for generating Statistical randomnesss or other symbols. This makes dice suitable as gambling devices, especially for craps or sic bo, or for use in non-gambling tabletop games....
 and the possible outcomes. The most obvious representation is to take the set as the sample space, defining the random variable X as the number rolled. In this case ,




Formal definition


Let be a probability space
Probability space

A probability space, in probability theory, is the conventional mathematical model of randomness. This mathematical object, sometimes called also probability triple, formalizes three interrelated ideas by three mathematical notions....
 and be a measurable space. Then a random variable X is formally defined as a measurable function
Measurable function

In mathematics, measurable functions are well-behaved function s between sigma-algebra. Functions studied in mathematical analysis that are not measurable are generally considered Pathological ....
 . An interpretation of this is that the preimage of the "well-behaved" subsets of Y (the elements of S) are events (elements of ), and hence are assigned a probability by P.

Real-valued random variables


Typically, the measurable space is the measurable space over the real numbers. In this case, let be a probability space. Then, the function is a real-valued random variable if

Distribution functions of random variables

Associating a cumulative distribution function (CDF) with a random variable is a generalization of assigning a value to a variable. If the CDF is a (right continuous) Heaviside step function
Heaviside step function

The Heaviside step function, H, also called the unit step function, is a continuous function Function whose value is 0 for negative argument and 1 for positive argument....
 then the variable takes on the value at the jump with probability 1. In general, the CDF specifies the probability that the variable takes on particular values.

If a random variable defined on the probability space is given, we can ask questions like "How likely is it that the value of is bigger than 2?". This is the same as the probability of the event which is often written as for short.

Recording all these probabilities of output ranges of a real-valued random variable X yields the probability distribution
Probability distribution

In probability theory and statistics, a probability distribution identifies either the probability of each value of an unidentified random variable , or the probability of the value falling within a particular interval ....
 of X. The probability distribution "forgets" about the particular probability space used to define X and only records the probabilities of various values of X. Such a probability distribution can always be captured by its cumulative distribution function
Cumulative distribution function

In probability theory and statistics, the cumulative distribution function or just distribution function, completely describes the probability distribution of a real-valued random variable X....


and sometimes also using a probability density function
Probability density function

In mathematics, a probability density function is a function that represents a probability distribution in terms of integrals.Formally, a probability distribution has density ƒ, if ƒ is a non-negative Lebesgue integration function such that the probability of the interval [ab] is given by...
. In measure-theoretic terms, we use the random variable X to "push-forward" the measure P on O to a measure dF on R. The underlying probability space O is a technical device used to guarantee the existence of random variables, and sometimes to construct them. In practice, one often disposes of the space O altogether and just puts a measure on R that assigns measure 1 to the whole real line, i.e., one works with probability distributions instead of random variables.

Moments


The probability distribution of a random variable is often characterised by a small number of parameters, which also have a practical interpretation. For example, it is often enough to know what its "average value" is. This is captured by the mathematical concept of expected value
Expected value

In probability theory and statistics, the expected value of a random variable is the Lebesgue integral of the random variable with respect to its probability measure....
 of a random variable, denoted E[X]. In general, E[f(X)] is not equal to f(E[X]). Once the "average value" is known, one could then ask how far from this average value the values of X typically are, a question that is answered by the variance
Variance

In probability theory and statistics, the variance of a random variable, probability distribution, or sample is one measure of statistical dispersion, averaging the squared distance of its possible values from the expected value ....
 and standard deviation
Standard deviation

In statistics, standard deviation is a simple measure of the variability or statistical dispersion of a data set. A low standard deviation indicates that all of the data points are very close to the same value , while high standard deviation indicates that the data are ?spread out? over a large range of values....
 of a random variable.

Mathematically, this is known as the (generalised) problem of moments: for a given class of random variables X, find a collection of functions such that the expectation values E[fi(X)] fully characterize the distribution of the random variable X.

Functions of random variables


If we have a random variable X on O and a Borel measurable function
Measurable function

In mathematics, measurable functions are well-behaved function s between sigma-algebra. Functions studied in mathematical analysis that are not measurable are generally considered Pathological ....
 f: R ? R, then Y = f(X) will also be a random variable on O, since the composition of measurable functions is also measurable. (Warning: this is not true if f is Lebesgue measurable.) The same procedure that allowed one to go from a probability space (O, P) to (R, dFX) can be used to obtain the distribution of Y. The cumulative distribution function
Cumulative distribution function

In probability theory and statistics, the cumulative distribution function or just distribution function, completely describes the probability distribution of a real-valued random variable X....
 of Y is

Example 1


Let X be a real-valued, continuous random variable and let Y = X2.

If y < 0, then P(X2 = y) = 0, so

If y = 0, then

so

Example 2


Suppose is a random variable with a cumulative distribution

where is a fixed parameter. Consider the random variable Then,

The last expression can be calculated in terms of the cumulative distribution of so


Equivalence of random variables


There are several different senses in which random variables can be considered to be equivalent. Two random variables can be equal, equal almost surely, equal in mean, or equal in distribution.

In increasing order of strength, the precise definition of these notions of equivalence is given below.

Equality in distribution


Two random variables X and Y are equal in distribution if they have the same distribution functions:

Two random variables having equal moment generating functions have the same distribution. This provides, for example, a useful method of checking equality of certain functions of i.i.d. random variables.

which is the basis of the Kolmogorov-Smirnov test
Kolmogorov-Smirnov test

In statistics, the Andrey Kolmogorov–Vladimir Ivanovich Smirnov test is a form of minimum distance estimation used as a nonparametric statistics of equality of one-dimensional probability distributions used to compare a random sample with a reference probability distribution , or to compare two samples ....
.

Equality in mean


Two random variables X and Y are equal in p-th mean if the pth moment of |XY| is zero, that is,

As in the previous case, there is a related distance between the random variables, namely

This is equivalent to the following:

Almost sure equality


Two random variables X and Y are equal almost surely if, and only if, the probability that they are different is zero:

For all practical purposes in probability theory, this notion of equivalence is as strong as actual equality. It is associated to the following distance:

where 'sup' in this case represents the essential supremum in the sense of measure theory.

Equality

Finally, the two random variables X and Y are equal if they are equal as functions on their probability space, that is,

Convergence


Much of mathematical statistics consists in proving convergence results for certain sequence
Sequence

In mathematics, a sequence is an ordered list of objects . Like a Set , it contains Element , and the number of terms is called the length of the sequence....
s of random variables; see for instance the law of large numbers
Law of large numbers

The law of large numbers is a theorem in probability that describes the long-term stability of the arithmetic mean of a random variable. Given a random variable with a finite expected value, if its values are repeatedly sampled, as the number of these observations increases, their mean will tend to approach and stay close to the expected va...
 and the central limit theorem
Central limit theorem

The central limit theorem states that the re-averaged sum of a sufficiently large number of Independent and identically-distributed random variables Statistical independence random variables each with finite mean and variance will be approximately normal distribution ....
.

There are various senses in which a sequence (Xn) of random variables can converge to a random variable X. These are explained in the article on convergence of random variables
Convergence of random variables

In probability theory, there exist several different notions of convergence of random variables. The convergence of sequences of random variables to some Limit ing random variable is an important concept in probability theory, and its applications to statistics and stochastic processes....
.

Literature

  • Kallenberg, O.
    Olav Kallenberg

    Olav Kallenberg is a physicist and mathematician living in Auburn, AL, USA. He is known for books, numerous research papers and is an internationally recognized scientist in the area of probability theory....
    , Random Measures, 4th edition. Academic Press, New York, London; Akademie-Verlag, Berlin (1986). MR0854102 ISBN 0123949602
  • Papoulis, Athanasios
    Athanasios Papoulis

    Athanasios Papoulis was a Greek American-United States engineer and applied mathematician....
     1965 Probability, Random Variables, and Stochastic Processes. McGraw-Hill Kogakusha, Tokyo, 9th edition, ISBN 0-07-119981-0.


See also