All Topics  
Convergence of random variables

 

   Email Print
   Bookmark   Link






 

Convergence of random variables



 
 
In probability theory
Probability theory

Probability theory is the branch of mathematics concerned with analysis of Statistical randomness phenomena. The central objects of probability theory are random variables, stochastic processes, and event s: mathematical abstractions of determinism events or measured quantities that may either be single occurrences or evolve over time in an a...
, there exist several different notions of convergence
Convergence

In the absence of a more specific context, convergence denotes the approach toward a definite value, as time goes on; or to a definite point, a common view or opinion, or toward a fixed or equilibrium point state....
 of random variable
Random variable

In mathematics, random variables are used in the study of Randomness and probability. They were developed to assist in the analysis of Game of chance, stochastic events, and the results of experiment by capturing only the mathematical properties necessary to answer probability questions....
s. The convergence (in one of the senses presented below) of sequence
Sequence

In mathematics, a sequence is an ordered list of objects . Like a Set , it contains Element , and the number of terms is called the length of the sequence....
s of random variables to some limit
Limit (mathematics)

In mathematics, the concept of a "limit" is used to describe the behavior of a Function as its argument or input either "gets close" to some point, or as the argument becomes arbitrarily large; or the behavior of a sequence's elements as their index increases indefinitely....
ing random variable is an important concept in probability theory, and its applications to statistics
Statistics

Statistics is a Mathematics pertaining to the collection, analysis, interpretation or explanation, and presentation of data. It also provides tools for prediction and forecasting based on data....
 and stochastic process
Stochastic process

A stochastic process, or sometimes random process, is the counterpart to a deterministic process in probability theory. Instead of dealing with only one possible 'reality' of how the process might evolve under time , in a stochastic or random process there is some indeterminacy in its future evolution described by probability distribu...
es. For example, if the average of n uncorrelated
Uncorrelated

In probability theory and statistics, two real-valued random variables are said to be uncorrelated if their covariance is zero.Uncorrelated random variables have a correlation of zero, except in the trivial case when both variables have variance zero ....
 random variables Yi, i = 1, ..., n, is given by

then as n goes to infinity, Xn converges in probability (see below) to the common mean
Mean

In statistics, mean has two related meanings:* the arithmetic mean .* the expected value of a random variable, which is also called the population mean....
, μ, of the random variables Yi.






Discussion
Ask a question about 'Convergence of random variables'
Start a new discussion about 'Convergence of random variables'
Answer questions from other users
Full Discussion Forum



Encyclopedia


In probability theory
Probability theory

Probability theory is the branch of mathematics concerned with analysis of Statistical randomness phenomena. The central objects of probability theory are random variables, stochastic processes, and event s: mathematical abstractions of determinism events or measured quantities that may either be single occurrences or evolve over time in an a...
, there exist several different notions of convergence
Convergence

In the absence of a more specific context, convergence denotes the approach toward a definite value, as time goes on; or to a definite point, a common view or opinion, or toward a fixed or equilibrium point state....
 of random variable
Random variable

In mathematics, random variables are used in the study of Randomness and probability. They were developed to assist in the analysis of Game of chance, stochastic events, and the results of experiment by capturing only the mathematical properties necessary to answer probability questions....
s. The convergence (in one of the senses presented below) of sequence
Sequence

In mathematics, a sequence is an ordered list of objects . Like a Set , it contains Element , and the number of terms is called the length of the sequence....
s of random variables to some limit
Limit (mathematics)

In mathematics, the concept of a "limit" is used to describe the behavior of a Function as its argument or input either "gets close" to some point, or as the argument becomes arbitrarily large; or the behavior of a sequence's elements as their index increases indefinitely....
ing random variable is an important concept in probability theory, and its applications to statistics
Statistics

Statistics is a Mathematics pertaining to the collection, analysis, interpretation or explanation, and presentation of data. It also provides tools for prediction and forecasting based on data....
 and stochastic process
Stochastic process

A stochastic process, or sometimes random process, is the counterpart to a deterministic process in probability theory. Instead of dealing with only one possible 'reality' of how the process might evolve under time , in a stochastic or random process there is some indeterminacy in its future evolution described by probability distribu...
es. For example, if the average of n uncorrelated
Uncorrelated

In probability theory and statistics, two real-valued random variables are said to be uncorrelated if their covariance is zero.Uncorrelated random variables have a correlation of zero, except in the trivial case when both variables have variance zero ....
 random variables Yi, i = 1, ..., n, is given by

then as n goes to infinity, Xn converges in probability (see below) to the common mean
Mean

In statistics, mean has two related meanings:* the arithmetic mean .* the expected value of a random variable, which is also called the population mean....
, μ, of the random variables Yi. This result is known as the weak law of large numbers. Other forms of convergence are important in other useful theorems, including the central limit theorem
Central limit theorem

The central limit theorem states that the re-averaged sum of a sufficiently large number of Independent and identically-distributed random variables Statistical independence random variables each with finite mean and variance will be approximately normal distribution ....
.

Throughout the following, we assume that (Xn) is a sequence of random variables, and X is a random variable, and all of them are defined on the same probability space
Probability space

A probability space, in probability theory, is the conventional mathematical model of randomness. This mathematical object, sometimes called also probability triple, formalizes three interrelated ideas by three mathematical notions....
 (Ω, F, P).

Convergence in distribution


Suppose that F1, F2, ... is a sequence
Sequence

In mathematics, a sequence is an ordered list of objects . Like a Set , it contains Element , and the number of terms is called the length of the sequence....
 of cumulative distribution function
Cumulative distribution function

In probability theory and statistics, the cumulative distribution function or just distribution function, completely describes the probability distribution of a real-valued random variable X....
s corresponding to random variables X1, X2, ..., and that F is a distribution function corresponding to a random variable X. We say that the sequence Xn converges towards X in distribution, if



for every real number
Real number

In mathematics, the real numbers may be described informally in several different ways. The real numbers include both rational numbers, such as 42 and −23/129, and irrational numbers, such as pi and the square root of two; or, a real number can be given by an infinite decimal representation, such as 2.4871773339...., where the digits co...
 a at which F is continuous
Continuous function

In mathematics, a continuous function is a function for which, intuitively, small changes in the input result in small changes in the output. Otherwise, a function is said to be discontinuous....
. Since F(a) = Pr(X ≤ a), this means that the probability that the value of X is in a given range is very similar to the probability that the value of Xn is in that range, provided n is sufficiently large
Sufficiently large

In mathematics, the phrase sufficiently large is used in contexts such as: is true for sufficiently large which is actually shorthand for:This does not necessarily mean that any particular value for is known, but only that such an exists....
. Convergence in distribution is often denoted by adding the letter over an arrow indicating convergence:



Lower-case d is also used, although less frequently.

Convergence in distribution is the weakest form of convergence, and is sometimes called weak convergence (main article: weak convergence of measures). It does not, in general, imply any other mode of convergence. However, convergence in distribution is implied by all other modes of convergence mentioned in this article, and hence, it is the most common and often the most useful form of convergence of random variables. It is the notion of convergence used in the central limit theorem
Central limit theorem

The central limit theorem states that the re-averaged sum of a sufficiently large number of Independent and identically-distributed random variables Statistical independence random variables each with finite mean and variance will be approximately normal distribution ....
.

A useful result, which may be employed in conjunction with law of large numbers and the central limit theorem
Central limit theorem

The central limit theorem states that the re-averaged sum of a sufficiently large number of Independent and identically-distributed random variables Statistical independence random variables each with finite mean and variance will be approximately normal distribution ....
, is that if a function  g: RR  is continuous, then if  Xn  converges in distribution to  X, then so too does  g(Xn)  converge in distribution to  g(X). (This may be proved using Skorokhod's representation theorem
Skorokhod's representation theorem

In mathematics and statistics, Skorokhod's representation theorem is a result that shows that a Weak convergence of measures sequence of probability measures whose limit measure is sufficiently well-behaved can be represented as the distribution/law of a sequence of random variables defined on a common probability space....
.) This fact could be taken as a definition for the convergence in distribution.

The Lévy continuity theorem
Lévy continuity theorem

The L?vy continuity theorem in probability theory, named after the French mathematician Paul Pierre L?vy, is the basis for one approach to prove the central limit theorem and it is one of the central theorems concerning Characteristic function s....
 states that converges in distribution to some random variable with the characteristic function if is the pointwise convergent limit of and is continuous at .

Convergence in distribution is also called convergence in law, since the word "law" is sometimes used as a synonym of "probability distribution."

Convergence in probability


To say that the sequence Xn of random variables converges towards X in probability means

for every ε > 0. Formally, pick any ε > 0 and any δ > 0. Let Pn be the probability that Xn is outside a tolerance ε of X. Then, if Xn converges in probability to X then there exists a value N such that, for all nN, Pn is itself less than δ.

Convergence in probability is often denoted by adding the letter over an arrow indicating convergence:
It is also denoted



or simply



Convergence in probability is the notion of convergence used in the weak law of large numbers. Convergence in probability implies convergence in distribution. Lemma Let X, Y be random variables, c a real number and ε > 0; then

Proof of lemma

since

Proof

Recall that in order to prove convergence in distribution, one must show that the sequence of cumulative distribution functions converges to the FX at every point where FX is continuous. Let a be such a point.

For every , due to the preceding lemma, we have:

So, we have

Taking the limit as , we obtain:

But is the cumulative distribution function
Cumulative distribution function

In probability theory and statistics, the cumulative distribution function or just distribution function, completely describes the probability distribution of a real-valued random variable X....
 FX, evaluated at the point a, at which the function is continous by assumption. This means that

and so, taking the limit as , we obtain

In the special case when X = c is a constant (that is, a random variable that attains the deterministic value c with probability one), convergence of Xn to c in probability is equivalent to convergence to c in distribution.

Convergence in probability defines a topology
Topology

Topology is a major area of mathematics that has emerged through the development of concepts from geometry and set theory, such as those of space, dimension, shape, transformation and others....
 on the space of random variables over a fixed probability space. In fact, this topology is metrizable and a metric d that generates the topology is given by

Almost sure convergence


To say that the sequence Xn converges almost surely or almost everywhere or with probability 1 or strongly towards X means

This means that the values of Xn approach the value of X, in the sense (see almost surely
Almost surely

In probability theory, one says that an event happens almost surely if it happens with probability one. The concept is analogous to the concept of "almost everywhere" in measure theory....
) that events for which Xn does not converge to X have probability 0. Using the probability space and the concept of the random variable as a function from Ω to R, this is equivalent to the statement

Another, equivalent, way of defining almost sure convergence is as follows: Let . Then the almost sure convergence becomes


Almost sure convergence is often denoted by adding the letters a.s. over an arrow indicating convergence:


Almost sure convergence implies convergence in probability, and hence implies convergence in distribution. It is the notion of convergence used in the strong law of large numbers
Law of large numbers

The law of large numbers is a theorem in probability that describes the long-term stability of the arithmetic mean of a random variable. Given a random variable with a finite expected value, if its values are repeatedly sampled, as the number of these observations increases, their mean will tend to approach and stay close to the expected va...
.

Sure convergence


To say that the sequence or random variables (Xn) defined over the same probability space
Probability space

A probability space, in probability theory, is the conventional mathematical model of randomness. This mathematical object, sometimes called also probability triple, formalizes three interrelated ideas by three mathematical notions....
 (i.e., a random process) converges surely or everywhere or pointwise towards X means where is the sample space
Sample space

In probability theory, the sample space or universal sample space, often denoted S, O, or U , of an experiment or random trial and error is the set of all possible outcomes....
 of the underlying probability space
Probability space

A probability space, in probability theory, is the conventional mathematical model of randomness. This mathematical object, sometimes called also probability triple, formalizes three interrelated ideas by three mathematical notions....
 over which the random variables are defined.

This is the notion of pointwise convergence
Pointwise convergence

In mathematics, pointwise convergence is one of various senses in which a sequence of functions can converge to a particular function....
 of sequence functions extended to sequence of random variables. (Note that random variables themselves are functions).

Sure convergence of a random variable implies all the other kinds of convergence stated above, but there is no payoff in probability theory
Probability theory

Probability theory is the branch of mathematics concerned with analysis of Statistical randomness phenomena. The central objects of probability theory are random variables, stochastic processes, and event s: mathematical abstractions of determinism events or measured quantities that may either be single occurrences or evolve over time in an a...
 by using sure convergence compared to using almost sure convergence. The difference between the two only exists on sets with probability zero. This is why the concept of sure convergence of random variables is very rarely used.

Convergence in mean


We say that the sequence Xn converges in the r-th mean or in the Lr norm
Lp space

In mathematics, the Lp and lp spaces are spaces of p-integrable function, and corresponding sequence spaces....
 towards X, if r ≥ 1, E|Xn|r < ∞ for all n, and



where the operator E denotes the expected value
Expected value

In probability theory and statistics, the expected value of a random variable is the Lebesgue integral of the random variable with respect to its probability measure....
. Convergence in rth mean tells us that the expectation of the r-th power of the difference between Xn and X converges to zero.

This type of convergence is often denoted by adding the letter Lr over an arrow indicating convergence:



The most important cases of convergence in r-th mean are:
  • When Xn converges in r-th mean to X for r = 1, we say that Xn converges in mean to X.
  • When Xn converges in r-th mean to X for r = 2, we say that Xn converges in mean square to X. This is also sometimes referred to as convergence in mean, and is sometimes denoted


Convergence in the r-th mean, for r > 0, implies convergence in probability (by Markov's inequality
Markov's inequality

In probability theory, Markov's inequality gives an upper bound for the probability that a negative and non-negative numbers function of a random variable is greater than or equal to some positive constant....
), while if r > s ≥ 1, convergence in r-th mean implies convergence in s-th mean. Hence, convergence in mean square implies convergence in mean.

Implications


The chain of implications between the various notions of convergence are noted in their respective sections. They are, using the arrow notation


No other implications other than these hold in general, but a number of special cases do permit the converse implications:

  • If Xn converges in distribution to a constant c, then Xn converges in probability to c.


  • If Xn converges in probability to X, and if for all n and some b, then Xn converges in rth mean to X for all r ≥ 1. In other words, if Xn converges in probability to X and all random variables Xn are almost surely bounded above and below, then Xn converges to X also in any rth mean.


  • If for all ε > 0,




then we say that Xn converges almost completely, or fast in probability towards X. When Xn converges almost completely towards X then it also converges almost surely to X. In other words, if Xn converges in probability to X sufficiently quickly (i.e. the above sequence of tail probabilities is summable for all ε > 0), then Xn also converges almost surely to X. This is a direct implication from the Borel-Cantelli lemma
Borel-Cantelli lemma

In probability theory, the Borel?Cantelli lemma is a theorem about sequences of event . In a slightly more general form, it is also a result in measure theory....
.


  • If Sn is a sum of n real independent random variables:




then Sn converges almost surely if and only if Sn converges in probability.


  • Lévy's convergence theorem
    Lévy's convergence theorem

    In probability theory L?vy's convergence theorem states that for a sequence of random variables where* and* where Y is some random variable with...
     gives sufficient conditions for almost sure convergence to imply L1-convergence:


See also


  • Continuous stochastic process
    Continuous stochastic process

    In the theory of stochastic processes, there are many senses in which a process may be said to be "continuous function" as a function of its "time" parameter....
    : the question of continuity of a stochastic process
    Stochastic process

    A stochastic process, or sometimes random process, is the counterpart to a deterministic process in probability theory. Instead of dealing with only one possible 'reality' of how the process might evolve under time , in a stochastic or random process there is some indeterminacy in its future evolution described by probability distribu...
     is essentially a question of convergence, and many of the same concepts and relationships used above apply to the continuity question.
  • Slutsky's theorem
    Slutsky's theorem

    In mathematics, in particular probability theory, Slutsky's theorem, named after Eugen Slutsky, extends some properties of algebraic operations on Limit of a sequence of real numbers to sequences of random variables....
  • Asymptotic distribution
    Asymptotic distribution

    In mathematics and statistics, an asymptotic distribution is a hypothetical distribution that is in a sense the "limiting" distribution of a sequence of distributions....
  • The has some examples and counter-examples.


External links

  • Convergence Wikibook