All Topics  
Pearson distribution

 

   Email Print
   Bookmark   Link






 

Pearson distribution



 
 
The Pearson distribution is a family of continuous
Continuous probability distribution

In probability theory, a probability distribution is called continuous if its cumulative distribution function is continuous function. This is equivalent to saying that for random variables X with the distribution in question, Pr[X = a] = 0 for all real numbers a, i.e.: the probability that X attains the value a is zer...
 probability distribution
Probability distribution

In probability theory and statistics, a probability distribution identifies either the probability of each value of an unidentified random variable , or the probability of the value falling within a particular interval ....
s. It was first published by Karl Pearson
Karl Pearson

Karl Pearson Fellow of the Royal Society established the disciplineof mathematical statistics.In 1911 he founded the world's first university statistics department at University College London....
 in 1895 and subsequently extended by him in 1901 and 1916 in a series of articles on biostatistics
Biostatistics

Biostatistics is the application of statistics to a wide range of topics in biology. The science of biostatistics encompasses the design of biological experiments, especially in medicine and agriculture; the collection, summarization, and analysis of data from those experiments; and the interpretation of, and inference from, the results....
.

History
The Pearson system was originally devised in an effort to model visibly skew
Skewness

In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real number-valued random variable....
ed observations. It was well known at the time how to adjust a theoretical model to fit the first two cumulant
Cumulant

In probability theory and statistics, if a random variable X admits an expected value ? = E and a variance s2 = E, then these are the first two cumulants: ? = ?1 and s2 = ?2....
s or moment
Moment (mathematics)

The concept of moment in mathematics evolved from the concept of moment in physics. The nth moment of a real-valued function f of a real variable about a value c is...
s of observed data: Any probability distribution
Probability distribution

In probability theory and statistics, a probability distribution identifies either the probability of each value of an unidentified random variable , or the probability of the value falling within a particular interval ....
 can be extended straightforwardly to form a location-scale family
Location-scale family

In probability theory, especially as that field is used in statistics, a location-scale family is a family of univariate probability distributions parametrized by a location parameter μ and a scale parameter σ ≥ 0; if X is any random variable whose probability distribution belongs to such a family, then Y =&...
.






Discussion
Ask a question about 'Pearson distribution'
Start a new discussion about 'Pearson distribution'
Answer questions from other users
Full Discussion Forum



Encyclopedia


The Pearson distribution is a family of continuous
Continuous probability distribution

In probability theory, a probability distribution is called continuous if its cumulative distribution function is continuous function. This is equivalent to saying that for random variables X with the distribution in question, Pr[X = a] = 0 for all real numbers a, i.e.: the probability that X attains the value a is zer...
 probability distribution
Probability distribution

In probability theory and statistics, a probability distribution identifies either the probability of each value of an unidentified random variable , or the probability of the value falling within a particular interval ....
s. It was first published by Karl Pearson
Karl Pearson

Karl Pearson Fellow of the Royal Society established the disciplineof mathematical statistics.In 1911 he founded the world's first university statistics department at University College London....
 in 1895 and subsequently extended by him in 1901 and 1916 in a series of articles on biostatistics
Biostatistics

Biostatistics is the application of statistics to a wide range of topics in biology. The science of biostatistics encompasses the design of biological experiments, especially in medicine and agriculture; the collection, summarization, and analysis of data from those experiments; and the interpretation of, and inference from, the results....
.

History


The Pearson system was originally devised in an effort to model visibly skew
Skewness

In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real number-valued random variable....
ed observations. It was well known at the time how to adjust a theoretical model to fit the first two cumulant
Cumulant

In probability theory and statistics, if a random variable X admits an expected value ? = E and a variance s2 = E, then these are the first two cumulants: ? = ?1 and s2 = ?2....
s or moment
Moment (mathematics)

The concept of moment in mathematics evolved from the concept of moment in physics. The nth moment of a real-valued function f of a real variable about a value c is...
s of observed data: Any probability distribution
Probability distribution

In probability theory and statistics, a probability distribution identifies either the probability of each value of an unidentified random variable , or the probability of the value falling within a particular interval ....
 can be extended straightforwardly to form a location-scale family
Location-scale family

In probability theory, especially as that field is used in statistics, a location-scale family is a family of univariate probability distributions parametrized by a location parameter μ and a scale parameter σ ≥ 0; if X is any random variable whose probability distribution belongs to such a family, then Y =&...
. Except in pathological
Pathological (mathematics)

In mathematics, a pathological phenomenon is one whose properties are considered atypically bad or counterintuitive.Often, when the usefulness of a theorem is challenged by counterexamples, defenders of the theorem argue that the exceptions are pathological....
 cases, a location-scale family can be made to fit the observed mean (first cumulant) and variance
Variance

In probability theory and statistics, the variance of a random variable, probability distribution, or sample is one measure of statistical dispersion, averaging the squared distance of its possible values from the expected value ....
 (second cumulant) arbitrarily well. However, it was not known how to construct probability distributions in which the skewness
Skewness

In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real number-valued random variable....
 (standardized third cumulant) and kurtosis
Kurtosis

In probability theory and statistics, kurtosis is a measure of the "peakedness" of the probability distribution of a real number-valued random variable....
 (standardized fourth cumulant) could be adjusted equally freely. This need became apparent when trying to fit known theoretical models to observed data that exhibited skewness. Pearson's examples include survival data, which are usually asymmetric.

In his original paper, Pearson (1895, p. 360) identified four types of distributions (numbered I through IV) in addition to the normal distribution
Normal distribution

The normal distribution, also called the Gaussian distribution, is an important family of continuous probability distributions, applicable in many fields....
 (which was originally known as type V). The classification depended on whether the distributions were support
Support (mathematics)

In mathematics, the support of a function is the set of points where the function is not zero, or the closure of that set. This concept is used very widely in mathematical analysis....
ed on a bounded interval, on a half-line, or on the whole real line
Real line

In mathematics, the real line is simply the set R of singleton real numbers.However, this term is usually used when R is to be treated as a space of some sort, such as a topological space or a vector space....
; and whether they were potentially skewed or necessarily symmetric. A second paper (Pearson 1901) fixed two omissions: it redefined the type V distribution (originally just the normal distribution
Normal distribution

The normal distribution, also called the Gaussian distribution, is an important family of continuous probability distributions, applicable in many fields....
, but now the inverse-gamma distribution
Inverse-gamma distribution

In probability theory and statistics, the inverse gamma distribution is a two-parameter family of continuous probability distributions on the positive real line, which is the distribution of the multiplicative inverse of a variable distributed according to the gamma distribution....
) and introduced the type VI distribution. Together the first two papers cover the five main types of the Pearson system (I, III, VI, V, and IV). In a third paper, Pearson (1916) introduced further special cases and subtypes (VII through XII).

Rhind (1909, pp. 430–432) devised a simple way of visualizing the parameter space of the Pearson system, which was subsequently adopted by Pearson (1916, plate 1 and pp. 430ff., 448ff.). The Pearson types are characterized by two quantities, commonly referred to as and . The first is the square of the skewness
Skewness

In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real number-valued random variable....
: where is the skewness, or third standardized moment
Standardized moment

In probability theory and statistics, the kthstandardized moment of a probability distribution is where is the kth moment about the mean and σ is the standard deviation....
. The second is the traditional kurtosis
Kurtosis

In probability theory and statistics, kurtosis is a measure of the "peakedness" of the probability distribution of a real number-valued random variable....
, or fourth standardized moment: . (Modern treatments define kurtosis in terms of cumulants instead of moments, so that for a normal distribution we have and . Here we follow the historical precedent and use .) The diagram on the right shows which Pearson type a given concrete distribution (identified by a point ) belongs to.

Many of the skewed and/or non-mesokurtic distributions familiar to us today were still unknown in the early 1890s. What is now known as the beta distribution
Beta distribution

In probability theory and statistics, the beta distribution is a family of continuous probability distributions defined on the interval [0, 1] parameterized by two positive shape parameters, typically denoted by α and β....
 had been used by Thomas Bayes
Thomas Bayes

Thomas Bayes was a Kingdom of Great Britain mathematician and Presbyterian minister, known for having formulated a specific case of the theorem that bears his name: Bayes' theorem, which was published posthumously....
 as a posterior distribution of the parameter of a Bernoulli distribution
Bernoulli distribution

In probability theory and statistics, the Bernoulli distribution, named after Swiss scientist Jacob Bernoulli, is a discrete probability distribution probability distribution, which takes value 1 with success probability and value 0 with failure probability ....
 in his 1763 work on inverse probability
Inverse probability

In probability theory, inverse probability is an obsolete term for the probability distribution of an unobserved variable.Today, the problem of determining an unobserved variable is called inferential statistics, the method of inverse probability is called Bayesian probability, the "distribution" of an unobserved variable given data is ra...
. The Beta distribution gained prominence due to its membership in Pearson's system and was known until the 1940s as the Pearson type I distribution.

(Pearson's type II distribution is a special case of type I, but is usually no longer singled out.) The gamma distribution
Gamma distribution

In probability theory and statistics, the gamma distribution is a two-parameter family of continuous probability distributions. It has a scale parameter θ and a shape parameter k....
 originated from Pearson's work (Pearson 1893, p. 331; Pearson 1895, pp. 357, 360, 373–376) and was known as the Pearson type III distribution, before acquiring its modern name in the 1930s and 1940s.

Pearson's 1895 paper introduced the type IV distribution, which contains Student's t-distribution
Student's t-distribution

In probability and statistics, Student's t-distribution is a probability distribution that arises in the problem of estimating the expected value of a normal distribution Statistical population when the sample size is small....
 as a special case, predating William Gosset's subsequent use by several years. His 1901 paper introduced the inverse-gamma distribution
Inverse-gamma distribution

In probability theory and statistics, the inverse gamma distribution is a two-parameter family of continuous probability distributions on the positive real line, which is the distribution of the multiplicative inverse of a variable distributed according to the gamma distribution....
 (type V) and the beta prime distribution
Beta prime distribution

A Beta Prime Distribution is a probability distribution defined for x>0 with two parameters , a and ?, having the probability density function:...
 (type VI).

Definition


A Pearson density
Probability density function

In mathematics, a probability density function is a function that represents a probability distribution in terms of integrals.Formally, a probability distribution has density ƒ, if ƒ is a non-negative Lebesgue integration function such that the probability of the interval [ab] is given by...
 p is defined to be any valid solution to the differential equation
Differential equation

A differential equation is a mathematics equation for an unknown function of one or several variable that relates the values of the function itself and its derivatives of various orders....
 (cf. Pearson 1895, p. 381)

with :

According to Ord, Pearson devised the underlying form of Equation (1) on the basis of, firstly, the formula for the derivative of the logarithm of the density function of the normal distribution
Normal distribution

The normal distribution, also called the Gaussian distribution, is an important family of continuous probability distributions, applicable in many fields....
 (which gives a linear function) and, secondly, from a recurrence relation for values in the probability mass function
Probability mass function

In probability theory, a probability mass function is a function that gives the probability that a discrete random variable random variable is exactly equal to some value....
 of the hypergeometric distribution
Hypergeometric distribution

In probability theory and statistics, the hypergeometric distribution is a discrete probability distribution that describes the number of successes in a sequence of n draws from a finite population without replacement, just as the binomial distribution describes the number of successes for draws with replacement....
 (which yields the linear-divided-by-quadratic structure).

In Equation (1), the parameter a0 determines a stationary point
Stationary point

In mathematics, particularly in calculus, a stationary point is an input to a function where the derivative is zero : where the function "stops" increasing or decreasing ....
, and hence under some conditions a mode
Mode (statistics)

In statistics, the mode is the value that occurs the most frequently in a data set or a probability distribution. In some fields, notably education, sample data are often called scores, and the sample mode is known as the modal score....
 of the distribution, since

follows directly from the differential equation.

Since we are confronted with a linear differential equation with variable coefficients
Linear differential equation

In mathematics, a linear differential equation is a differential equation of the formwhere the differential operator L is a linear operator, y is the unknown function, and the right hand side ƒ is a given function ....
, its solution is straightforward:

The integral in this solution simplifies considerably when certain special cases of the integrand are considered. Pearson (1895, p. 367) distinguished two main cases, determined by the sign of the discriminant
Discriminant

In algebra, the discriminant of a polynomial with real number or complex number coefficients is a certain expression in the coefficients of the polynomial which is equal to zero if and only if the polynomial has a multiple Root in the complex numbers....
 (and hence the number of real root
Root (mathematics)

In mathematics, a root of a complex-valued Function is a member of the Domain of such that vanishes at , that is,In other words, a "root" of a function is a value for that produces a result of zero ....
s) of the quadratic function
Quadratic function

A quadratic function, in mathematics, is a polynomial function of the form , where . The graph of a function of a quadratic function is a parabola whose major axis is parallel to the y-axis....


Particular types of distribution


Case 1, negative discriminant: The Pearson type IV distribution


If the discriminant of the quadratic function (2) is negative , it has no real roots. Then define

  and

Observe that is a well-defined real number and , because by assumption and therefore . Applying these substitutions, the quadratic function (2) is transformed into

The absence of real roots is obvious from this formulation, because is necessarily positive.

We now express the solution to the differential equation (1) as a function of y:

Pearson (1895, p. 362) called this the "trigonometrical case", because the integral

involves the inverse
Inverse trigonometric function

In mathematics, the inverse trigonometric functions or cyclometric functions are the inverse functions of the trigonometric functions. The principal inverses are listed in the following table....
 trigonometic
Trigonometric function

In mathematics, the trigonometric functions are function s of an angle. They are important in the trigonometry of Triangle and modeling Periodic function, among many other applications....
 arctan function. Then

Finally, let

  and

Applying these substitutions, we obtain the parametric function:

This unnormalized density has support
Support (mathematics)

In mathematics, the support of a function is the set of points where the function is not zero, or the closure of that set. This concept is used very widely in mathematical analysis....
 on the entire real line
Real line

In mathematics, the real line is simply the set R of singleton real numbers.However, this term is usually used when R is to be treated as a space of some sort, such as a topological space or a vector space....
. It depends on a scale parameter
Scale parameter

In probability theory and statistics, a scale parameter is a special kind of numerical parameter of a parametric family of probability distributions....
  and shape parameter
Shape parameter

In probability theory and statistics, a shape parameter is a kind of numerical parameter of a parametric family of probability distributions....
s and . One parameter was lost when we chose to find the solution to the differential equation (1) as a function of y rather than x. We therefore reintroduce a fourth parameter, namely the location parameter
Location parameter

In statistics, a location family is a class of probability distributions parametrized by a scalar- or vector-valued parameter ?, which determines the "location" or shift of the distribution....
 ?. We have thus derived the density of the Pearson type IV distribution:

The normalizing constant
Normalizing constant

The concept of a normalizing constant arises in probability theory and a variety of other areas of mathematics....
 involves the complex Gamma function
Gamma function

In mathematics, the Gamma function is an extension of the factorial function to real number and complex number numbers. For a complex number z with positive real part the Gamma function is defined by...
 (G) and the Beta function
Beta function

In mathematics, the beta function, also called the Euler integral of the first kind, is a special function defined byfor The beta function was studied by Leonhard Euler and Adrien-Marie Legendre and was given its name by Jacques Philippe Marie Binet....
 (B).

The Pearson type VII distribution

The shape parameter ? of the Pearson type IV distribution controls its skewness
Skewness

In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real number-valued random variable....
. If we fix its value at zero, we obtain a symmetric three-parameter family. This special case is known as the Pearson type VII distribution (cf. Pearson 1916, p. 450). Its density is

where B is the Beta function
Beta function

In mathematics, the beta function, also called the Euler integral of the first kind, is a special function defined byfor The beta function was studied by Leonhard Euler and Adrien-Marie Legendre and was given its name by Jacques Philippe Marie Binet....
.

An alternative parameterization (and slight specialization) of the type VII distribution is obtained by letting

which requires . This entails a minor loss of generality but ensures that the variance
Variance

In probability theory and statistics, the variance of a random variable, probability distribution, or sample is one measure of statistical dispersion, averaging the squared distance of its possible values from the expected value ....
 of the distribution exists and is equal to . Now the parameter m only controls the kurtosis
Kurtosis

In probability theory and statistics, kurtosis is a measure of the "peakedness" of the probability distribution of a real number-valued random variable....
 of the distribution. If m approaches infinity as ? and s are held constant, the normal distribution
Normal distribution

The normal distribution, also called the Gaussian distribution, is an important family of continuous probability distributions, applicable in many fields....
 arises as a special case:


This is the density of a normal distribution with mean ? and standard deviation s.

It is convenient to require that and to let

This is another specialization, and it guarantees that the first four moments of the distribution exist. More specifically, the Pearson type VII distribution parameterized in terms of has a mean of ?, standard deviation
Standard deviation

In statistics, standard deviation is a simple measure of the variability or statistical dispersion of a data set. A low standard deviation indicates that all of the data points are very close to the same value , while high standard deviation indicates that the data are ?spread out? over a large range of values....
 of s, skewness
Skewness

In probability theory and statistics, skewness is a measure of the asymmetry of the probability distribution of a real number-valued random variable....
 of zero, and excess kurtosis of .

Student's t-distribution

The Pearson type VII distribution subsumes Student's t-distribution
Student's t-distribution

In probability and statistics, Student's t-distribution is a probability distribution that arises in the problem of estimating the expected value of a normal distribution Statistical population when the sample size is small....
, and hence also the Cauchy distribution
Cauchy distribution

The Cauchy?Lorentz distribution, named after Augustin Cauchy and Hendrik Lorentz,  is a continuous probability distribution. As a probability distribution, it is known as the Cauchy distribution, while among physicists, it is known as a Lorentz distribution, or a Lorentz function or the Breit?Wigner dis...
. Student's t-distribution arises as the result of applying the following substitutions to its original parameterization:

  and

where . Observe that the constraint is satisfied. The density of this restricted one-parameter family is

which is easily recognized as the density of Student's t-distribution.

Case 2, non-negative discriminant


If the quadratic function (2) has a non-negative discriminant , it has real roots a1 and a2 (not necessarily distinct):

One have to define :

In the presence of real roots the quadratic function (2) can be written as

and the solution to the differential equation is therefore

Pearson (1895, p. 362) called this the "logarithmic case", because the integral

involves only the logarithm
Logarithm

In mathematics, the logarithm of a number to a given base is the Power or exponent to which the base must be raised in order to produce the number....
 function, and not the arctan function as in the previous case.

Using the substitution

we obtain the following solution to the differential equation (1):

Since this density is only known up to a hidden constant of proportionality, that constant can be changed and the density written as follows:

The Pearson type I and type II distribution

The Pearson type I distribution (a generalization of the beta distribution
Beta distribution

In probability theory and statistics, the beta distribution is a family of continuous probability distributions defined on the interval [0, 1] parameterized by two positive shape parameters, typically denoted by α and β....
) arises when the roots of the quadratic equation (2) are of opposite sign, that is, . Then the solution p is supported on the interval . Apply the substition

which yields a solution in terms of y that is supported on the interval :

Regrouping constants and parameters, this simplifies to:

Thus follows a with

It turns out that is necessary and sufficient for p to be a proper probability density function.

The Pearson type II distribution

The Pearson type II distribution is a special case of the Pearson type I family restricted to symmetric distributions.

For the Pearson Type II Curve

,

where

the ordinate, y, is the frequency of . The Pearson Type II Curve is used in computing the table of significant correlation coefficients for Spearman's rank correlation coefficient
Spearman's rank correlation coefficient

In statistics, Spearman's rank correlation coefficient or Spearman's rho, named after Charles Spearman and often denoted by the Greek letter rho or as , is a non-parametric statistics measure of correlation – that is, it assesses how well an arbitrary monotonic function could describe the relationship between two variables, witho...
 when the number of items in a series is less than 100 (or 30, depending on some sources). After that, the distribution mimics a standard Student's t-distribution
Student's t-distribution

In probability and statistics, Student's t-distribution is a probability distribution that arises in the problem of estimating the expected value of a normal distribution Statistical population when the sample size is small....
. For the table of values, certain values are used as the constants in the previous equation:

The moments of x used are
The Pearson type III distribution
follows a Pearson type III distribution gamma distribution
Gamma distribution

In probability theory and statistics, the gamma distribution is a two-parameter family of continuous probability distributions. It has a scale parameter θ and a shape parameter k....
, chi-square distribution
Chi-square distribution

In probability theory and statistics, the chi-square distribution is one of the most widely used theoretical probability distributions in inferential statistics, e.g., in statistical significance tests....


The Pearson type V distribution
follows a Pearson type V distribution inverse-gamma distribution
Inverse-gamma distribution

In probability theory and statistics, the inverse gamma distribution is a two-parameter family of continuous probability distributions on the positive real line, which is the distribution of the multiplicative inverse of a variable distributed according to the gamma distribution....


The Pearson type VI distribution
follows a Pearson type VI distribution beta prime distribution
Beta prime distribution

A Beta Prime Distribution is a probability distribution defined for x>0 with two parameters , a and ?, having the probability density function:...
, F-distribution
F-distribution

In probability theory and statistics, the F-distribution is a continuous probability distribution probability distribution. It is also known as Snedecor's F distribution or the Fisher-Snedecor distribution ....


Relation to other distributions


The Pearson family subsumes the following distributions, among others:

  • beta distribution
    Beta distribution

    In probability theory and statistics, the beta distribution is a family of continuous probability distributions defined on the interval [0, 1] parameterized by two positive shape parameters, typically denoted by α and β....
     (type I)
  • beta prime distribution
    Beta prime distribution

    A Beta Prime Distribution is a probability distribution defined for x>0 with two parameters , a and ?, having the probability density function:...
     (type VI)
  • Cauchy distribution
    Cauchy distribution

    The Cauchy?Lorentz distribution, named after Augustin Cauchy and Hendrik Lorentz,  is a continuous probability distribution. As a probability distribution, it is known as the Cauchy distribution, while among physicists, it is known as a Lorentz distribution, or a Lorentz function or the Breit?Wigner dis...
     (type IV)
  • chi-square distribution
    Chi-square distribution

    In probability theory and statistics, the chi-square distribution is one of the most widely used theoretical probability distributions in inferential statistics, e.g., in statistical significance tests....
     (type III)
  • continuous uniform distribution
    Uniform distribution (continuous)

    In probability theory and statistics, the continuous uniform distribution is a family of probability distributions such that for each member of the family, all interval s of the same length on the distribution's support are equally probable....
     (limit of type I)
  • exponential distribution
    Exponential distribution

    In probability theory and statistics, the exponential distributions are a class of continuous probability distributions. They describe the times between events in a Poisson process, i.e....
     (type III)
  • gamma distribution
    Gamma distribution

    In probability theory and statistics, the gamma distribution is a two-parameter family of continuous probability distributions. It has a scale parameter θ and a shape parameter k....
     (type III)
  • F-distribution
    F-distribution

    In probability theory and statistics, the F-distribution is a continuous probability distribution probability distribution. It is also known as Snedecor's F distribution or the Fisher-Snedecor distribution ....
     (type VI)
  • inverse-chi-square distribution
    Inverse-chi-square distribution

    In probability and statistics, the inverse-chi-square distribution is the probability distribution of a random variable whose multiplicative inverse has a chi-square distribution....
     (type V)
  • inverse-gamma distribution
    Inverse-gamma distribution

    In probability theory and statistics, the inverse gamma distribution is a two-parameter family of continuous probability distributions on the positive real line, which is the distribution of the multiplicative inverse of a variable distributed according to the gamma distribution....
     (type V)
  • normal distribution
    Normal distribution

    The normal distribution, also called the Gaussian distribution, is an important family of continuous probability distributions, applicable in many fields....
     (limit of type I, III, IV, V, or VI)
  • Student's t-distribution
    Student's t-distribution

    In probability and statistics, Student's t-distribution is a probability distribution that arises in the problem of estimating the expected value of a normal distribution Statistical population when the sample size is small....
     (type VII, which is the non-skewed subtype of type IV)


Applications


These models are used in financial markets, given their ability to be parametrised in a way that has intuitive meaning for market traders. A number of models are in current use that capture the stochastic nature of the volatility of rates, stocks etc. and this family of distributions may prove to be one of the more important.

In the United States, the Log-Pearson III is the default distribution for flood frequency analysis.

Sources


Primary sources


Secondary sources


  • Milton Abramowitz and Irene A. Stegun (1964). Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables. National Bureau of Standards.


  • Eric W. Weisstein
    Eric W. Weisstein

    Eric W. Weisstein is an encyclopedist who created and maintains MathWorld and Eric Weisstein's World of Science . He currently works for Wolfram Research, Inc....
     et al. . From MathWorld
    MathWorld

    MathWorld is an online mathematics reference work, created and largely written by Eric W. Weisstein. It is sponsored by Wolfram Research Inc. and was partially funded by the National Science Foundation's National Science Digital Library grant to the University of Illinois at Urbana-Champaign....
    .