All Topics  
Estimation theory

 

   Email Print
   Bookmark   Link






 

Estimation theory



 
 
Estimation theory is a branch of statistics
Statistics

Statistics is a Mathematics pertaining to the collection, analysis, interpretation or explanation, and presentation of data. It also provides tools for prediction and forecasting based on data....
 and signal processing
Signal processing

Signal processing is the analysis, interpretation, and manipulation of signal . Signals of interest include: audio signal processing, , time-varying measurement values and sensor data, for example biological data such as electrocardiograms, control system signals, telecommunication transmission signals such as radio signals, and many others....
 that deals with estimating the values of parameters based on measured/empirical data. The parameters describe an underlying physical setting in such a way that the value of the parameters affects the distribution of the measured data. An estimator
Estimator

In statistics, an estimator is a function of the observable sample data that is used to estimate an unknown population parameter ; an estimate is the result from the actual application of the function to a particular Sampling_ of data....
 attempts to approximate the unknown parameters using the measurements.

For example, it is desired to estimate the proportion of a population of voters who will vote for a particular candidate.






Discussion
Ask a question about 'Estimation theory'
Start a new discussion about 'Estimation theory'
Answer questions from other users
Full Discussion Forum



Encyclopedia


Estimation theory is a branch of statistics
Statistics

Statistics is a Mathematics pertaining to the collection, analysis, interpretation or explanation, and presentation of data. It also provides tools for prediction and forecasting based on data....
 and signal processing
Signal processing

Signal processing is the analysis, interpretation, and manipulation of signal . Signals of interest include: audio signal processing, , time-varying measurement values and sensor data, for example biological data such as electrocardiograms, control system signals, telecommunication transmission signals such as radio signals, and many others....
 that deals with estimating the values of parameters based on measured/empirical data. The parameters describe an underlying physical setting in such a way that the value of the parameters affects the distribution of the measured data. An estimator
Estimator

In statistics, an estimator is a function of the observable sample data that is used to estimate an unknown population parameter ; an estimate is the result from the actual application of the function to a particular Sampling_ of data....
 attempts to approximate the unknown parameters using the measurements.

For example, it is desired to estimate the proportion of a population of voters who will vote for a particular candidate. That proportion is the unobservable parameter; the estimate is based on a small random sample of voters.

Or, for example, in radar
Radar

Radar is a system that uses electromagnetic radiation waves to identify the range, altitude, direction, or speed of both moving and fixed objects such as aircraft, ships, motor vehicles, weather formations, and terrain....
 the goal is to estimate the location of objects (airplanes, boats, etc.) by analyzing the received echo and a possible question to be posed is "where are the airplanes?" To answer where the airplanes are, it is necessary to estimate the distance the airplanes are at from the radar station, which can provide an absolute location if the absolute location of the radar station is known.

In estimation theory, it is assumed that the desired information is embedded in a noisy signal
Signal (electrical engineering)

In the fields of telecommunications, signal processing, and in electrical engineering more generally, a signal is any time-varying or spatial-varying quantity....
. Noise adds uncertainty, without which the problem would be deterministic
Determinism

Determinism is the philosophy proposition that every event, including human cognition and behavior, decision and action, is causality determined by an unbroken chain of prior occurrences. With numerous historical debates, many varieties and philosophical positions on the subject of determinism exist from traditions throughout...
 and estimation would not be needed.

Estimation process

The entire purpose of estimation theory is to arrive at an estimator, and preferably an implementable one that could actually be used. The estimator takes the measured data as input and produces an estimate of the parameters.

It is also preferable to derive an estimator that exhibits optimality
Optimization (mathematics)

In mathematics, the simplest case of optimization, or mathematical programming, refers to the study of problems in which one seeks to maxima and minima or maxima and minima a Function of a real variable by systematically choosing the values of Real number or integer variables from within an allowed set....
. An optimal estimator would indicate that all available information in the measured data has been extracted, for if there was unused information in the data then the estimator would not be optimal.

These are the general steps to arrive at an estimator:
  • In order to arrive at a desired estimator for estimating a single or multiple parameters, it is first necessary to determine a model for the system. This model should incorporate the process being modeled as well as points of uncertainty and noise. The model describes the physical scenario in which the parameters apply.
  • After deciding upon a model, it is helpful to find the limitations placed upon an estimator. This limitation, for example, can be found through the Cramér-Rao bound.
  • Next, an estimator needs to be developed or applied if an already known estimator is valid for the model. The estimator needs to be tested against the limitations to determine if it is an optimal estimator (if so, then no other estimator will perform better).
  • Finally, experiments or simulations can be run using the estimator to test its performance.


After arriving at an estimator, real data might show that the model used to derive the estimator is incorrect, which may require repeating these steps to find a new estimator. A non-implementable or infeasible estimator may need to be scrapped and the process started anew.

In summary, the estimator estimates the parameters of a physical model based on measured data.

Basics

To build a model, several statistical "ingredients" need to be known. These are needed to ensure the estimator has some mathematical tractability instead of being based on "good feel".

The first is a set of statistical samples taken from a random vector (RV) of size N. Put into a vector,



Secondly, we have the corresponding M parameters



which need to be established with their probability density function
Probability density function

In mathematics, a probability density function is a function that represents a probability distribution in terms of integrals.Formally, a probability distribution has density ƒ, if ƒ is a non-negative Lebesgue integration function such that the probability of the interval [ab] is given by...
 (pdf) or probability mass function
Probability mass function

In probability theory, a probability mass function is a function that gives the probability that a discrete random variable random variable is exactly equal to some value....
 (pmf)



It is also possible for the parameters themselves to have a probability distribution (e.g., Bayesian statistics). It is then necessary to define the epistemic probability



After the model is formed, the goal is to estimate the parameters, commonly denoted , where the "hat" indicates the estimate.

One common estimator is the minimum mean squared error (MMSE) estimator, which utilizes the error between the estimated parameters and the actual value of the parameters



as the basis for optimality. This error term is then squared and minimized for the MMSE estimator.

Estimators

Commonly-used estimators and estimation methods, and topics related to them:
  • Maximum likelihood
    Maximum likelihood

    Maximum likelihood estimation is a popular statistics method used for fitting a mathematical model to data. The modeling of real world data using estimation by maximum likelihood offers a way of tuning the free parameters of the model to provide a good fit....
     estimators
  • Bayes estimator
    Bayes estimator

    In decision theory and estimation theory, a Bayes estimator is an estimator or decision rule that maximizes the posterior probability expected value of a utility function or minimizes the posterior expected value of a loss function ....
    s
  • Method of moments
    Method of moments (statistics)

    In statistics, the method of moments is a method of estimation of population parameters such as mean, variance, median, etc. , by equating sample moment with unobservable population moments and then solving those equations for the quantities to be estimated....
     estimators
  • Cramér-Rao bound
  • Minimum mean squared error (MMSE), also known as Bayes least squared error (BLSE)
  • Maximum a posteriori
    Maximum a posteriori

    In statistics, the method of maximum a posteriori estimation theory can be used to obtain a point estimation of an unobserved quantity on the basis of empirical data....
     (MAP)
  • Minimum variance unbiased estimator (MVUE)
  • Best linear unbiased estimator (BLUE)
  • Unbiased estimators — see estimator bias.
  • Particle filter
    Particle filter

    Particle filters, also known as sequential Monte Carlo methods , are sophisticated model estimation techniques based on simulation.They are usually used to estimate Bayesian models and are the Sequential estimation analogue of Markov chain Monte Carlo batch methods and are often similar to importance sampling methods....
  • Markov chain Monte Carlo
    Markov chain Monte Carlo

    Markov chain Monte Carlo method methods , are a class of algorithms for sampling from probability distributions based on constructing a Markov chain that has the desired distribution as its Markov chain#Steady-state_analysis_and_limiting_distributions....
     (MCMC)
  • Kalman filter
    Kalman filter

    The Kalman filter is an efficient recursive filter that estimates the state of a Linear system from a series of noise measurements. It is used in a wide range of engineering applications from radar to computer vision, and is an important topic in control theory and control systems engineering....
  • Ensemble Kalman filter
    Ensemble Kalman filter

    The ensemble Kalman filter is a recursive filter suitable for problems with a large number of variables, such as discretizations of partial differential equations in geophysical models....
     (EnKF)
  • Wiener filter
    Wiener filter

    In signal processing, the Wiener filter is a filter proposed by Norbert Wiener during the 1940s and published in 1949. Its purpose is to reduce the amount of noise present in a signal by comparison with an estimation of the desired noiseless signal....


Examples


Unknown constant in additive white Gaussian noise

Consider a received discrete signal
Discrete signal

A discrete signal or discrete-time signal is a time series, perhaps a signal that has been sampling from a continuous signal.Unlike a continuous-time signal, a discrete-time signal is not a function of a continuous-time argument, but is a sequence of quantities; that is, a function over a Domain of discrete integers....
, , of independent
Statistical independence

In probability theory, to say that two event s are independent intuitively means that the occurrence of one event makes it neither more nor less probable that the other occurs....
 samples that consists of an unknown constant with additive white Gaussian noise
Additive white Gaussian noise

ExplanationIn Telecommunication, the additive white Gaussian noise channel model is one in which the information is given a single impairment: a linear addition of wideband or white noise with a constant spectral density and a Gaussian distribution of noise samples....
  with known variance
Variance

In probability theory and statistics, the variance of a random variable, probability distribution, or sample is one measure of statistical dispersion, averaging the squared distance of its possible values from the expected value ....
  (i.e., ). Since the variance is known then the only unknown parameter is .

The model for the signal is then


Two possible (of many) estimators are:
  • which is the sample mean


Both of these estimators have a mean
Mean

In statistics, mean has two related meanings:* the arithmetic mean .* the expected value of a random variable, which is also called the population mean....
 of , which can be shown through taking the expected value
Expected value

In probability theory and statistics, the expected value of a random variable is the Lebesgue integral of the random variable with respect to its probability measure....
 of each estimator

and

At this point, these two estimators would appear to perform the same. However, the difference between them becomes apparent when comparing the variances.

and

It would seem that the sample mean is a better estimator since, as , the variance goes to zero.

Maximum likelihood
Continuing the example using the maximum likelihood
Maximum likelihood

Maximum likelihood estimation is a popular statistics method used for fitting a mathematical model to data. The modeling of real world data using estimation by maximum likelihood offers a way of tuning the free parameters of the model to provide a good fit....
 estimator, the probability density function
Probability density function

In mathematics, a probability density function is a function that represents a probability distribution in terms of integrals.Formally, a probability distribution has density ƒ, if ƒ is a non-negative Lebesgue integration function such that the probability of the interval [ab] is given by...
 (pdf) of the noise for one sample is

and the probability of becomes ( can be thought of a )

By independence, the probability of becomes

Taking the natural logarithm
Natural logarithm

The natural logarithm, formerly known as the hyperbolic logarithm, is the logarithm to the base e , where e is an irrational number constant approximately equal to 2.718281828....
 of the pdf

and the maximum likelihood estimator is

Taking the first derivative
Derivative

In calculus, a branch of mathematics, the derivative is a measure of how a function changes as its input changes. Loosely speaking, a derivative can be thought of as how much a quantity is changing at a given point....
 of the log-likelihood function

and setting it to zero

This results in the maximum likelihood estimator

which is simply the sample mean. From this example, it was found that the sample mean is the maximum likelihood estimator for samples of a fixed, unknown parameter corrupted by AWGN.

Cramér–Rao lower bound

To find the Cramér-Rao lower bound (CRLB) of the sample mean estimator, it is first necessary to find the Fisher information
Fisher information

In statistics and information theory, the Fisher information is the variance of the score . It is named in honor of its inventor, the statistician Ronald Fisher....
 number

and copying from above

Taking the second derivative

and finding the negative expected value is trivial since it is now a deterministic constant

Finally, putting the Fisher information into

results in

Comparing this to the variance of the sample mean (determined previously) shows that the sample mean is equal to the Cramér-Rao lower bound for all values of and . In other words, the sample mean is the (necessarily unique) efficient estimator, and thus also the minimum variance unbiased estimator (MVUE), in addition to being the maximum likelihood
Maximum likelihood

Maximum likelihood estimation is a popular statistics method used for fitting a mathematical model to data. The modeling of real world data using estimation by maximum likelihood offers a way of tuning the free parameters of the model to provide a good fit....
 estimator.

Maximum of a uniform distribution

One of the simplest non-trivial examples of estimation is the estimation of the maximum of a uniform distribution. It is used as a hands-on classroom exercise and to illustrate basic principles of estimation theory. Further, in the case of estimation based on a single sample, it demonstrates philosophical issues and possible misunderstandings in the use of maximum likelihood
Maximum likelihood

Maximum likelihood estimation is a popular statistics method used for fitting a mathematical model to data. The modeling of real world data using estimation by maximum likelihood offers a way of tuning the free parameters of the model to provide a good fit....
 estimators and likelihood functions.

Given a discrete uniform distribution with unknown maximum, the UMVU estimator for the maximum is given by where m is the sample maximum and k is the sample size
Sample size

The sample size of a statistical sample is the number of observations that constitute it. It is typically denoted n, a positive integer ....
, sampling without replacement. This problem is commonly known as the German tank problem
German tank problem

File:PantherTankColor.jpgIn the statistical theory of estimation theory, estimating the maximum of a uniform distribution is a common illustration of differences between estimation methods....
, due to application of maximum estimation to estimates of German tank production during World War II
World War II

World War II, or the Second World War , was a global military conflict which involved a Participants in World War II, including all of the great powers, organised into two opposing military alliances: the Allies of World War II and the Axis powers....
.

The formula may be understood intuitively as:
"The sample maximum plus the average gap between observations in the sample",
the gap being added to compensate for the negative bias of the sample maximum as an estimator for the population maximum.The sample maximum is never more than the population maximum, but can be less, hence it is a biased estimator: it will tend to underestimate the population maximum.

This has a variance of so a standard deviation of approximately , the (population) average size of a gap between samples; compare above. This can be seen as a very simple case of maximum spacing estimation
Maximum spacing estimation

In mathematics, Maximum spacing estimation , or maximum product of spacing estimation , is a statistics method for fitting the parameters of a mathematical model to data ....
.

The sample maximum is the maximum likelihood
Maximum likelihood

Maximum likelihood estimation is a popular statistics method used for fitting a mathematical model to data. The modeling of real world data using estimation by maximum likelihood offers a way of tuning the free parameters of the model to provide a good fit....
 estimator for the population maximum, but, as discussed above, it is biased.

Applications


Numerous fields require the use of estimation theory. Some of these fields include (but are by no means limited to):

  • Interpretation of scientific experiment
    Experiment

    In scientific inquiry, an experiment is a method of investigating causal relationships among variables. An experiment is a cornerstone of the empiricism approach to acquiring data about the world and is used in both natural sciences and social sciences....
    s
  • Signal processing
    Signal processing

    Signal processing is the analysis, interpretation, and manipulation of signal . Signals of interest include: audio signal processing, , time-varying measurement values and sensor data, for example biological data such as electrocardiograms, control system signals, telecommunication transmission signals such as radio signals, and many others....
  • Clinical trial
    Clinical trial

    In health care, clinical trials are conducted to allow safety and efficacy data to be collected for new drugs or devices. These trials can only take place once satisfactory information has been gathered on the quality of the product and its non-clinical safety, and Institutional review board approval is granted in the country where the trial...
    s
  • Opinion poll
    Opinion poll

    An opinion poll is a statistical survey of public opinion from a particular sampling . Opinion polls are usually designed to represent the opinions of a population by conducting a series of questions and then extrapolating generalities in ratio or within confidence intervals....
    s
  • Quality control
    Quality control

    In engineering and manufacturing, quality control and quality engineering are used in developing systems to ensure product s or Service are designed and produced to meet or exceed customer requirements....
  • Telecommunication
    Telecommunication

    Telecommunication is the assisted Transmission of Signal over a distance for the purpose of communication. In earlier times, this may have involved the use of smoke signals, Drum , Semaphore line, flag signals or heliograph....
    s
  • Project management
    Project management

    Project management is the List of academic disciplines of planning, organizing and managing resources to bring about the successful completion of specific project goals and objectives....
  • Software engineering
    Software engineering

    Software engineering is the application of a systematic, disciplined, quantifiable approach to the development, operation, and maintenance of software, and the study of these approaches....
  • Control theory
    Control theory

    Control theory is an interdisciplinary branch of engineering and mathematics, that deals with the behavior of dynamical systems. The desired output of a system is called the reference....
  • Network intrusion detection system
    Network intrusion detection system

    A network intrusion detection system is an intrusion detection system that tries to detect malicious activity such as denial of service attacks, port scans or even attempts to black hat into computers by monitoring computer network traffic....
  • Orbit Determination
    Orbit determination

    Orbit determination is a branch of astronomy specialised in calculating, and hence predicting, the orbits of objects, primarily around the Earth....


Measured data are likely to be subject to noise or uncertainty and it is through statistical probability
Probability

Probability, or wikt:chance, is a way of expressing knowledge or belief that an Event will occur or has occurred. In mathematics the concept has been given an exact meaning in probability theory, that is used extensively in such areas of study as mathematics, statistics, finance, gambling, science, and philosophy to draw conclusions about t...
 that optimal
Optimization (mathematics)

In mathematics, the simplest case of optimization, or mathematical programming, refers to the study of problems in which one seeks to maxima and minima or maxima and minima a Function of a real variable by systematically choosing the values of Real number or integer variables from within an allowed set....
 solutions are sought to extract as much information
Fisher information

In statistics and information theory, the Fisher information is the variance of the score . It is named in honor of its inventor, the statistician Ronald Fisher....
 from the data as possible.

See also

Category:Estimation theory
Category:Estimation for specific distributions
  • Best linear unbiased estimator (BLUE)
  • Chebyshev center
    Chebyshev center

    The Chebyshev center of a bounded set having non-empty interior is the center of the minimal radius ball enclosing the entire set .In the field of parameter estimation, the Chebyshev center approach tries to find an estimator for given the feasibility set , such that minimizes the worst possible estimation error for x ....
  • Completeness (statistics)
    Completeness (statistics)

    In statistics, completeness is a property of a statistic for which the statistic optimally obtains information about the unknown parameters characterizing the distribution of the underlying data....
  • Cramér-Rao bound
  • Detection theory
    Detection theory

    Detection theory, or signal detection theory, is a means to quantify the ability to discern between signal and signal noise.According to the theory, there are a number of psychological determiners of how we will detect a signal, and where our threshold levels will be....
  • Efficiency (statistics)
    Efficiency (statistics)

    In statistics, efficiency is a term used in the comparison of various statistical procedures and, in particular, it refers to a measure of the desirability of an estimator or of an experimental design....
  • Estimator
    Estimator

    In statistics, an estimator is a function of the observable sample data that is used to estimate an unknown population parameter ; an estimate is the result from the actual application of the function to a particular Sampling_ of data....
    , Estimator bias
  • Expectation-maximization algorithm
    Expectation-maximization algorithm

    An expectation-maximization algorithm is used in statistics for finding maximum likelihood estimates of parameters in probabilistic models, where the model depends on unobserved latent variables....
     (EM algorithm)
  • Information theory
    Information theory

    Information theory is a branch of applied mathematics and electrical engineering involving the quantification of information. Historically, information theory was developed by Claude E....
  • Kalman filter
    Kalman filter

    The Kalman filter is an efficient recursive filter that estimates the state of a Linear system from a series of noise measurements. It is used in a wide range of engineering applications from radar to computer vision, and is an important topic in control theory and control systems engineering....
  • Least-squares spectral analysis
    Least-squares spectral analysis

    Least-squares spectral analysis is a method of estimating a frequency spectrum, based on a least squares fit of sinusoids to data samples, similar to Fourier analysis....
  • Markov chain Monte Carlo
    Markov chain Monte Carlo

    Markov chain Monte Carlo method methods , are a class of algorithms for sampling from probability distributions based on constructing a Markov chain that has the desired distribution as its Markov chain#Steady-state_analysis_and_limiting_distributions....
     (MCMC)
  • Matched filter
    Matched filter

    In telecommunications, a matched filter is obtained by cross-correlation a known signal , or template, with an unknown signal to detection the presence of the template in the unknown signal....
  • Maximum a posteriori
    Maximum a posteriori

    In statistics, the method of maximum a posteriori estimation theory can be used to obtain a point estimation of an unobserved quantity on the basis of empirical data....
     (MAP)
  • Maximum likelihood
    Maximum likelihood

    Maximum likelihood estimation is a popular statistics method used for fitting a mathematical model to data. The modeling of real world data using estimation by maximum likelihood offers a way of tuning the free parameters of the model to provide a good fit....
  • Maximum entropy spectral estimation
    Maximum entropy spectral estimation

    The maximum entropy method applied to spectral density estimation. The overall idea is that the maximum entropy rate stochastic process that satisfies the given constant autocorrelation and variance constraints, is a linear Gauss-Markov_process with i.i.d....
  • Method of moments
    Method of moments (statistics)

    In statistics, the method of moments is a method of estimation of population parameters such as mean, variance, median, etc. , by equating sample moment with unobservable population moments and then solving those equations for the quantities to be estimated....
    , generalized method of moments
    Generalized method of moments

    The generalized method of moments is a very general statistics method for obtaining point estimation of parameters of statistical models. It is a generalization, developed by Lars Peter Hansen, of the method of moments ....
  • Minimum mean squared error (MMSE)
  • Minimum variance unbiased estimator (MVUE)
  • Nuisance parameter
  • Parametric equation
    Parametric equation

    In mathematics, parametric equations are a method of defining a curve. A simple kinematics example is when one uses a time parameter to determine the position, velocity, and other information about a body in motion....
  • Particle filter
    Particle filter

    Particle filters, also known as sequential Monte Carlo methods , are sophisticated model estimation techniques based on simulation.They are usually used to estimate Bayesian models and are the Sequential estimation analogue of Markov chain Monte Carlo batch methods and are often similar to importance sampling methods....
  • Rao-Blackwell theorem
  • Spectral density
    Spectral density

    In statistical signal processing and physics, the spectral density, power spectral density , or energy spectral density , is a positive real function of a frequency variable associated with a stationary stochastic process, or a deterministic function of time, which has dimensions of power per Hz, or energy per Hz....
    , Spectral density estimation
    Spectral density estimation

    In statistical signal processing, the goal of spectral density estimation is to estimation theory the spectral density of a random signal from a sequence of time samples of the signal....
  • Statistical signal processing
    Statistical signal processing

    Statistical signal processing is an area of signal processing that treats signals as stochastic processes, dealing with their statistical properties ....
  • Sufficiency (statistics)
    Sufficiency (statistics)

    In statistics, sufficiency is the property possessed by a statistic, with respect to a parameter, "when no other statistic which can be calculated from the same sample provides any additional information as to the value of the parameter"....
  • Wiener filter
    Wiener filter

    In signal processing, the Wiener filter is a filter proposed by Norbert Wiener during the 1940s and published in 1949. Its purpose is to reduce the amount of noise present in a signal by comparison with an estimation of the desired noiseless signal....


Reference list

  • Mathematical Statistics and Data Analysis by John Rice. (ISBN 0-534-209343)
  • Fundamentals of Statistical Signal Processing: Estimation Theory by Steven M. Kay (ISBN 0-13-345711-7)
  • An Introduction to Signal Detection and Estimation by H. Vincent Poor (ISBN 0-387-94173-8)
  • Detection, Estimation, and Modulation Theory, Part 1 by Harry L. Van Trees (ISBN 0-471-09517-6; )
  • Optimal State Estimation: Kalman, H-infinity, and Nonlinear Approaches by Dan Simon