All Topics  
Time series

 

   Email Print
   Bookmark   Link






 

Time series



 
 
In statistics
Statistics

Statistics is a Mathematics pertaining to the collection, analysis, interpretation or explanation, and presentation of data. It also provides tools for prediction and forecasting based on data....
, signal processing
Signal processing

Signal processing is the analysis, interpretation, and manipulation of signal . Signals of interest include: audio signal processing, , time-varying measurement values and sensor data, for example biological data such as electrocardiograms, control system signals, telecommunication transmission signals such as radio signals, and many others....
, and many other fields, a time series is a sequence of data point
Data point

In statistics, a data point is a single typed measurement. Here type is used in a way compatible with datatype in computing; so that the type of measurement can specify whether the measurement results in a Boolean value from , an integer or real number, or some vector space or array....
s, measured typically at successive times, spaced at (often uniform) time intervals. Time series analysis comprises methods that attempt to understand such time series, often either to understand the underlying context of the data points (where did they come from? what generated them?), or to make forecasts (predictions).






Discussion
Ask a question about 'Time series'
Start a new discussion about 'Time series'
Answer questions from other users
Full Discussion Forum



Encyclopedia


Random Data Plus Trend R2
In statistics
Statistics

Statistics is a Mathematics pertaining to the collection, analysis, interpretation or explanation, and presentation of data. It also provides tools for prediction and forecasting based on data....
, signal processing
Signal processing

Signal processing is the analysis, interpretation, and manipulation of signal . Signals of interest include: audio signal processing, , time-varying measurement values and sensor data, for example biological data such as electrocardiograms, control system signals, telecommunication transmission signals such as radio signals, and many others....
, and many other fields, a time series is a sequence of data point
Data point

In statistics, a data point is a single typed measurement. Here type is used in a way compatible with datatype in computing; so that the type of measurement can specify whether the measurement results in a Boolean value from , an integer or real number, or some vector space or array....
s, measured typically at successive times, spaced at (often uniform) time intervals. Time series analysis comprises methods that attempt to understand such time series, often either to understand the underlying context of the data points (where did they come from? what generated them?), or to make forecasts (predictions). Time series forecasting is the use of a model
Model (abstract)

In mathematical logic, the formal languages, formal systems, and theory which are studied have no meaningful content until they are given an interpretation within some other system....
 to forecast future events based on known past events: to forecast future data points before they are measured. A standard example in econometrics
Econometrics

Econometrics is concerned with the tasks of developing and applying quantitative or statistical methods to the study and elucidation of economic principles....
 is the opening price of a share of stock
STOCK

Software for fixed assets management and stock control developed in 2004. Stocktaking process is carried using a hand-held mobile terminal equipped with barcode reader or RFID technology....
 based on its past performance.

The term time series analysis is used to distinguish a problem, firstly from more ordinary data analysis problems (where there is no natural ordering of the context of individual observations), and secondly from spatial data analysis where there is a context that observations (often) relate to geographical locations. There are additional possibilities in the form of space-time models (often called spatial-temporal analysis). A time series model will generally reflect the fact that observations close together in time will be more closely related than observations further apart. In addition, time series models will often make use of the natural one-way ordering of time so that values in a series for a given time will be expressed as deriving in some way from past values, rather than from future values (see time reversibility
Time reversibility

Time reversibility is an attribute of some stochastic process and some deterministic processes.If a stochastic process is time reversible, then it is not possible to determine, given the states at a number of points in time after running the stochastic process, which state came first and which state arrived later....
.)

Methods for time series analyses are often divided into two classes: frequency-domain methods and time-domain methods. The former centre around spectral analysis
Spectral analysis

Spectral analysis or Spectrum analysis may refer to:* Spectrum analysis in chemistry and physics, a method of analyzing the chemical properties of matter from bands in their optical spectrum...
 and recently wavelet analysis, and can be regarded as model-free analyses well-suited to exploratory investigations. Time-domain methods have a model-free subset consisting of the examination of auto-correlation and cross-correlation
Cross-correlation

In signal processing, cross-correlation is a measure of similarity of two waveforms as a function of a time-lag applied to one of them. This is also known as a sliding dot product or inner-product....
 analysis, but it is here that partly and fully-specified time series models make their appearance.

Time series analyses

There are several types of data analysis available for time series which are appropriate for different purposes.

General exploration

  • Graphical examination of data series
  • Autocorrelation
    Autocorrelation

    Autocorrelation is a mathematical tool for finding repeating patterns, such as the presence of a periodic signal which has been buried under noise, or identifying the missing fundamental frequency in a signal implied by its harmonic frequencies....
     analysis to examine serial dependence
  • Spectral analysis
    Spectral analysis

    Spectral analysis or Spectrum analysis may refer to:* Spectrum analysis in chemistry and physics, a method of analyzing the chemical properties of matter from bands in their optical spectrum...
     to examine cyclic behaviour which need not be related to seasonality


Description

  • Separation into components representing trend, seasonality, slow and fast variation, cyclical irregular: see Decomposition of time series
  • Simple properties of marginal distribution
    Marginal distribution

    In probability theory and statistics, the marginal distribution of a subset of a collection of random variables is the probability distribution of the variables contained in the subset....
    s


Prediction and forecasting

  • Fully-formed statistical models for stochastic simulation
    Stochastic simulation

    Stochastic simulation algorithms and methods were initially developed to analyse chemical reactions involving large numbers of species with complex reaction kinetics....
     purposes, so as to generate alternative versions of the time series, representing what might happen over non-specific time-periods in the future (prediction).
  • Simple or fully-formed statistical models to describe the likely outcome of the time series in the immediate future, given knowledge of the most recent outcomes (forecasting).


Time series models

As shown by Box and Jenkins
Box-Jenkins

In econometrics, the Box-Jenkins methodology, named after the statisticians George Box and Gwilym Jenkins, applies autoregressive moving average Autoregressive moving average or ARIMA models to find the best fit of a time series to past values of this time series, in order to make forecasts....
, models for time series data can have many forms and represent different stochastic processes. When modeling variations in the level of a process, three broad classes of practical importance are the autoregressive (AR) models, the integrated (I) models, and the moving average
Moving average model

In time series analysis, the moving average model is common approach for modeling univariate time series models. The notation MA refers to the moving average model of order q:...
 (MA) models. These three classes depend linearly on previous data points. Combinations of these ideas produce autoregressive moving average (ARMA) and autoregressive integrated moving average
Autoregressive integrated moving average

In statistics and econometrics, and in particular in time series analysis, an autoregressive integrated moving average model is a generalisation of an autoregressive moving average or model....
 (ARIMA) models. The autoregressive fractionally integrated moving average
Autoregressive fractionally integrated moving average

In statistics, autoregressive fractionally integrated moving average models are time series models that generalize ARIMA models by allowing non-integer values of the differencing parameter and are useful in modeling time series with long memory....
 (ARFIMA) model generalizes the former three. Extensions of these classes to deal with vector-valued data are available under the heading of multivariate time-series models and sometimes the preceding acronyms are extended by including an initial "V" for "vector". An additional set of extensions of these models is available for use where the observed time-series is driven by some "forcing" time-series (which may not have a causal effect on the observed series): the distinction from the multivariate case is that the forcing series may be deterministic or under the experimenter's control. For these models, the acronyms are extended with a final "X" for "exogenous".

Non-linear dependence of the level of a series on previous data points is of interest, partly because of the possibility of producing a chaotic
Chaos theory

In mathematics, chaos theory describes the behavior of certain dynamical system s ? that is, systems whose states evolve with time ? that may exhibit dynamics that are highly sensitive to initial conditions ....
 time series. However, more importantly, empirical investigations can indicate the advantage of using predictions derived from non-linear models, over those from linear models.

Among other types of non-linear time series models, there are models to represent the changes of variance along time (heteroskedasticity
Heteroskedasticity

In statistics, a sequence or a vector of random variables is heteroskedastic, or heteroscedastic, if the random variables have different variances....
). These models are called autoregressive conditional heteroskedasticity
Autoregressive conditional heteroskedasticity

In econometrics,an autoregressive conditional heteroscedasticity model considers the variance of the current error term to be a function of the variances of the previous time period's error terms....
 (ARCH) and the collection comprises a wide variety of representation (GARCH, TARCH, EGARCH, FIGARCH, CGARCH, etc). Here changes in variability are related to, or predicted by, recent past values of the observed series. This is in contrast to other possible representations of locally-varying variability, where the variability might be modelled as being driven by a separate time-varying process, as in a doubly stochastic model.

In recent work on model-free analyses, wavelet transform based methods (for example locally stationary wavelets and wavelet decomposed neural networks) have gained favor. Multiscale (often referred to as multiresolution) techniques decompose a given time series, attempting to illustrate time dependence at multiple scales.

Notation

A number of different notations are in use for time-series analysis:

X =


is a common notation which specifies a time series X which is indexed by the natural number
Natural number

In mathematics, a natural number can mean either an element of the Set = *n = = ? = ? ...
s. Another common notation is:

Y =


Conditions

There are two sets of conditions under which much of the theory is built:
  • Stationary process
    Stationary process

    In the mathematics, a stationary process is a stochastic process whose joint probability distribution does not change when shifted in time or space....
  • Ergodicity


However, ideas of stationarity must be expanded to consider two important ideas: strict stationarity and second-order stationarity. Both models and applications can be developed under each of these conditions, although the models in the latter case might be considered as only partly specified.

In addition, time-series analysis can be applied where the series are seasonally stationary and non-stationary.

Models

The general representation of an autoregressive model, well-known as AR(p), is



where the term et is the source of randomness and is called white noise
White noise

White noise is a random signal with a flat power spectral density. In other words, the signal contains equal power within a fixed bandwidth at any center frequency....
. It is assumed to have the following characteristics:

1.

2.

3.

With these assumptions, the process is specified up to second-order moments and, subject to conditions on the coefficients, may be second-order stationary.

If the noise also has a normal distribution
Normal distribution

The normal distribution, also called the Gaussian distribution, is an important family of continuous probability distributions, applicable in many fields....
, it is called normal white noise (denoted here by Normal-WN):



In this case the AR process may be strictly stationary, again subject to conditions on the coefficients.

Related tools

Tools for investigating time-series data include:
  • Consideration of the autocorrelation function
    Autocorrelation

    Autocorrelation is a mathematical tool for finding repeating patterns, such as the presence of a periodic signal which has been buried under noise, or identifying the missing fundamental frequency in a signal implied by its harmonic frequencies....
     and the spectral density function
    Spectral density

    In statistical signal processing and physics, the spectral density, power spectral density , or energy spectral density , is a positive real function of a frequency variable associated with a stationary stochastic process, or a deterministic function of time, which has dimensions of power per Hz, or energy per Hz....
     (also cross-correlation functions and cross-spectral density functions)
  • Performing a Fourier transform
    Fourier transform

    In mathematics, Fourier analysis is a subject area which grew out of the study of Fourier series. The subject began with trying to understand when it was possible to represent general functions by sums of simpler trigonometric functions....
     to investigate the series in the frequency domain
    Frequency domain

    In electronics and control systems engineering, frequency domain is a term used to describe the analysis of mathematical functions or Signal with respect to frequency, rather than time....
    .
  • Use of a filter
    Digital filter

    In electronics, computer science and mathematics, a digital filter is a system that performs mathematical operations on a Sampling , discrete-time Signal to reduce or enhance certain aspects of that signal....
     to remove unwanted noise.
  • Principal components analysis
    Principal components analysis

    Principal component analysis involves a mathematical procedure that transforms a number of possibly correlated variables into a smaller number of uncorrelated variables called principal components....
     (or empirical orthogonal function analysis)
  • Singular spectrum analysis
    Singular Spectrum Analysis

    The singular spectrum analysis technique is a powerful technique of time series analysis incorporating the elements of classical time series analysis, multivariate statistics, multivariate geometry, dynamical systems and signal processing....
  • Artificial neural network
    Artificial neural network

    An artificial neural network , often just called a "neural network" , is a mathematical model or computational model based on biological neural networks....
    s
  • time-frequency analysis techniques:
    Time-frequency representation

    A time-frequency representation is a view of a signal processing represented over both time and frequency. Time-frequency analysis means analysis of a TFR....
    • Continuous wavelet transform
      Continuous wavelet transform

      A continuous wavelet transform is used to divide a continuous-time function into wavelets. Unlike Fourier transform, the continuous wavelet transform possesses the ability to construct a time-frequency representation of a signal that offers very good time and frequency localization....
    • Short-time Fourier transform
      Short-time Fourier transform

      The short-time Fourier transform , or alternatively short-term Fourier transform, is a List of Fourier-related transforms used to determine the sinusoidal frequency and phase content of local sections of a signal as it changes over time....
    • Chirplet transform
      Chirplet transform

      In signal processing, the chirplet transform is an inner product of an input signal with a family of analysis primitives called chirplets....
    • Fractional Fourier transform
      Fractional Fourier transform

      In mathematics, in the area of harmonic analysis, the fractional Fourier transform is a linear transformation generalizing the Fourier transform....
  • Chaotic analysis
    Chaos theory

    In mathematics, chaos theory describes the behavior of certain dynamical system s ? that is, systems whose states evolve with time ? that may exhibit dynamics that are highly sensitive to initial conditions ....
    • Correlation dimension
      Correlation dimension

      In chaos theory the correlation dimension is a measure of the dimensionality of the space occupied by a set of random points, often referred to as a type of fractal dimension)....
    • Recurrence plot
      Recurrence plot

      In descriptive statistics and chaos theory, a recurrence plot is a plot showing, for a given moment in time, the times at which a phase space trajectory visits roughly the same area in the phase space....
      s
    • Recurrence quantification analysis
      Recurrence quantification analysis

      Recurrence quantification analysis is a method of nonlinear data analysis for the investigation of dynamical systems. It quantifies the number and duration of recurrences of a dynamical system presented by its phase space trajectory....
    • Lyapunov exponent
      Lyapunov exponent

      In mathematics the Lyapunov exponent or Lyapunov characteristic exponent of a dynamical system is a quantity that characterizes the rate of separation of infinitesimally close trajectory....
      s


See also


External links

  • - an open source book on time series analysis with SAS
  • - A practical guide to Time series analysis
  • product performs online analysis and prediction of Chaotic time series. Access is provided free online via a web service and graphic interface.