All Topics  
Power law

 

   Email Print
   Bookmark   Link

 

Power law


 
 


A power law is any polynomialPolynomial

In mathematics, a polynomial is an expression in which a finite number of constants and variables are combined using only ad...
 relationship that exhibits the property of scale invarianceScale invariance

In physics and mathematics, scale invariance is a feature of objects or laws that do not change if length scales are multipl...
. The most common power laws relate two variables and have the form

where and are constants, and is of . Here, is typically called the scaling exponent, the word "scaling" denoting the fact that a power-law function satisfies where is a constant. That is, a rescaling of the function's argument changes the constant of proportionality but preserves the shape of the function itself. This point becomes clearer if we take the logarithmLogarithm Overview

The logarithm is the mathematical operation that is the inverse of exponentiation ....
 of both sides:

Notice that this expression has the form of a linear relationshipLine (mathematics)

A line, or straight line, can be described as an infinitely thin, infinitely long, perfectly straight curve....
 with slope . Rescaling the argument produces a linear shift of the function up or down but leaves both the basic form and the slope unchanged.

Power-law relations characterize a staggering number of naturally occurring phenomena, and this is one of the principal reasons why they have attracted interest. For instance, inverse-square lawInverse-square law

In physics, an inverse-square law is any physical law stating that some physical quantity or strength is inversely proportio...
s, such as gravitationGravitation

In physics, gravitation or gravity is the tendency of objects with mass to accelerate toward each other....
 and the Coulomb force, are power laws, as are many common mathematical formulae such as the quadratic law of area of the circleCircle Overview

In Euclidean geometry, a circle is the set of all points in a plane at a fixed distance, called the radius, from a fixed poi...
. However it is mainly in the the study of probability distributions that power laws have attracted recent interest. A wide variety of observed probability distributions appear, at least approximately, to have tails asymptotically following power-law forms, an observation connected closely with the study of theory of large deviationsExtreme value theory

Extreme value theory is a branch of statistics dealing with the extreme deviations from the median of probability distributi...
 (also called extreme value theoryFacts About Extreme value theory

Extreme value theory is a branch of statistics dealing with the extreme deviations from the median of probability distributi...
), which considers the frequency of extremely rare events like stock market crashesStock market crash

A stock market crash is a sudden dramatic decline of stock prices across a significant cross-section of a market....
 and large natural disastersNatural disaster

A natural disaster is the consequence of the combination of a natural hazard and human activities....
. It is primarily in the study of statistical distributions that the name "power law" is used; in other areas the power-law functional form is more often referred to simply as a polynomial form or polynomial function.

Scientific interest in power law relations also derives from the ease with which certain general classes of mechanisms can generate them, so that the observation of a power-law relation in data often points to specific kinds of mechanisms that might underly the natural phenomenon in question, and can indicate a deep connection with other, seemingly unrelated systems (see the reference by Simon and the subsection on universality below). The ubiquity of power-law relations in physics is partly due to dimensional constraintsFacts About Dimensional analysis

Dimensional analysis is a conceptual tool often applied in physics, chemistry, and engineering to understand physical situat...
, while in complex systemsComplex systems Summary

Complex systems is a scientific field, which studies the common properties of systems considered complex in nature, society ...
, power laws are often thought to be signatures of hierarchy or of specific stochastic processes. A few notable examples of power laws are the Gutenberg-Richter lawGutenberg-Richter law

In seismology, the Gutenberg-Richter law states that the number of earthquakes per year of Richter magnitude M statistic...
 for earthquake sizes, Pareto's lawPareto principle Summary

The Pareto principle states that for many phenomena, 80% of the consequences stem from 20% of the causes....
 of income distribution, structural self-similarity of fractals, and scaling laws in biological systemsAllometric law

Allometric law describes the relationship between the body parts or processes within or among living organisms, usually expr...
. Research on the origins of power-law relations, and efforts to observe and validate them in the real world, is an active topic of research in many fields of science, including physicsPhysics

Physics , the most fundamental physical science, is concerned with the underlying principles of the natural world....
, computer scienceComputer science

Computer science, or computing science, is the study of the theoretical foundations of information and computation and...
, linguisticsLinguistics

Linguistics is the scientific study of human language....
, geophysicsGeophysics Summary

Geophysics, the study of the earth by quantitative physical methods, especially by seismic, electromagnetic, and radioactivi...
, sociologySociology

Sociology is the study of society and human social action....
, economicsEconomics

In the social sciences, economics is the study of the production, distribution, and consumption of goods and services.....
 and more.

Properties of power laws

Scale invariance

The main property of power laws that makes them interesting is their scale invarianceScale invariance

In physics and mathematics, scale invariance is a feature of objects or laws that do not change if length scales are multipl...
. Given a relation , or, indeed any homogeneous polynomialHomogeneous polynomial

In mathematics, a homogeneous polynomial is a polynomial whose terms are monomials all having the same total degree; or are ...
, scaling the argument by a constant factor causes only a proportionate scaling of the function itself. That is,

That is, scaling by a constant simply multiplies the original power-law relation by the constant . Thus, it follows that all power laws with a particular scaling exponent are equivalent up to constant factors, since each is simply a scaled version of the others. This behavior is what produces the linear relationship when both logarithms are taken of both and , and the straight-line on the log-log plot is often called the signature of a power law. Notably, however, with real data, such straightness is necessary, but not a sufficient condition for the data following a power-law relation. In fact, there are many ways to generate finite amounts of data that mimic this signature behavior, but, in their asymptotic limit, are not true power laws. Thus, accurately fitting and validating power-law models is an active area of research in statisticsStatistics

Statistics is a mathematical science pertaining to the collection, analysis, interpretation, and presentation of data....
.

Universality

The equivalence of power laws with a particular scaling exponent can have a deeper origin in the dynamical processes that generate the power-law relation. In physics, for example, phase transitionPhase transition

In physics, a phase transition or phase change is the transformation of a thermodynamic system from one phase to anoth...
s in thermodynamic systems are associated with the emergence of power-law distributions of certain quantities, whose exponents are referred to as the critical exponentFacts About Critical exponent

Critical exponents are observed in second-order phase transitions....
s of the system. Diverse systems with the same critical exponents — that is, which display identical scaling behaviour as they approach criticalityCritical point (thermodynamics)

In physical chemistry, thermodynamics, chemistry and condensed matter physics, a critical point, also called a critical s...
 — can be shown, via renormalization groupRenormalization group

In theoretical physics, renormalization group refers...
 theory, to share the same fundamental dynamics. For instance, the behavior of water and CO2 at their boiling points fall in the same universality class because they have identical critical exponents. In fact, almost all material phase transitions are described by a small set of universality classes. Similar observations have been made, though not as comprehensively, for various self-organized criticalSelf-organized criticality

In physics, self-organized criticality is a property of dynamical systems which have a critical point as an attractor....
 systems, where the critical point of the system is an attractorAttractor

In dynamical systems, an attractor is a set to which the system evolves after a long enough time....
. Formally, this sharing of dynamics is referred to as universalityUniversality (dynamical systems)

In statistical mechanics, universality is the observation that there are properties for a large class of systems that are in...
, and systems with precisely the same critical exponents are said to belong to the same universality classFacts About Renormalization group

In theoretical physics, renormalization group refers...
.

Power-law functions

The general power-law function follows the polynomial form given above, and is a ubiquitous form throughout mathematics and science. Notably, however, not all polynomial functions are power laws because not all polynomials exhibit the property of scale invariance. Typically, power-law functions are polynomials in a single variable, and are explicitly used to model the scaling behavior of natural processes. For instance, allometric scaling lawsAllometric law

Allometric law describes the relationship between the body parts or processes within or among living organisms, usually expr...
 for the relation of biological variables are some of the best known power-law functions in nature. In this context, the term is most typically replaced by a deviation term , which can represent uncertainty in the observed values (perhaps measurement or sampling errors) or provide a simple way for observations to deviate from the no power-law function (perhaps for stochasticStochastic process

In the mathematics of probability, a stochastic process is a random function....
 reasons):

Estimating the exponent from empirical data

There are many methods for fitting power-law functions to data, and the best option typically depends strongly on the kind of question being asked. For instance, prediction-type questions should rely on nonlinear regressionNonlinear regression

In statistics, nonlinear regression is the problem of fitting a model...
, while descriptive-type summary questions, such as those found in allometryAllometry

Allometry is the science studying the differential growth rates of the parts of a living organism's body part or process....
, should use a method that allows for uncertainty in both the and measurements. If the residuals are log normally distributed, e.g. if the spread in is multiplicative (increasing proportionally with ), a simple least-squares linear regressionLinear regression

In statistics, linear regression is a method of estimating the conditional expected value of one variable y given the va...
 on log-transformed data can be performed, since the log transformed residues are normally distributed after transformation. Otherwise, the logarithmic transformation produces residuals that are log-normally distributed, while the least squares method requires normally distributed errors. In this latter context, the method of standardized major axis (SMA) regression (sometimes called reduced major axis, but this term should be avoided) is preferred.

The major axis is the linear equation that minimizes the sum of squares of the shortest (perpendicular) distance between data points and the equation. This axis is equivalent to the first principal component axis of the covariance matrixCovariance matrix

In statistics and probability theory, the covariance matrix is a matrix of covariances between elements of a vector....
. From this observation, the estimatorEstimator

In statistics, an estimator is a function of the known sample data that is used to estimate an unknown population parameter;...
 for the slope can be derived

where and are the sample means of the and data, respectively.

More about this method, and the conditions under which it can be used, can be found in the Warton reference below. Further, Warton's comprehensive review article also provides (C++, R, and Matlab) for estimation and testing routines for power-law functions.

Examples of power law functions

  • The Stefan-Boltzmann lawStefan-Boltzmann law

    The Stefan-Boltzmann law, also known as Stefan's law, states that the total energy radiated per unit surface area of a...
  • The Gompertz Law of MortalityGompertz-Makeham law of mortality

    The Gompertz-Makeham law states that death rate is a sum of age-independent component and age-dependent component, which inc...
  • The Ramberg-Osgood stress-strain relationshipRamberg-Osgood relationship

    The Ramberg-Osgood equation was created to describe the relationship between stress and strain—that is, the stress-str...
  • The Inverse-square lawInverse-square law

    In physics, an inverse-square law is any physical law stating that some physical quantity or strength is inversely proportio...
     of Newtonian gravity
  • The Initial mass functionInitial mass function

    The initial mass function is a relationship that specifies the mass distribution of a newly formed stellar population, by gi...
  • Gamma correctionGamma correction

    Gamma correction, gamma nonlinearity, gamma encoding, or often simply gamma, is the name of a nonlinear op...
     relating light intensity with voltage
  • Kleiber's lawKleiber's law

    Kleiber's law, named after Max Kleiber's biological work in the early 1930s, is the observation that, for the vast majority ...
     relating animal metabolism to size, and allometric lawAllometric law Summary

    Allometric law describes the relationship between the body parts or processes within or among living organisms, usually expr...
    s in general
  • Behaviour near second-order phase transitionsPhase transition

    In physics, a phase transition or phase change is the transformation of a thermodynamic system from one phase to anoth...
     involving critical exponentCritical exponent

    Critical exponents are observed in second-order phase transitions....
    s
  • Proposed form of experience curve effectsExperience curve effects

    The learning curve effect and the closely related experience curve effect express the relationship between experience ...
  • The differential energy spectrum of cosmic-ray nuclei
  • Inverse-square lawInverse-square law

    In physics, an inverse-square law is any physical law stating that some physical quantity or strength is inversely proportio...
  • Square-cube lawSquare-cube law

    The square-cube law is a principle, drawn from the mathematics of proportion, that is applied in engineering and biomechanic...
  • Constructal law
  • FractalFractal Overview

    In colloquial usage, a fractal is a shape that is recursively constructed or self-similar, that is, a shape that appears similar a...
    s

Power-law distributions

A power-law distribution is any that, in the most general sense, has the form

where , and is a slowly varying function, which is any function that satisfies with constant. This property of follows directly from the requirement that be asymptotically scale invariant; thus, the form of only controls the shape and finite extent of the lower tail. For instance, if is the constant function, then we have a power-law that holds for all values of . In many cases, it is convenient to assume a lower bound from which the law holds. Combining these two cases, and where is a continuous variable, the power law has the form

where the constant is necessary to guarantee that the distribution is properly normalized. Briefly, we can consider several properties of this distribution.

In general, the moments of this distribution are given by

which is only well defined for . That is, all moments diverge: when , the average and all higher-order moments are infinite; when , the mean exists, but the variance and higher-order moments are infinite, etc. For finite-size samples drawn from such distribution, this behavior implies that the central moment estimators (like the mean and the variance) for diverging moments will never converge - as more data is accumulated, they continue to grow.

Another kind of power-law distribution, which does not satisfy the general form above, is the power law with an exponential cutoff

where we introduce an exponential decay term that overwhelms the power-law behavior at large values of . This distribution does not scale and is thus not asymptotically a power law; however, it does approximately scale over a finite region before the cutoff. (Note that the pure form above is a subset of this family, with .) This distribution is a common alternative to the asymptotic power-law distribution because it naturally captures finite-size effects. For instance, although the Gutenberg-Richter LawGutenberg-Richter law

In seismology, the Gutenberg-Richter law states that the number of earthquakes per year of Richter magnitude M statistic...
 is commonly cited as an example of a power-law distribution, the distribution of earthquake magnitudes cannot scale as a power law in the limit because there is a finite amount of energy in the Earth's crust. Thus, there must be some maximum size earthquake, and the scaling behavior must taper off as it approaches this size.

Plotting power-law distributions

In general, power-law distributions are plotted on doubly logarithmic axesLog-log graph

In science and engineering, a log-log graph or log-log plot is a way of visualizing data that is changing with a power...
, which emphasizes the upper tail region. The most convenient way to do this is via the (complementary) cumulative distributionCumulative distribution function

In probability theory, the cumulative distribution function completely describes the probability distribution of a real-val...
 (cdf), ,

Note that the cdf is also a power-law function, but with a smaller scaling exponent. For data, an equivalent form of the cdf is the rank-frequency approach, in which we first sort the observed values in ascending order, and plot them against the vector .

Although it can be convenient to log-bin the data, or otherwise smooth the probability density (mass) function directly, these methods introduce an implicit bias in the representation of the data, and thus should be avoided. The cdf, on the other hand, introduces no bias in the data and preserves the linear signature on doubly logarithmic axes.

Estimating the exponent from empirical data

There are many ways of estimating the value of the scaling exponent for a power-law tail, however not all of them yield unbiased and consistent answersFacts About Maximum likelihood

Maximum likelihood estimation is a popular statistical method used to make inferences about parameters of the underlying pro...
. The most reliable techniques are often based on the method of maximum likelihood. Alternative methods are often based on making a linear regression on either the log-log probability, the log-log cumulative distribution function, or on log-binned data, but these approaches should be avoided as they can all lead to highly biased estimates of the scaling exponent (see the Clauset et al. reference below).

For real-valued data, we fit a power-law distribution of the form

to the data . Given a choice for , a simple derivation by this method yields the estimator equation

where are the data points . (For a more detailed derivation, see Hall or Newman below.) This estimator exhibits a small finite sample-size bias of order , which is small when n > 100. Further, the uncertainty in the estimation can be derived from the maximum likelihood argument, and has the form . This estimator is equivalent to the popular Hill estimator from quantitative finance and extreme value theoryExtreme value theory

Extreme value theory is a branch of statistics dealing with the extreme deviations from the median of probability distributi...
.

For a set of n integer-valued data points , again where each , the maximum likelihood exponent is the solution to the transcendental equation

where is the incomplete zeta functionRiemann zeta function

In mathematics, the Riemann zeta function, named after Bernhard Riemann, is a function of significant importance in number t...
. The uncertainty in this estimate follows the same formula as for the continuous equation. However, the two equations for are not equivalent, and the continuous version should not be applied to discrete data, nor vice versa.

Further, both of these estimators require the choice of . For functions with a non-trivial function, choosing too small produces a significant bias in , while choosing it too small increases the uncertainty in , and reduces the statistical powerStatistical power

The power of a statistical test is the probability that the test will reject a false null hypothesis, or in other words tha...
 of our model. In general, the optimum choice of depends strongly on the particular form of the lower tail, represented by above.

More about these methods, and the conditions under which they can be used, can be found in the Clauset et al. reference below. Further, this comprehensive review article provides (Matlab and R) for estimation and testing routines for power-law distributions.

Examples of power-law distributions

  • Pareto distributionPareto distribution

    The Pareto distribution, named after the Italian economist Vilfredo Pareto, is a power law probability distribution found in...
     (continuous)
  • Zeta distributionFacts About Zeta distribution

    In probability theory and statistics, the zeta distribution is a discrete probability distribution....
     (discrete)
  • Yule–Simon distribution (discrete)
  • Student's t-distributionStudent's t-distribution

    In probability and statistics, the t-distribution or Student's t-distribution is a probability distribution that a...
     (continuous), of which the Cauchy distributionCauchy distribution

    The Cauchy-Lorentz distribution, named after Augustin Cauchy and Hendrik Lorentz, is a continuous probability distribution w...
     is a special case
  • Zipf's lawZipf's law

    Originally, Zipf's law stated that, in a corpus of natural language utterances, the frequency of any word is roughly inverse...
     and its generalization, the Zipf-Mandelbrot lawZipf-Mandelbrot law

    In probability theory and statistics, the Zipf-Mandelbrot law is a discrete probability distribution....
     (discrete)
  • The scale-free networkScale-free network Overview

    A scale-free network is a specific kind of complex network that has attracted attention since many real-world networks fall ...
     model
  • BibliogramBibliogram

    A bibliogram is a verbal construct made when noun phrases from extended stretches of text are ranked high to low by their fr...
    s
  • Gutenberg-Richter lawGutenberg-Richter law

    In seismology, the Gutenberg-Richter law states that the number of earthquakes per year of Richter magnitude M statistic...
     of earthquakeEarthquake

    An earthquake is a phenomenon that results from and is powered by the sudden release of stored energy that radiates seismic ...
     magnitudes
  • HortonRobert E. Horton

    Robert Elmer Horton was an American ecologist and soil scientist, considered by many to be the father of modern hydrology....
    's laws describing river systems
  • Richardson's Law for the severity of violent conflicts (wars and terrorism)
  • population of cities
  • numbers of religious adherents
  • net worth of individuals
  • frequency of words in a text


A great many power-law distributions have been conjectured in recent years. For instance, power laws are thought to characterize the behavior of the upper tails for the popularity of websites, number of species per genus, the popularity of given namesGiven name

A given name is a word which specifies and differentiates between members of a group of individuals, especially a family, al...
, the size of financial returns, and many others. However, much debate remains as to which of these tails are actually power-law distributed and which are not. For instance, it is commonly accepted now that the famous Gutenberg-Richter LawGutenberg-Richter law Summary

In seismology, the Gutenberg-Richter law states that the number of earthquakes per year of Richter magnitude M statistic...
 decays more rapidly than a pure power-law tail because of a finite exponential cutoff in the upper tail.

Validating power laws

Although power-law relations are attractive for many theoretical reasons, demonstrating that data do indeed follow a power-law relation requires more than simply fitting such a model to the data. In general, many alternative functional forms can appear to follow a power-law form for some extent. Thus, the preferred method for validation of power-law relations is by testing many orthogonal predictions of a particular generative mechanism against data, and not simply fitting a power-law relation to a particular kind of data. As such, the validation of power-law claims remains a very active field of research in many areas of modern science.

See also

Bibliography



External links


  • by Benoit Mandelbrot & Nassim Nicholas Taleb. Fortune, July 11, 2005.
  • power-law distributions in homelessness and other social problems; by Malcolm Gladwell. The New Yorker, February 13, 2006.
  • Benoit Mandelbrot & Richard Hudson: The Misbehaviour of Markets (2004)
  • Philip Ball: (2005)
  • from
  • br>