All Topics  
Statistical randomness

 

   Email Print
   Bookmark   Link






 

Statistical randomness



 
 
A numeric sequence is said to be statistically random when it contains no recognizable patterns or regularities; sequences such as the results of an ideal die roll
Dice

A die is a small polyhedron object, usually cubic, used for generating Statistical randomnesss or other symbols. This makes dice suitable as gambling devices, especially for craps or sic bo, or for use in non-gambling tabletop games....
, or the digits of p
Pi

Pi or p is a mathematical constant whose value is the ratio of any circle's circumference to its diameter in Euclidean geometry; this is the same value as the ratio of a circle's area to the square of its radius....
 exhibit statistical randomness.

Statistical randomness does not necessarily imply "true" randomness
Randomness

Randomness is a lack of order, purpose, Causality, or predictability. Randomness as defined by Aristotle is the situation, when a choice is to be made which has no logical component by which to determine or make the choice ....
, i.e., objective unpredictability. Pseudorandomness
Pseudorandomness

A pseudo random process is a process that appears randomness but is not. Pseudo random sequences typically exhibit statistical randomness while being generated by an entirely deterministic causal process....
 is sufficient for many uses.

A distinction is sometimes made between global randomness and local randomness.






Discussion
Ask a question about 'Statistical randomness'
Start a new discussion about 'Statistical randomness'
Answer questions from other users
Full Discussion Forum



Encyclopedia


A numeric sequence is said to be statistically random when it contains no recognizable patterns or regularities; sequences such as the results of an ideal die roll
Dice

A die is a small polyhedron object, usually cubic, used for generating Statistical randomnesss or other symbols. This makes dice suitable as gambling devices, especially for craps or sic bo, or for use in non-gambling tabletop games....
, or the digits of p
Pi

Pi or p is a mathematical constant whose value is the ratio of any circle's circumference to its diameter in Euclidean geometry; this is the same value as the ratio of a circle's area to the square of its radius....
 exhibit statistical randomness.

Statistical randomness does not necessarily imply "true" randomness
Randomness

Randomness is a lack of order, purpose, Causality, or predictability. Randomness as defined by Aristotle is the situation, when a choice is to be made which has no logical component by which to determine or make the choice ....
, i.e., objective unpredictability. Pseudorandomness
Pseudorandomness

A pseudo random process is a process that appears randomness but is not. Pseudo random sequences typically exhibit statistical randomness while being generated by an entirely deterministic causal process....
 is sufficient for many uses.

A distinction is sometimes made between global randomness and local randomness. Most philosophical conceptions of randomness are "global" — they are based on the idea that "in the long run" a sequence would look truly random, even if certain sequences would not look random (in a "truly" random sequence of numbers of sufficient length, for example, it is probable that there would be long sequences of nothing but zeros, though on the whole the sequence might be "random"). "Local" randomness refers to the idea that there can be minimum sequence lengths in which "random" distributions are approximated. Long stretches of the same digits, even those generated by "truly" random processes, would diminish the "local randomness" of a sample (it might only be locally random for sequences of 10,000 digits; taking sequences of less than 1,000 might not appear "random" at all, for example).

A sequence exhibiting a pattern is not thereby proved not statistically random. According to principles of Ramsey theory
Ramsey theory

Ramsey theory, named for Frank P. Ramsey, is a branch of mathematics that studies the conditions under which order must appear. Problems in Ramsey theory typically ask a question of the form: how many elements of some structure must there be to guarantee that a particular property will hold?...
, sufficiently large objects must necessarily contain a given structure ("complete disorder is impossible").

Legislation concerning gambling
Gambling

Gambling is the wikt:wager#Verb of money or something of material Value on an event with an uncertain outcome with the primary intent of winning additional money and/or material goods....
 imposes certain standards of statistical randomness to slot machine
Slot machine

A slot machine , fruit machine , or poker machine is a casino gambling machine with three or more reels which spin when a button is pushed....
s.

Contrast with algorithmic randomness.

Tests

The first tests for random numbers were published by M.G. Kendall and Bernard Babington Smith in the Journal of the Royal Statistical Society
Royal Statistical Society

The Royal Statistical Society is a learned society for statistics and a professional body for statisticians in the United Kingdom. It was founded in 1834 as the Statistical Society of London....
 in 1938. They were built on statistical tools such as Pearson's chi-square test
Pearson's chi-square test

Pearson's chi-square test is the best-known of several chi-square tests ? Statistics procedures whose results are evaluated by reference to the chi-square distribution....
 which were developed in order to distinguish whether or not experimental phenomena matched up with their theoretical probabilities (Pearson developed his test originally by showing that a number of dice experiments by W.F.R. Weldon did not display "random" behavior).

Kendall and Smith's original four tests were hypothesis tests
Statistical hypothesis testing

A statistical hypothesis test is a method of making statistical decisions using experimental data. It is sometimes called confirmatory data analysis, in contrast to exploratory data analysis....
, which took as their null hypothesis
Null hypothesis

In statistics, a null hypothesis is a concept which arises in the context of statistical hypothesis testing. A common convention is to use the symbol H0 to denote the null hypothesis....
 the idea that each number in a given random sequence had an equal chance of occurring, and that various other patterns in the data should be also distributed equiprobably.

  • The frequency test, was very basic: checking to make sure that there were roughly the same number of 0s, 1s, 2s, 3s, etc.


  • The serial test, did the same thing but for sequences of two digits at a time (00, 01, 02, etc.), comparing their observed frequencies with their hypothetical predictions were they equally distributed.
  • The poker test, tested for certain sequences of five numbers at a time (aaaaa, aaaab, aaabb, etc.) based on hands in the game poker
    Poker

    Poker is a family of card game that share betting rules and usually List of poker hands. Poker games differ in how the cards are dealt, how hands may be formed, whether the high or low hand wins the pot in a showdown , limits on bets and how many rounds of betting are allowed....
    .
  • The gap test, looked at the distances between 0s (00 would be a distance of 0, 030 would be a distance of 1, 02250 would be a distance of 3, etc.).


If a given sequence was able to pass all of these tests within a given degree of significance (generally 5%), then it was judged to be, in their words "locally random". Kendall and Smith differentiated "local randomness" from "true randomness" in that many sequences generated with truly random methods might not display "local randomness" to a given degree — very large sequences might contain many rows of a single digit. This might be "random" on the scale of the entire sequence, but in a smaller block it would not be "random" (it would not pass their tests), and would be useless for a number of statistical applications.

As random number sets became more and more common, more tests, of increasing sophistication were used. Some modern tests plot random digits as points on a three-dimensional plane, which can then be rotated to look for hidden patterns. In 1995, the statistician George Marsaglia
George Marsaglia

George Marsaglia is a mathematician and computer scientist. He is perhaps best known for establishing the lattice structure of congruential random number generators in the paper "Random numbers fall mainly in the planes",...
 created a set of tests known as the Diehard tests
Diehard tests

The diehard tests are a battery of statistical tests for measuring the quality of a set of random numbers. They were developed by George Marsaglia over several years and first published in 1995 on a CD-ROM of random numbers....
 which he distributes with a CD-ROM
CD-ROM

CD-ROM is a pre-pressed Compact Disc that contains Computer data storage accessible to, but not writable by, a computer. While the Compact Disc format was originally designed for music storage and playback, the 1985 Yellow Book standard developed by Sony and Philips adapted the format to hold any form of Binary file....
 of 5 billion pseudorandom numbers.

Pseudorandom number generators
Pseudorandom number generator

A pseudorandom number generator is an algorithm for generating a sequence of numbers that approximates the properties of random numbers. The sequence is not truly random in that it is completely determined by a relatively small set of initial values, called the PRNG's state. Although sequences that are closer to truly random can be gen...
 require tests as exclusive verifications for their "randomness" as they are decidedly not produced by "truly random" processes, but rather by deterministic algorithms. Over the history of random number generation, many sources of numbers thought to appear "random" under testing have later been discovered to be very non-random when subjected to certain types of tests. The notion of quasi-random numbers was developed in order to circumvent some of these problems, though pseudorandom number generators are still extensively used in many applications (even ones known to be extremely "non-random"), as they are "good enough" for most applications.

Other tests :
  • The Monobit test treats each output bit of the random number generator as a coin flip test, and determine if the observed number of heads and tails are close to the expected 50% frequency. The number of heads in a coin flip trail forms a Binomial distribution
    Binomial distribution

    In probability theory and statistics, the binomial distribution is the discrete probability distribution of the number of successes in a sequence of n statistical independence yes/no experiments, each of which yields success with probability p....
    .
  • The Wald-Wolfowitz runs test
    Wald-Wolfowitz runs test

    The runs test is a non-parametric statistic test that checks a randomness hypothesis for a two-valued data sequence. More precisely, it can be used to Statistical hypothesis testing that the elements of the sequence are mutually Statistical independence....
     tests for the number of bit transitions between 0 bits, and 1 bits, comparing the observed frequencies with expected frequency of a random bit sequence.
  • Information entropy
    Information entropy

    In information theory, entropy is a measure of the uncertainty associated with a random variable. The term by itself in this context usually refers to the Shannon entropy, which quantifies, in the sense of an expected value, the self-information contained in a message, usually in units such as bits....
  • Autocorrelation
    Autocorrelation

    Autocorrelation is a mathematical tool for finding repeating patterns, such as the presence of a periodic signal which has been buried under noise, or identifying the missing fundamental frequency in a signal implied by its harmonic frequencies....
     test
  • KS test
  • Maurer's Universal Statistical Test.


See also

  • Checking if a coin is fair
    Checking if a coin is fair

    In statistics, a fair coin is an idealized Statistical randomness with two states which are equally likely to occur. It is based on the ubiquitous coin flip used in sports and other situations where it is necessary to give two parties the same chance of winning....
  • Normal number
    Normal number

    In mathematics, a normal number is a real number whose digits in every radix show a uniform distribution , with all digits being equally likely, all pairs of digits equally likely, all triplets of digits equally likely, etc....
  • Randomness
    Randomness

    Randomness is a lack of order, purpose, Causality, or predictability. Randomness as defined by Aristotle is the situation, when a choice is to be made which has no logical component by which to determine or make the choice ....
  • Random number
    Random number

    Random number may refer to:* A number generated for or part of a set exhibiting statistical randomness.* A random sequence obtained from a stochastic process....
  • Statistical hypothesis testing
    Statistical hypothesis testing

    A statistical hypothesis test is a method of making statistical decisions using experimental data. It is sometimes called confirmatory data analysis, in contrast to exploratory data analysis....
  • One-time pad
    One-time pad

    In cryptography, the one-time pad is an encryption algorithm where the plaintext is combined with a random key or "pad" that is as long as the plaintext and used only once....
  • Randomness tests
    Randomness tests

    Randomness tests , in data evaluation, are used to analyze the distribution pattern of a set of data. In stochastic modeling, as in some computer simulations, the expected random input data can be verified to show that tests were performed using randomized data....


External links

  • : A free (GPL
    GNU General Public License

    The GNU General Public License is a widely used free software license, originally written by Richard Stallman for the GNU project. The GPL is the most popular and well-known example of the type of strong copyleft license that requires derived works to be available under the same copyleft....
    ) C
    C (programming language)

    C is a general-purpose computer programming language originally developed in 1972 by Dennis Ritchie at the Bell Telephone Laboratories to implement the Unix operating system....
     Random Number Test Suite.