All Topics  
Algorithmic information theory

 

   Email Print
   Bookmark   Link






 

Algorithmic information theory



 
 
Algorithmic information theory is a subfield of information theory
Information theory

Information theory is a branch of applied mathematics and electrical engineering involving the quantification of information. Historically, information theory was developed by Claude E....
 and computer science
Computer science

Computer science is the study of the theoretical foundations of information and computation, and of practical techniques for their implementation and application in computer systems....
 that concerns itself with the relationship between computation
Theory of computation

The theory of computation is the branch of computer science that deals with whether and how efficiently problems can be solved on a model of computation, using an algorithm....
 and information
Information

Information as a Conveyed concept has a diversity of meanings, from everyday usage to technical settings. Generally speaking, the concept of information is closely related to notions of constraint, communication, control system, data, form, instruction, knowledge, Meaning , stimulation, pattern, perception, and knowledge representation....
. According to Gregory Chaitin
Gregory Chaitin

Gregory John Chaitin is an Argentina-United States mathematician and computer scientist.Beginning in the late 1960s, Chaitin made contributions to algorithmic information theory and metamathematics, in particular a new incompleteness theorem in reaction to G?del's incompleteness theorem....
, it is "the result of putting Shannon's information theory and Turing
Alan Turing

Alan Mathison Turing, Order of the British Empire, Fellow of the Royal Society was a British mathematician, logician and Cryptanalysis....
's computability theory into a cocktail shaker and shaking vigorously."

rithmic information theory principally studies complexity measures on string
String (computer science)

In computer programming and some branches of mathematics, a string is an ordered sequence of symbols. These symbols are chosen from a predetermined set or alphabet....
s (or other data structure
Data structure

A data structure in computer science is a way of storing data in a computer so that it can be used efficiently. It is an organization of mathematical and logical concepts of data....
s). Because most mathematical objects can be described in terms of strings, or as the limit
Limit of a sequence

The limit of a sequence is one of the oldest concepts in mathematical analysis. It provides a rigorous definition of the idea of a sequence converging towards a point called the limit....
 of a sequence
Sequence

In mathematics, a sequence is an ordered list of objects . Like a Set , it contains Element , and the number of terms is called the length of the sequence....
 of strings, it can be used to study a wide variety of mathematical objects, including integer
Integer

The integers are natural numbers including 0 and their negative and non-negative numberss . They are numbers that can be written without a fractional or decimal component, and fall within the set ....
s and real number
Real number

In mathematics, the real numbers may be described informally in several different ways. The real numbers include both rational numbers, such as 42 and −23/129, and irrational numbers, such as pi and the square root of two; or, a real number can be given by an infinite decimal representation, such as 2.4871773339...., where the digits co...
s.

This use of the term "information" might be a bit misleading, as it depends upon the concept of compressibility.






Discussion
Ask a question about 'Algorithmic information theory'
Start a new discussion about 'Algorithmic information theory'
Answer questions from other users
Full Discussion Forum



Encyclopedia


Algorithmic information theory is a subfield of information theory
Information theory

Information theory is a branch of applied mathematics and electrical engineering involving the quantification of information. Historically, information theory was developed by Claude E....
 and computer science
Computer science

Computer science is the study of the theoretical foundations of information and computation, and of practical techniques for their implementation and application in computer systems....
 that concerns itself with the relationship between computation
Theory of computation

The theory of computation is the branch of computer science that deals with whether and how efficiently problems can be solved on a model of computation, using an algorithm....
 and information
Information

Information as a Conveyed concept has a diversity of meanings, from everyday usage to technical settings. Generally speaking, the concept of information is closely related to notions of constraint, communication, control system, data, form, instruction, knowledge, Meaning , stimulation, pattern, perception, and knowledge representation....
. According to Gregory Chaitin
Gregory Chaitin

Gregory John Chaitin is an Argentina-United States mathematician and computer scientist.Beginning in the late 1960s, Chaitin made contributions to algorithmic information theory and metamathematics, in particular a new incompleteness theorem in reaction to G?del's incompleteness theorem....
, it is "the result of putting Shannon's information theory and Turing
Alan Turing

Alan Mathison Turing, Order of the British Empire, Fellow of the Royal Society was a British mathematician, logician and Cryptanalysis....
's computability theory into a cocktail shaker and shaking vigorously."

Overview

Algorithmic information theory principally studies complexity measures on string
String (computer science)

In computer programming and some branches of mathematics, a string is an ordered sequence of symbols. These symbols are chosen from a predetermined set or alphabet....
s (or other data structure
Data structure

A data structure in computer science is a way of storing data in a computer so that it can be used efficiently. It is an organization of mathematical and logical concepts of data....
s). Because most mathematical objects can be described in terms of strings, or as the limit
Limit of a sequence

The limit of a sequence is one of the oldest concepts in mathematical analysis. It provides a rigorous definition of the idea of a sequence converging towards a point called the limit....
 of a sequence
Sequence

In mathematics, a sequence is an ordered list of objects . Like a Set , it contains Element , and the number of terms is called the length of the sequence....
 of strings, it can be used to study a wide variety of mathematical objects, including integer
Integer

The integers are natural numbers including 0 and their negative and non-negative numberss . They are numbers that can be written without a fractional or decimal component, and fall within the set ....
s and real number
Real number

In mathematics, the real numbers may be described informally in several different ways. The real numbers include both rational numbers, such as 42 and −23/129, and irrational numbers, such as pi and the square root of two; or, a real number can be given by an infinite decimal representation, such as 2.4871773339...., where the digits co...
s.

This use of the term "information" might be a bit misleading, as it depends upon the concept of compressibility. Informally, from the point of view of algorithmic information theory, the information content of a string is equivalent to the length of the shortest possible self-contained representation of that string. A self-contained representation is essentially a program – in some fixed but otherwise irrelevant universal programming language
Programming language

A programming language is a machine-readable artificial language designed to express computations that can be performed by a machine, particularly a computer....
 – that, when run, outputs the original string.

From this point of view, a 3000 page encyclopedia actually contains less information than 3000 pages of completely random letters, despite the fact that the encyclopedia is much more useful. This is because to reconstruct the entire sequence of random letters, one must know, more or less, what every single letter is. On the other hand, if every vowel were removed from the encyclopedia, someone with reasonable knowledge of the English language could reconstruct it, just as one could likely reconstruct the sentence "Ths sntnc hs lw nfrmtn cntnt" from the context and consonants present. For this reason, high-information strings and sequences are sometimes called "random"; people also sometimes attempt to distinguish between "information" and "useful information" and attempt to provide rigorous definitions for the latter, with the idea that the random letters may have more information than the encyclopedia, but the encyclopedia has more "useful" information.

Unlike classical information theory, algorithmic information theory gives formal, rigorous definitions of a random string
Kolmogorov complexity

In algorithmic information theory , the Andrey Kolmogorov complexity of an object such as a piece of text is a measure of the computational resources needed to specify the object....
 and a random infinite sequence
Algorithmically random sequence

Intuitively, an algorithmically random sequence is an infinite Sequence#Infinite sequences in theoretical computer science ofbinary digits that appears random to any algorithm....
 that do not depend on physical or philosophical intuition
Intuition

Intuition has many related meanings, usually connected to the meaning "ability to sense or know immediately without reasoning", and is often regarded as a divine or prophetic power, including:...
s about nondeterminism
Nondeterminism

Nondeterminism may refer to:* Nondeterministic algorithm * Indeterminacy in computation* Indeterminism ...
 or likelihood. (The set of random strings depends on the choice of the universal Turing machine used to define Kolmogorov complexity, but any choice gives identical asymptotic results because the Kolmogorov complexity of a string is invariant up to an additive constant depending only on the choice of universal Turing machine. For this reason the set of random infinite sequences is independent of the choice of universal machine.)

Some of the results of algorithmic information theory, such as Chaitin's incompleteness theorem
Kolmogorov complexity

In algorithmic information theory , the Andrey Kolmogorov complexity of an object such as a piece of text is a measure of the computational resources needed to specify the object....
, appear to challenge common mathematical and philosophical intuitions. Most notable among these is the construction of Chaitin's constant
Chaitin's constant

In the computer science subfield of algorithmic information theory a Chaitin constant or halting probability is a real number that informally represents the probability that a randomly-chosen program will halt....
 O, a real number which expresses the probability that a self-delimiting universal Turing machine will halt
Halting problem

In computability theory , the halting problem is a decision problem which can be stated as follows: given a description of a computer program and a finite input, decide whether the program finishes running or will run forever, given that input....
 when its input is supplied by flips of a fair coin (sometimes thought of as the probability that a random computer program will eventually halt). Although O is easily defined, in any consistent axiom
Axiom

In traditional logic, an axiom or postulate is a proposition that is not proved or demonstrated but considered to be either self-evidence, or subject to necessary decision....
atizable theory
Theory (mathematical logic)

In mathematical logic, a theory is a set of sentence s in a formal language. For example, a first-order theory is a set of first-order logic sentences....
 one can only compute finitely many digits of O, so it is in some sense unknowable, providing an absolute limit on knowledge that is reminiscent of Gödel's Incompleteness Theorem. Although the digits of O cannot be determined, many properties of O are known; for example, it is an algorithmically random sequence
Algorithmically random sequence

Intuitively, an algorithmically random sequence is an infinite Sequence#Infinite sequences in theoretical computer science ofbinary digits that appears random to any algorithm....
 and thus its binary digits are evenly distributed (in fact it is normal
Normal number

In mathematics, a normal number is a real number whose digits in every radix show a uniform distribution , with all digits being equally likely, all pairs of digits equally likely, all triplets of digits equally likely, etc....
).

History

The field was developed by Andrey Kolmogorov
Andrey Kolmogorov

Andrey Nikolaevich Kolmogorov was a Soviet Union Russian mathematician, preeminent in the 20th century who advanced various scientific fields ....
, Ray Solomonoff
Ray Solomonoff

Ray Solomonoff invented Algorithmic Probability in 1960. He first described his results at a Conference at Caltech,1960, andin a report, Feb....
 and Gregory Chaitin
Gregory Chaitin

Gregory John Chaitin is an Argentina-United States mathematician and computer scientist.Beginning in the late 1960s, Chaitin made contributions to algorithmic information theory and metamathematics, in particular a new incompleteness theorem in reaction to G?del's incompleteness theorem....
, starting in the late 1960s. There are several variants of Kolmogorov complexity or algorithmic information; the most widely used one is based on self-delimiting programs and is mainly due to Leonid Levin
Leonid Levin

Leonid Anatolievich Levin is a Russian-American computer scientist best known for his work in defining NP-completeness. He studied under Andrey Kolmogorov....
 (1974). Per Martin-Löf
Per Martin-Löf

Per Erik Rutger Martin-L?f is a Sweden logician, philosopher, and mathematician. He is best known for developing intuitionistic type theory as a constructive foundation of mathematics....
 also contributed significantly to the information theory of infinite sequences. An axiomatic approach to algorithmic information theory based on Blum axioms
Blum axioms

In computational complexity theory the Blum axioms or Blum complexity axioms are axioms which specify desirable properties of complexity measures on the set of computable functions....
 (Blum 1967) was introduced by Mark Burgin in a paper presented for publication by Andrey Kolmogorov
Andrey Kolmogorov

Andrey Nikolaevich Kolmogorov was a Soviet Union Russian mathematician, preeminent in the 20th century who advanced various scientific fields ....
 (Burgin 1982). The axiomatic approach encompasses other approaches in the algorithmic information theory. It is possible to treat different measures of algorithmic information as particular cases of axiomatically defined measures of algorithmic information. Instead, of proving similar theorems, such as the basic invariance theorem, for each particular measure, it is possible to easily deduce all such results from one corresponding theorem proved in the axiomatic setting. This is a general advantage of the axiomatic approach in mathematics. The axiomatic approach to algorithmic information theory was further developed in the book (Burgin 2005) and applied to software metrics (Burgin and Debnath, 2003; Debnath and Burgin, 2003).

Precise Definitions


A binary string is said to be random if the Kolmogorov complexity
Kolmogorov complexity

In algorithmic information theory , the Andrey Kolmogorov complexity of an object such as a piece of text is a measure of the computational resources needed to specify the object....
 of the string is at least the length of the string. A simple counting argument shows that some strings of any given length are random, and almost all strings are very close to being random. Since Kolmogorov complexity depends on a fixed choice of universal Turing machine (informally, a fixed "description language" in which the "descriptions" are given), the collection of random strings does depend on the choice of fixed universal machine. Nevertheless, the collection of random strings, as a whole, has similar properties regardless of the fixed machine, so one can (and often does) talk about the properties of random strings as a group without having to first specify a universal machine.

An infinite binary sequence is said to be random if, for some constant c, the Kolmogorov complexity
Kolmogorov complexity

In algorithmic information theory , the Andrey Kolmogorov complexity of an object such as a piece of text is a measure of the computational resources needed to specify the object....
 of every initial segment (with length n) of the sequence is at least n-c. Importantly, the complexity used here is prefix-free complexity; if plain complexity were used, there would be no random sequences. However, with this definition, it can be shown that almost every sequence (from the point of view of the standard measure
Measure (mathematics)

In mathematics, more specifically in measure theory, a measure on a set is a systematic way to assign to each suitable subset a number, intuitively interpreted as the size of the subset....
 - "fair coin" or Lebesgue measure
Lebesgue measure

In mathematics, the Lebesgue measure, named after Henri Lebesgue, is the standard way of assigning a length, area or volume to subsets of Euclidean space....
 - on the space of infinite binary sequences) is random. Also, since it can be shown that the Kolmogorov complexity relative to two different universal machines differs by at most a constant, the collection of random infinite sequences does not depend on the choice of universal machine (in contrast to finite strings). This definition of randomness is usually called Martin-Löf randomness, after Per Martin-Löf
Per Martin-Löf

Per Erik Rutger Martin-L?f is a Sweden logician, philosopher, and mathematician. He is best known for developing intuitionistic type theory as a constructive foundation of mathematics....
, to distinguish it from other similar notions of randomness. It is also sometimes called 1-randomness to distinguish it from other stronger notions of randomness (2-randomness, 3-randomness, etc.).

(Related definitions can be made for alphabets other than the set .)

See also

  • Kolmogorov complexity
    Kolmogorov complexity

    In algorithmic information theory , the Andrey Kolmogorov complexity of an object such as a piece of text is a measure of the computational resources needed to specify the object....
  • Algorithmically random sequence
    Algorithmically random sequence

    Intuitively, an algorithmically random sequence is an infinite Sequence#Infinite sequences in theoretical computer science ofbinary digits that appears random to any algorithm....
  • Algorithmic probability
    Algorithmic probability

    Algorithmic probability is a concept in theoretical computer science; it quantifies the idea of theories and predictions with reference to short programs and their output....
  • Chaitin's constant
    Chaitin's constant

    In the computer science subfield of algorithmic information theory a Chaitin constant or halting probability is a real number that informally represents the probability that a randomly-chosen program will halt....
  • Chaitin–Kolmogorov randomness
  • Computationally indistinguishable
    Computationally indistinguishable

    In computational complexity, if and are two distribution ensembles indexed by a security parameter n, then we say they are computationally indistinguishable if for any nonuniform probabilistic polynomial time algorithm A, the following quantity is a negligible function in n:...
  • Distribution ensemble
    Distribution ensemble

    The term distribution- or probability ensemble is used in cryptography to denote a family of distributions or random variables where is a index set, and each is a random variable, or probability distribution....
  • Epistemology
    Epistemology

    Epistemology or theory of knowledge is the branch of philosophy concerned with the nature and scope of knowledge. It addresses the questions:...
  • Invariance theorem
    Invariance theorem

    In algorithmic information theory, the invariance theorem, originally proved by Ray Solomonoff, states that a universal Turing machine provides an optimal means of description, up to an additive constant....
  • Limits to knowledge
  • Minimum description length
    Minimum description length

    The minimum description length principle is a formalization of Occam's Razor in which the best hypothesis for a given set of data is the one that leads to the largest data compression....
  • Minimum message length
    Minimum message length

    Minimum message length is a formal information theory restatement of Occam's Razor: even when models are not equal in goodness of fit accuracy to the observed data, the one generating the shortest overall message is more likely to be correct ....
  • Pseudorandom ensemble
    Pseudorandom ensemble

    Let be a uniform ensembleand be an distribution ensemble. The ensemble is called pseudorandom if and are Computationally indistinguishable....
  • Pseudorandom generator
    Pseudorandom generator

    In theoretical computer science, a pseudorandom generator is a deterministic algorithm that produces a pseudorandom distribution from a short uniform distribution input, known as a random seed....
  • Uniform ensemble
    Uniform ensemble

    Uniform ensemble is a distribution ensemble, Uniform distribution over strings of length ....


External links

  • .


Further reading


  • Blum, M. (1967) On the Size of Machines, Information and Control, v. 11, pp. 257-265
  • Blum M. (1967a) A Machine-independent Theory of Complexity of Recursive Functions, Journal of the ACM, v. 14, No.2, pp. 322-336
  • Burgin, M. (1982) Generalized Kolmogorov complexity and duality in theory of computations, Soviet Math. Dokl., v.25, No. 3, pp.19-23
  • Burgin, M. (1990) Generalized Kolmogorov Complexity and other Dual Complexity Measures, Cybernetics, No. 4, pp. 21-29
  • Burgin, M. Super-recursive algorithms, Monographs in computer science, Springer, 2005
  • Calude, C.S. (1996) Algorithmic information theory: Open problems, J. UCS, v. 2, pp. 439-441
  • Calude, C.S. Information and Randomness: An Algorithmic Perspective, (Texts in Theoretical Computer Science. An EATCS Series), Springer-Verlag, Berlin, 2002
  • Chaitin, G.J. (1966) On the Length of Programs for Computing Finite Binary Sequences, J. Association for Computing Machinery, v. 13, No. 4, pp. 547-569
  • Chaitin, G.J. (1969) On the Simplicity and Speed of Programs for Computing Definite Sets of Natural Numbers, J. Association for Computing Machinery, v. 16, pp. 407-412
  • Chaitin, G.J. (1975) A Theory of Program Size Formally Identical to Information Theory, J. Association for Computing Machinery, v. 22, No. 3, pp. 329-340
  • Chaitin, G.J. (1977) Algorithmic information theory, IBM Journal of Research and Development, v.21, No. 4, 350-359
  • Chaitin, G.J. Algorithmic Information Theory, Cambridge University Press, Cambridge, 1987
  • Kolmogorov, A.N. (1965) Three approaches to the definition of the quantity of information, Problems of Information Transmission, No. 1, pp.3-11
  • Kolmogorov, A.N. (1968) Logical basis for information theory and probability theory, IEEE Trans. Inform. Theory, vol. IT-14, pp. 662-664
  • Levin, L. A. (1974) Laws of information (nongrowth) and aspects of the foundation of prob¬ability theory, Problems of Information Transmission, v. 10, No. 3, pp. 206-210
  • Levin, L.A. (1976) Various Measures of Complexity for Finite Objects (Axiomatic Description), Soviet Math. Dokl., v. 17, pp. 522-526
  • Li, M., and Vitanyi, P. An Introduction to Kolmogorov Complexity and its Applications, Springer-Verlag, New York, 1997
  • Solomonoff, R.J. (1960) A Preliminary Report on a General Theory of Inductive Inference, Technical Report ZTB-138, Zator Company, Cambridge, Mass.
  • Solomonoff, R.J. (1964) A Formal Theory of Inductive Inference, Information and Control, v. 7, No. 1, pp. 1-22; No.2, pp. 224-254
  • Van Lambagen, (1989) Algorithmic Information Theory, Journal for Symbolic Logic, v. 54, pp. 1389-1400
  • Zurek, W.H. (1991) Algorithmic Information Content, Church-Turing Thesis, physical entropy, and Maxwell’s demon, in Complexity, Entropy and the Physics of Information, (Zurek, W.H., Ed.) Addison-Wesley, pp. 73-89
  • Zvonkin, A.K. and Levin, L. A. (1970) The Complexity of Finite Objects and the Development of the Concepts of Information and Randomness by Means of the Theory of Algorithms, Russian Mathematics Surveys, v. 256, pp. 83-124