All Topics  
Maximum entropy thermodynamics

 

   Email Print
   Bookmark   Link






 

Maximum entropy thermodynamics



 
 
In physics
Physics

Physics is the natural science which examines basic concepts such as energy, force, and spacetime and all that derives from these, such as mass, charge, matter and its Motion ....
, maximum entropy thermodynamics (colloquially, MaxEnt thermodynamics
Thermodynamics

In physics, thermodynamics is the study of the conversion of heat energy into different forms of energy ; different energy conversions into heat energy; and its relation to macroscopic variables such as temperature, pressure, and volume....
) views equilibrium thermodynamics
Equilibrium thermodynamics

Equilibrium Thermodynamics is the systematic study of transformations of matter and energy in systems as they approach equilibrium. The word equilibrium implies a state of balance....
 and statistical mechanics
Statistical mechanics

Statistical mechanics is the application of probability theory, which includes Mathematics tools for dealing with large populations, to the field of mechanics, which is concerned with the motion of particles or objects when subjected to a force....
 as inference
Inference

Inference is the act or process of deriving a logical consequence from premises.Inference is studied within several different fields.* Human inference is traditionally studied within the field of cognitive psychology....
 processes. More specifically, MaxEnt applies inference techniques rooted in Shannon information theory, subjective probability, and the principle of maximum entropy
Principle of maximum entropy

The principle of maximum entropy is a postulate about a universal feature of any probability assignment on a given set of propositions . Let some testable information about a probability distribution function be given....
. These techniques are relevant to any situation requiring prediction from incomplete or insufficient data (e.g., image reconstruction
Image processing

In electrical engineering and computer science, image processing is any form of signal processing for which the input is an , such as photographs or video frame; the output of image processing can be either an image or a set of characteristics or parameters related to the image....
, signal processing
Signal processing

Signal processing is the analysis, interpretation, and manipulation of signal . Signals of interest include: audio signal processing, , time-varying measurement values and sensor data, for example biological data such as electrocardiograms, control system signals, telecommunication transmission signals such as radio signals, and many others....
, spectral analysis
Spectral analysis

Spectral analysis or Spectrum analysis may refer to:* Spectrum analysis in chemistry and physics, a method of analyzing the chemical properties of matter from bands in their optical spectrum...
, and inverse problem
Inverse problem

An inverse problem is the task that often occurs in many branches of science and mathematics where the values of some model parameter must be obtained from the observed datum....
s). MaxEnt thermodynamics began with two papers Edwin T.






Discussion
Ask a question about 'Maximum entropy thermodynamics'
Start a new discussion about 'Maximum entropy thermodynamics'
Answer questions from other users
Full Discussion Forum



Encyclopedia


In physics
Physics

Physics is the natural science which examines basic concepts such as energy, force, and spacetime and all that derives from these, such as mass, charge, matter and its Motion ....
, maximum entropy thermodynamics (colloquially, MaxEnt thermodynamics
Thermodynamics

In physics, thermodynamics is the study of the conversion of heat energy into different forms of energy ; different energy conversions into heat energy; and its relation to macroscopic variables such as temperature, pressure, and volume....
) views equilibrium thermodynamics
Equilibrium thermodynamics

Equilibrium Thermodynamics is the systematic study of transformations of matter and energy in systems as they approach equilibrium. The word equilibrium implies a state of balance....
 and statistical mechanics
Statistical mechanics

Statistical mechanics is the application of probability theory, which includes Mathematics tools for dealing with large populations, to the field of mechanics, which is concerned with the motion of particles or objects when subjected to a force....
 as inference
Inference

Inference is the act or process of deriving a logical consequence from premises.Inference is studied within several different fields.* Human inference is traditionally studied within the field of cognitive psychology....
 processes. More specifically, MaxEnt applies inference techniques rooted in Shannon information theory, subjective probability, and the principle of maximum entropy
Principle of maximum entropy

The principle of maximum entropy is a postulate about a universal feature of any probability assignment on a given set of propositions . Let some testable information about a probability distribution function be given....
. These techniques are relevant to any situation requiring prediction from incomplete or insufficient data (e.g., image reconstruction
Image processing

In electrical engineering and computer science, image processing is any form of signal processing for which the input is an , such as photographs or video frame; the output of image processing can be either an image or a set of characteristics or parameters related to the image....
, signal processing
Signal processing

Signal processing is the analysis, interpretation, and manipulation of signal . Signals of interest include: audio signal processing, , time-varying measurement values and sensor data, for example biological data such as electrocardiograms, control system signals, telecommunication transmission signals such as radio signals, and many others....
, spectral analysis
Spectral analysis

Spectral analysis or Spectrum analysis may refer to:* Spectrum analysis in chemistry and physics, a method of analyzing the chemical properties of matter from bands in their optical spectrum...
, and inverse problem
Inverse problem

An inverse problem is the task that often occurs in many branches of science and mathematics where the values of some model parameter must be obtained from the observed datum....
s). MaxEnt thermodynamics began with two papers Edwin T. Jaynes published in the 1957 Physical Review.

Maximum Shannon entropy

Central to the MaxEnt thesis is the principle of maximum entropy
Principle of maximum entropy

The principle of maximum entropy is a postulate about a universal feature of any probability assignment on a given set of propositions . Let some testable information about a probability distribution function be given....
, which states that given certain "testable information" about a probability distribution
Probability distribution

In probability theory and statistics, a probability distribution identifies either the probability of each value of an unidentified random variable , or the probability of the value falling within a particular interval ....
, for example particular expectation
Expectation

In the case of uncertainty, expectation is what is considered the most likely to happen. An expectation, which is a belief that is centred on the future, may or may not be realistic....
 values, but which is not in itself sufficient to uniquely determine the distribution, one should prefer the distribution which maximizes the Shannon information entropy.

This is known as the Gibbs algorithm
Gibbs algorithm

In statistical mechanics, the Gibbs algorithm, first introduced by J. Willard Gibbs in 1878, is the injunction to choose a statistical ensemble for the unknown microstate of a thermodynamic system by minimising the average log probability...
, having been introduced by J. Willard Gibbs in 1878, to set up statistical ensembles to predict the properties of thermodynamic systems at equilibrium. It is the cornerstone of the statistical mechanical analysis of the thermodynamic properties of equilibrium systems (see partition function
Partition function

Partition function may refer to:*Partition function *Partition function , which generalizes its use in statistical mechanics and quantum field theory:...
).

A direct connection is thus made between the equilibrium thermodynamic entropy STh, a state function
State function

In thermodynamics, a state function, state quantity, or a function of state, is a physical quantity of a system that depends only on the current Thermodynamic state, not on the way in which the system got to that state....
 of pressure, volume, temperature, etc., and the information entropy
Information entropy

In information theory, entropy is a measure of the uncertainty associated with a random variable. The term by itself in this context usually refers to the Shannon entropy, which quantifies, in the sense of an expected value, the self-information contained in a message, usually in units such as bits....
 for the predicted distribution with maximum uncertainty conditioned only on the expectation values of those variables:

kB, Boltzmann's constant, has no fundamental physical significance here, but is necessary to retain consistency with the previous historical definition of entropy by Clausius (1865) (see Boltzmann's constant).

However, the MaxEnt school argue that the MaxEnt approach is a general technique of statistical inference, with applications far beyond this. It can therefore also be used to predict a distribution for "trajectories" G "over a period of time" by maximising:

This "information entropy" does not necessarily have a simple correspondence with thermodynamic entropy. But it can be used to predict features of nonequilibrium thermodynamic
Non-equilibrium thermodynamics

Non-equilibrium thermodynamics is a branch of thermodynamics concerned with studying time-dependent thermodynamic systems, irreversible transformations and Open system ....
 systems as they evolve over time.

In the field of near-equilibrium thermodynamics, the Onsager reciprocal relations
Onsager reciprocal relations

In thermodynamics, the Onsager reciprocal relations express the equality of certain relations between fluxs and forces in thermodynamic systems out of equilibrium , but where a notion of local thermodynamic equilibrium exists....
 and the Green-Kubo relations
Green-Kubo relations

Green?Kubo relations give exact mathematical expression for transport coefficients in terms of integrals of time correlation functions....
 fall out very directly. The approach also creates a solid theoretical framework for the study of far-from-equilibrium thermodynamics, making the derivation of the entropy production fluctuation theorem
Fluctuation theorem

The fluctuation theorem is a theorem from statistical mechanics dealing with the relative probability that the entropy of a system which is currently away from thermodynamic equilibrium will increase or decrease over a given amount of time....
 particularly straightforward. Practical calculations for most far-from-equilibrium systems remain very challenging, however.

Technical note: For the reasons discussed in the article differential entropy
Differential entropy

Differential entropy is a concept in information theory which tries to extend the idea of information entropy, a measure of average surprisal of a random variable, to continuous probability distributions....
, the simple definition of Shannon entropy ceases to be directly applicable for random variable
Random variable

In mathematics, random variables are used in the study of Randomness and probability. They were developed to assist in the analysis of Game of chance, stochastic events, and the results of experiment by capturing only the mathematical properties necessary to answer probability questions....
s with continuous probability distribution function
Probability distribution function

Depending upon which text you consult, a probability distribution function is either:* a cumulative distribution function, or* a probability mass function, or...
s. Instead the appropriate quantity to maximise is the "relative information entropy,"

Hc is the negative of the Kullback-Leibler divergence, or discrimination information, of m(x) from p(x), where m(x) is a prior invariant measure
Invariant measure

In mathematics, an invariant measure is a measure that is preserved by some function . Invariant measures are of great interest in the study of dynamical systems....
 for the variable(s). The relative entropy Hc is always less than zero, and can be thought of as (the negative of) the number of bit
Bit

A bit is a binary numeral system numerical digit, taking a value of either 0 or 1. Binary digits are a basic unit of information Computer data storage and transmission in digital computing and digital information theory....
s of uncertainty lost by fixing on p(x) rather than m(x). Unlike the Shannon entropy, the relative entropy Hc has the advantage of remaining finite and well-defined for continuous x, and invariant under 1-to-1 coordinate transformations. The two expressions coincide for discrete probability distribution
Discrete probability distribution

Discrete probability distributions arise in the mathematical description of probability theory and statistical analysis in which the values that might be observed are restricted to being within a pre-defined list of possible values....
s, if one can make the assumption that m(xi) is uniform - i.e. the principle of equal a-priori probability, which underlies statistical thermodynamics.

Philosophical Implications


Adherents to the MaxEnt viewpoint tend to take a very definite position on some of the conceptual/philosophical questions
Philosophy of thermal and statistical physics

The philosophy of thermal and statistical physics is that part of the philosophy of physics whose subject matter is classical thermodynamics, statistical mechanics, and related theories....
 in thermodynamics. Some of these positions are sketched below.

The nature of the probabilities in statistical mechanics


According to the MaxEnt viewpoint, the probabilities in statistical mechanics are subjective
Subjectivity

Subjectivity refers to a subject's perspective or opinion, particularly feelings, beliefs, and desires. It is often used casually to refer to unjustified personal opinions, in contrast to knowledge and justified belief....
 (epistemic, personal), to the extent that they are conditioned on a particular model for the underlying state space (e.g. Liouvillian phase space
Phase space

In mathematics and physics, a phase space, introduced by Willard Gibbs in 1901, is a space in which all possible states of a system are represented, with each possible state of the system corresponding to one unique point in the phase space....
). They are also conditioned on a particular partial description of the system (the macroscopic description of the system used to constrain the MaxEnt probability assignment). The probabilities are objective
Objectivity (science)

"[A]n objective account is one which attempts to capture the nature of the object studied in a way that does not depend on any features of the particular subject who studies it....
 to the extent that given these inputs, a uniquely defined probability distribution will result.

At a trivial level, the probabilities cannot be entirely objective, because in reality there is only one system, and (assuming determinism
Determinism

Determinism is the philosophy proposition that every event, including human cognition and behavior, decision and action, is causality determined by an unbroken chain of prior occurrences. With numerous historical debates, many varieties and philosophical positions on the subject of determinism exist from traditions throughout...
) a single unknown trajectory it will evolve through. The probabilities therefore represent a lack of information in the analyst's macroscopic description of the system, not a property of the underlying reality itself.

Moreover, the quality of the predicted probabilities depends on whether the constraints of the macroscopic model are a sufficiently accurate and/or complete description of the system to capture all of the experimentally reproducible behaviour. This cannot be guaranteed, a priori. For this reason MaxEnt proponents also call the method predictive statistical mechanics. The predictions can fail. But if they do, this is informative, because it signals the presence of new constraints needed to capture reproducible behaviour in the system, which had not been taken into account.

Is entropy "real" ?


The thermodynamic entropy (at equilibrium) is a function of the state variables of the model description. It is therefore as "real" as the other variables in the model description. If the model constraints in the probability assignment are a "good" description, containing all the information needed to predict reproducible experimental results, then that includes all of the results one could predict using the formulae involving entropy from classical thermodynamics. To that extent, the MaxEnt STh is as "real" as the entropy in classical thermodynamics.

Of course, in reality there is only one real state of the system. The entropy is not a direct function of that state. It is a function of the real state only through the (subjectively chosen) macroscopic model description.

Is ergodic theory relevant ?


The Gibbsian ensemble idealises the notion of repeating an experiment again and again on different systems, not again and again on the same system. So long-term time averages and the ergodic hypothesis
Ergodic hypothesis

The quick definition of ergodic is that given sufficient time, a system will return to states that it has previously experienced. The text below explains this basic premise in detail....
, despite the intense interest in them in the first part of the twentieth century, strictly speaking are not relevant to the probability assignment for the state one might find the system in.

However, this changes if there is additional knowledge that the system is being prepared in a particular way some time before the measurement. One must then consider whether this gives further information which is still relevant at the time of measurement. The question of how 'rapidly mixing' different properties of the system are then becomes very much of interest. Information about some degrees of freedom of the combined system may become unusable very quickly; information about other properties of the system may go on being relevant for a considerable time.

If nothing else, the medium and long-run time correlation properties of the system are interesting subjects for experimentation in themselves. Failure to accurately predict them is a good indicator that relevant macroscopically determinable physics may be missing from the model.

The Second Law


According to Liouville's theorem
Liouville's theorem (Hamiltonian)

In physics, Liouville's theorem, named after the French mathematician Joseph Liouville, is a key theorem in classical statistical mechanics and Hamiltonian mechanics....
 for Hamiltonian dynamics, the hyper-volume of a cloud of points in phase space
Phase space

In mathematics and physics, a phase space, introduced by Willard Gibbs in 1901, is a space in which all possible states of a system are represented, with each possible state of the system corresponding to one unique point in the phase space....
 remains constant as the system evolves. Therefore, the information entropy must also remain constant, if we condition on the original information, and then follow each of those microstates forward in time:

However, as time evolves, that initial information we had becomes less directly accessible. Instead of being easily summarisable in the macroscopic description of the system, it increasingly relates to very subtle correlations between the positions and momenta of individual molecules. (Compare to Boltzmann's H-theorem
H-theorem

In thermodynamics, the H-theorem, introduced by Ludwig Boltzmann in 1872, describes the increase in the entropy of an ideal gas in an irreversible process, by considering the Boltzmann equation....
.) Equivalently, it means that the probability distribution for the whole system, in 6N-dimensional phase space, becomes increasingly irregular, spreading out into long thin fingers rather than the initial tightly defined volume of possibilities.

Classical thermodynamics is built on the assumption that entropy is a state function
State function

In thermodynamics, a state function, state quantity, or a function of state, is a physical quantity of a system that depends only on the current Thermodynamic state, not on the way in which the system got to that state....
 of the macroscopic variables -- i.e., that none of the history of the system matters, so that it can all be ignored.

The extended, wispy, evolved probability distribution, which still has the initial Shannon entropy STh(1), should reproduce the expectation values of the observed macroscopic variables at time t2. However it will no longer necessarily be a maximum entropy distribution for that new macroscopic description. On the other hand, the new thermodynamic entropy STh(2) assuredly will measure the maximum entropy distribution, by construction. Therefore, we expect:

At an abstract level, this result simply means that some of the information we originally had about the system has become "no longer useful" at a macroscopic level. At the level of the 6N-dimensional probability distribution, this result represents coarse graining -- i.e., information loss by smoothing out very fine-scale detail.

Caveats with the argument


Some caveats should be considered with the above.

1. Like all statistical mechanical results according to the MaxEnt school, this increase in thermodynamic entropy is only a prediction. It assumes in particular that the initial macroscopic description contains all of the information relevant to predicting the later macroscopic state. This may not be the case, for example if the initial description fails to reflect some aspect of the preparation of the system which later becomes relevant. In that case the "failure" of a MaxEnt prediction tells us that there is something more which is relevant that we may have overlooked in the physics of the system.

It is also sometimes suggested that quantum measurement, especially in the decoherence interpretation, may give an apparently unexpected reduction in entropy per this argument, as it appears to involve macroscopic information becoming available which was previously inaccessible. (However, the entropy accounting of quantum measurement is tricky, because to get full decoherence one may be assuming an infinite environment, with an infinite entropy).

2. The argument so far has glossed over the question of fluctuations. It has also implicitly assumed that the uncertainty predicted at time t1 for the variables at time t2 will be much smaller than the measurement error. But if the measurements do meaningfully update our knowledge of the system, our uncertainty as to its state is reduced, giving a new SI(2) which is less than SI(1). (Note that if we allow ourselves the abilities of Laplace's demon
Laplace's demon

In the history of science, Laplace's demon is a hypothetical "demon" envisioned in 1814 by Pierre-Simon Laplace such that if it knew the precise location and momentum of every atom in the universe then it could use Newton's laws to reveal the entire course of cosmic events, past and future....
, the consequences of this new information can also be mapped backwards, so our uncertainty about the dynamical state at time t1 is now also reduced from SI(1) to SI(2) ).

We know that STh(2) > SI(2); but we can now no longer be certain that it is greater than STh(1) = SI(1). This then leaves open the possibility for fluctuations in STh. The thermodynamic entropy may go "down" as well as up. A more sophisticated analysis is given by the entropy Fluctuation Theorem
Fluctuation theorem

The fluctuation theorem is a theorem from statistical mechanics dealing with the relative probability that the entropy of a system which is currently away from thermodynamic equilibrium will increase or decrease over a given amount of time....
, which can be established as a consequence of the time-dependent MaxEnt picture.

3. As just indicated, the MaxEnt inference runs equally well in reverse. So given a particular final state, we can ask, what can we "retrodict" to improve our knowledge about earlier states? However the Second Law argument above also runs in reverse: given macroscopic information at time t2, we should expect it too to become less useful. The two procedures are time-symmetric. But now the information will become less and less useful at earlier and earlier times. (Compare with Loschmidt's paradox
Loschmidt's paradox

Loschmidt's paradox, also known as the reversibility paradox, is the objection that it should not be possible to deduce an irreversible process from time-symmetric dynamics....
.) The MaxEnt inference would predict that the most probable origin of a currently low-entropy state would be as a spontaneous fluctuation from an earlier high entropy state. But this conflicts with what we know to have happened, namely that entropy has been increasing steadily, even back in the past.

The MaxEnt proponents' response to this would be that such a systematic failing in the prediction of a MaxEnt inference is a "good" thing. It means that there is thus clear evidence that some important physical information has been missed in the specification the problem. If it is correct that the dynamics "are" time-symmetric
T-symmetry

T Symmetry is the symmetry in physics under a time reversal Transformation —Although in restricted contexts one may find this symmetry, the universe itself does not show symmetry under time reversal due to the second law of thermodynamics....
, it appears that we need to put in by hand a prior probability
Prior probability

A prior probability is a conditional probability, interpreted as a description of what is known about a variable in the absence of some Marginal likelihood....
 that initial configurations with a low thermodynamic entropy are more likely than initial configurations with a high thermodynamic entropy. This cannot be explained by the immediate dynamics. Quite possibly, it arises as a reflection of the evident time-asymmetric evolution of the universe on a cosmological scale (see arrow of time
Arrow of time

In the natural sciences, arrow of time, or time?s arrow, is a term coined in 1927 by British astronomer Arthur Eddington used to distinguish a direction of time on a four-dimensional relativistic map of the world, which, according to Eddington, can be determined by a study of organizations of atoms, molecules, and bodies....
).