All Topics  
Trace (linear algebra)

 

   Email Print
   Bookmark   Link






 

Trace (linear algebra)



 
 
In linear algebra
Linear algebra

Linear algebra is the branch of mathematics concerned with the study of Euclidean vectors, vector spaces , linear maps , and system of linear equations....
, the trace of an n-by-n square matrix A is defined to be the sum of the elements on the main diagonal
Main diagonal

In linear algebra, the main diagonal of a matrix is the collection of cells where is equal to .The main diagonal of a square matrix is the diagonal which runs from the top left corner to the bottom right corner....
 (the diagonal from the upper left to the lower right) of A, i.e.,

where represents the entry on the ith row and jth column of A. Equivalently, the trace of a matrix is the sum of its eigenvalues, making it an invariant
Invariants of tensors

In mathematics, in the fields of multilinear algebra and representation theory, invariants of tensors are coefficients of the characteristic polynomial of the tensor A:...
 with respect to a change of basis
Change of basis

In linear algebra, a basis for a vector space of dimension n is a sequence of n vectors α1, ..., αn with the property that every vector in the space can be expressed uniquely as a linear combination of the basis vectors....
. This characterization can be used to define the trace for a linear operator in general.

Note that the trace is only defined for a square matrix (i.e.






Discussion
Ask a question about 'Trace (linear algebra)'
Start a new discussion about 'Trace (linear algebra)'
Answer questions from other users
Full Discussion Forum



Recent Posts









Encyclopedia


In linear algebra
Linear algebra

Linear algebra is the branch of mathematics concerned with the study of Euclidean vectors, vector spaces , linear maps , and system of linear equations....
, the trace of an n-by-n square matrix A is defined to be the sum of the elements on the main diagonal
Main diagonal

In linear algebra, the main diagonal of a matrix is the collection of cells where is equal to .The main diagonal of a square matrix is the diagonal which runs from the top left corner to the bottom right corner....
 (the diagonal from the upper left to the lower right) of A, i.e.,

where represents the entry on the ith row and jth column of A. Equivalently, the trace of a matrix is the sum of its eigenvalues, making it an invariant
Invariants of tensors

In mathematics, in the fields of multilinear algebra and representation theory, invariants of tensors are coefficients of the characteristic polynomial of the tensor A:...
 with respect to a change of basis
Change of basis

In linear algebra, a basis for a vector space of dimension n is a sequence of n vectors α1, ..., αn with the property that every vector in the space can be expressed uniquely as a linear combination of the basis vectors....
. This characterization can be used to define the trace for a linear operator in general.

Note that the trace is only defined for a square matrix (i.e. n×n).

The use of the term trace arises from the German term Spur (cognate
Cognate

Cognates in linguistics are words that have a common etymology origin.An example of cognates within the same language would be English shirt vs....
 with the English spoor), which, as a function in mathematics, is often abbreviated to "Sp".

Examples

Let be a linear operator represented by the matrix

Then tr(T) = −2 + 1 − 1 = −2.

The trace of the identity matrix
Identity matrix

In linear algebra, the identity matrix or unit matrix of size n is the n-by-n square matrix with ones on the main diagonal and zeros elsewhere....
 is the dimension of the space. The trace of a nilpotent matrix
Nilpotent matrix

In mathematics, a nilpotent matrix is an n×n square matrix M such thatfor some positive integer q. Similarly, a nilpotent transformation is a linear transformation L with for some integer q....
 is zero.

If A and B are positive semi-definite matrices of the same order then



Properties

The trace is a linear map. That is,

for all square matrices A and B, and all scalar
Scalar (mathematics)

In linear algebra, real numbers are called scalars and relate to vectors in a vector space through the operation of scalar multiplication, in which a vector can be multiplied by a number to produce another vector....
s r.

If A is an m×n matrix and B is an n×m matrix, then

(Note however that the trace is not commutative - in general,

In other words, we can only interchange the two halves of the expression, albeit repeatedly. Unlike the determinant
Determinant

In algebra, a determinant is a function depending on n that associates a scalar , det, to an n?n square matrix A. The fundamental geometric meaning of a determinant is a scale factor for measure when A is regarded as a linear transformation....
, the trace of the product is not the product of traces.)

Conversely, the above properties characterize the trace completely in the sense as follows. Let be a linear functional on the space of square matrices satisfying . Then and tr are proportional, for if and only if and (with the standard basis ), and thus .

When both A and B are n by n, the trace of the commutator
Commutator

In mathematics, the commutator gives an indication of the extent to which a certain binary operation fails to be commutative. There are different definitions used in group theory and ring theory....
 of and vanishes: tr([A,B]) = 0. As a consequence, the commutator of any two matrices is never similar to the identity matrix.

The trace is similarity-invariant
Similarity invariance

In mathematics, similarity invariance is a property exhibited by a function whose value is unchanged under similarities of its domain. That is, is invariant under similarities if where is a similarity of A....
, which means that A and P−1AP have the same trace. This is because

A matrix and its transposition
Transpose

In linear algebra, the transpose of a matrix A is another matrix AT created by any one of the following equivalent actions:...
 have the same trace:

tr(A) = tr(AT).


Trace of a linear operator


Given some linear map f : VV (V is a finite-dimensional vector space
Vector space

File:Vector addition ans scaling.pngA vector space is a mathematical structure formed by a collection of vectors: objects that may be Vector addition together and Scalar multiplication by numbers, called scalar s in this context....
) generally, we can define the trace of this map by considering the trace of matrix representation
Matrix representation

A matrix representation is a method used by a computer language to store matrix of more than one dimension in computer storage.Fortran and C use different schemes....
 of f, that is, choosing a basis
Basis (linear algebra)

In linear algebra, a basis is a set of vectors that, in a linear combination, can represent every vector in a given vector space or free module, and such that no element of the set can be represented as a linear combination of the others....
 for V and describing f as a matrix relative to this basis, and taking the trace of this square matrix. The result will not depend on the basis chosen, since different bases will give rise to similar matrices, allowing for the possibility of a basis independent definition for the trace of a linear map.

Such a definition can be given using the canonical isomorphism between the space End(V) of linear maps on V and VV*, where V* is the dual space
Dual space

In mathematics, any vector space, V, has a corresponding dual vector space consisting of all linear functionals on V. Dual vector spaces defined on finite-dimensional vector spaces can be used for defining tensors which are studied in tensor algebra....
 of V. Let v be in V and let f be in V*. Then the trace of the decomposable element vf is defined to be f(v); the trace of a general element is defined by linearity. Using an explicit basis for V and the corresponding dual basis for V*, one can show that this gives the same definition of the trace as given above.

Eigenvalue relationships

If A is a square n-by-n matrix with real
Real number

In mathematics, the real numbers may be described informally in several different ways. The real numbers include both rational numbers, such as 42 and −23/129, and irrational numbers, such as pi and the square root of two; or, a real number can be given by an infinite decimal representation, such as 2.4871773339...., where the digits co...
 or complex
Complex number

In mathematics, the complex numbers are an extension of the real numbers obtained by adjoining an imaginary unit, denoted i, which satisfies:...
 entries and if λ1,...,λn are the (complex and distinct) eigenvalues of A (listed according to their algebraic multiplicities), then

.

This follows from the fact that A is always similar to its Jordan form, an upper triangular matrix
Triangular matrix

In the mathematics discipline of linear algebra, a triangular matrix is a special kind of square matrix where the entries either below or above the main diagonal are zero....
 having λ1,...,λn on the main diagonal. In contrast, the determinant
Determinant

In algebra, a determinant is a function depending on n that associates a scalar , det, to an n?n square matrix A. The fundamental geometric meaning of a determinant is a scale factor for measure when A is regarded as a linear transformation....
 of is the product of its eigenvalues; i.e.,

Derivatives

The trace is the derivative of the determinant: it is the Lie algebra
Lie algebra

In mathematics, a Lie algebra is an algebraic structure whose main use is in studying geometric objects such as Lie groups and differentiable manifolds....
 analog of the (Lie group
Lie group

In mathematics, a Lie group is a group which is also a differentiable manifold, with the property that the group operations are compatible with the Differential structure....
) map of the determinant. This is made precise in Jacobi's formula
Jacobi's formula

In matrix calculus, Jacobi's formula expresses the differential of the determinant of a matrix A in terms of the adjugate of A and the derivative of A....
 for the derivative
Derivative

In calculus, a branch of mathematics, the derivative is a measure of how a function changes as its input changes. Loosely speaking, a derivative can be thought of as how much a quantity is changing at a given point....
 of the determinant (see under determinant
Determinant

In algebra, a determinant is a function depending on n that associates a scalar , det, to an n?n square matrix A. The fundamental geometric meaning of a determinant is a scale factor for measure when A is regarded as a linear transformation....
). As a particular case, : the trace is the derivative of the determinant at the identity. From this (or from the connection between the trace and the eigenvalues), one can derive a connection between the trace function, the exponential map between a Lie algebra and its Lie group (or concretely, the matrix exponential
Matrix exponential

In mathematics, the matrix exponential is a matrix function on square matrix analogous to the ordinary exponential function. Abstractly, the matrix exponential gives the connection between a matrix Lie algebra and the corresponding Lie group....
 function), and the determinant
Determinant

In algebra, a determinant is a function depending on n that associates a scalar , det, to an n?n square matrix A. The fundamental geometric meaning of a determinant is a scale factor for measure when A is regarded as a linear transformation....
:

det(exp(A)) = exp(tr(A)).


For example, consider the one-parameter family of linear transformations given by rotation through angle ?,

These transformations all have determinant 1, so they preserve area. The derivative of this family at ? = 0 is the antisymmetric matrix

which clearly has trace zero, indicating that this matrix represents an infinitesimal transformation which preserves area.

A related characterization of the trace applies to linear vector fields. Given a matrix A, define a vector field F on Rn by F(x) = Ax. The components of this vector field are linear functions (given by the rows of A). The divergence
Divergence

In vector calculus, the divergence is an operator that measures the magnitude of a vector field's source or sink at a given point; the divergence of a vector field is a scalar....
 div
F is a constant function, whose value is equal to tr(A). By the divergence theorem
Divergence theorem

In vector calculus, the divergence theorem, also known as Gauss?s theorem , Ostrogradsky?s theorem , or Gauss-Ostrogradsky theorem is a result that relates the flow of a vector field through a surface to the behavior of the vector field inside the surface....
, one can interpret this in terms of flows: if
F(x) represents the velocity of a fluid at the location x, and U is a region in Rn, the net flow of the fluid out of U is given by tr(A)· vol(U), where vol(U) is the volume
Volume

The volume of any solid, liquid, plasma, vacuum or theoretical object is how much three-dimensional space it occupies, often quantified numerically....
 of
U.

The trace is a linear operator, hence its derivative is constant:

Applications

The trace is used to define characters
Character (mathematics)

In mathematics, a character is a special kind of function from a group to a field . There are at least two distinct, but overlapping meanings....
 of group representation
Group representation

In the mathematics field of representation theory, group representations describe abstract group in terms of linear transformations of vector spaces; in particular, they can be used to represent group elements as matrix so that the group operation can be represented by matrix multiplication....
s. Two representations of a group are equivalent (up to change of basis on ) if for all .

The trace also plays a central role in the distribution of quadratic forms
Quadratic form (statistics)

If is a vector space of random variables, and is an -dimensional symmetric square matrix, then the scalar quantity is known as a quadratic form in ....
.

Lie algebra

A matrix whose trace is zero
0 (number)

0 is both a number and the numerical digit used to represent that number in numeral system. It plays a central role in mathematics as the additive identity of the integers, real numbers, and many other algebraic structures....
 is said to be
traceless or tracefree, and these matrices form the simple Lie algebra sln, which is the Lie algebra
Lie algebra

In mathematics, a Lie algebra is an algebraic structure whose main use is in studying geometric objects such as Lie groups and differentiable manifolds....
 of the special linear group
Special linear group

In mathematics, the special linear group of degree n over a field F is the set of n×n Matrix with determinant 1, with the group operations of ordinary matrix multiplication and matrix inversion....
 of matrices with determinant 1. The special linear group consists of the matrices which do not change volume, while the special linear algebra is the matrices which
infinitesimally do not change volume.

The bilinear form where is called the Killing form
Killing form

In mathematics, the Killing form, named after Wilhelm Killing, is a symmetric bilinear form that plays a basic role in the theories of Lie groups and Lie algebras....
, which is used for the classification of Lie algebras.

The trace defines a bilinear form: (
x, y square matrices) The form is symmetric, non-degenerate and associative in the sense that:

In a simple Lie algebra (e.g., ), every such bilinear form is proportional to each other; in particular, to the Killing form.

Two matrices
x and y are said to be trace orthogonal if .

Inner product


For an
m-by-n matrix A with complex (or real) entries and * being the conjugate transpose, we have

tr(A*A) ≥ 0


with equality if and only if
A = 0. The assignment

yields an inner product on the space of all complex (or real)
m-by-n matrices.

If
m=n then the norm
Norm (mathematics)

In linear algebra, functional analysis and related areas of mathematics, a norm is a function that assigns a strictly positive length or size to all vectors in a vector space, other than the zero vector....
 induced by the above inner product is called the Frobenius norm of a square matrix. Indeed it is simply the Euclidean norm if the matrix is considered as a vector of length
n2.

Generalization


The concept of trace of a matrix is generalised to the trace class
Trace class

In mathematics, a trace class operator is a compact operator for which a Trace may be defined, such that the trace is finite and independent of the choice of basis....
 of compact operator
Compact operator

In functional analysis, a branch of mathematics, a compact operator is a linear operator L from a Banach space X to another Banach space Y, such that the image under L of any bounded subset of X is a relatively compact subset of Y....
s on Hilbert space
Hilbert space

The mathematics concept of a Hilbert space, named after David Hilbert, generalizes the notion of Euclidean space. It extends the methods of vector algebra from the two-dimensional plane and three-dimensional space to infinite-dimensional spaces....
s, and the analog of the Frobenius norm is called the Hilbert-Schmidt
Hilbert-Schmidt operator

In mathematics, a Hilbert?Schmidt operator is a bounded operator A on a Hilbert space H with finite Hilbert?Schmidt norm, meaning that there exists an orthonormal basis of...
 norm.

The partial trace
Partial trace

In linear algebra and functional analysis, the partial trace is a generalization of the trace . Whereas the trace is a scalar valued function on operators, the partial trace is an operator-valued function....
 is another generalization of the trace that is operator-valued.

If
A is a general associative algebra
Associative algebra

In mathematics, an associative algebra is a vector space which also allows the multiplication of vectors in a distributivity and associativity manner....
 over a field
k, then a trace on A is often defined to be any map tr: Ak which vanishes on commutators: tr([a,b]) = 0 for all a,b in A. Such a trace is not uniquely defined; it can always at least be modified by multiplication by a nonzero scalar.

A supertrace
Supertrace

In the theory of superalgebras, if A is a commutative superalgebra, V is a free right A-supermodule and T is an endomorphism from V to itself, then the supertrace of T, str is defined by the following tangle diagram:...
 is the generalization of a trace to the setting of superalgebra
Superalgebra

In mathematics and theoretical physics, a superalgebra is a Z2-graded algebra. That is, it is an algebra over a commutative ring or field with a decomposition into "even" and "odd" pieces and a multiplication operator that respects the grading....
s.

The operation of tensor contraction
Tensor contraction

In multilinear algebra, a tensor contraction is an operation on one or more tensors that arises from the Bilinear form#different spaces of a finite-dimensional vector space and its dual vector space....
 generalizes the trace to arbitrary tensors.

Coordinate-Free Definition

We can identify the space of linear operators on a vector space V with the space , where . We also have a canonical bilinear function that consists of applying an element of to an element of to get an element of . This induces a linear function on the tensor product (by its universal property), which, as it turns out, when that tensor product is viewed as the space of operators, is equal to the trace.

For V finite-dimensional, with basis and dual basis , then is the entry of the matrix of the operator with respect to that basis, so we can write any operator as a sum of these. Notice that maps to 1 iff and 0 otherwise under the trace operator, and from this it's not too hard to see that this coordinate-free definition corresponds to the 'high-school' definition of trace.

See also

  • Trace class
    Trace class

    In mathematics, a trace class operator is a compact operator for which a Trace may be defined, such that the trace is finite and independent of the choice of basis....
  • Field trace
    Field trace

    In mathematics, the field trace is a linear mapping defined for certain field extensions. If L/K is a finite Galois extension, it is defined for a in L as the sum of all the Conjugate element ...
  • Golden-Thompson inequality
    Golden-Thompson inequality

    In mathematics, the Golden?Thompson inequality says that for Hermitian matrix A and B,where tr is the trace , and eA is the matrix exponential....
  • Characteristic function
    Characteristic function (probability theory)

    In probability theory, the characteristic function of any random variable completely defines its probability distribution. On the real number line it is given by the following formula, where X is any random variable with the distribution in question:...


External links