All Topics  
Level of measurement

 

   Email Print
   Bookmark   Link






 

Level of measurement



 
 
The "levels of measurement" is an expression which typically refers to the theory of scale types developed by the Harvard psychologist Stanley Smith Stevens
Stanley Smith Stevens

Stanley Smith Stevens was an United States psychologist who founded Harvard's Psycho-Acoustic Laboratory and is credited with the introduction of Stevens' power law....
. Stevens proposed his theory in a 1946 article titled "On the theory of scales of measurement" in the journal Science. In this article Stevens claimed that all measurement in science was conducted using four different types of numerical scales which he called "nominal", "ordinal", "interval" and "ratio".

Scale types and Stevens' "operational theory of measurement"
The theory of scale types is the intellectual handmaiden to Stevens' "operational theory of measurement," which was to become definitive within psychology and the behavioral sciences, despite it being quite at odds with the understanding of measurement held in the natural sciences (Michell, 1999).






Discussion
Ask a question about 'Level of measurement'
Start a new discussion about 'Level of measurement'
Answer questions from other users
Full Discussion Forum



Encyclopedia


The "levels of measurement" is an expression which typically refers to the theory of scale types developed by the Harvard psychologist Stanley Smith Stevens
Stanley Smith Stevens

Stanley Smith Stevens was an United States psychologist who founded Harvard's Psycho-Acoustic Laboratory and is credited with the introduction of Stevens' power law....
. Stevens proposed his theory in a 1946 article titled "On the theory of scales of measurement" in the journal Science. In this article Stevens claimed that all measurement in science was conducted using four different types of numerical scales which he called "nominal", "ordinal", "interval" and "ratio".

Scale types and Stevens' "operational theory of measurement"


The theory of scale types is the intellectual handmaiden to Stevens' "operational theory of measurement," which was to become definitive within psychology and the behavioral sciences, despite it being quite at odds with the understanding of measurement held in the natural sciences (Michell, 1999). Essentially, the operational theory of measurement was a reaction to the conclusions of a committee established in 1932 by the British Association for the Advancement of Science to investigate the possibility of genuine scientific measurement in the psychological and behavioral sciences. This committee, which became known as the Ferguson committee, published a Final Report (Ferguson, et al, 1940, p.245) in which Stevens' sone scale (Stevens & Davis, 1938) was an object of criticism:

...any law purporting to express a quantitative relation between sensation intensity and stimulus intensity is not merely false but is in fact meaningless unless and until a meaning can be given to the concept of addition as applied to sensation.

That is, if Stevens' sone scale was genuinely measuring the intensity of auditory sensations, then evidence for such sensations being quantitative attributes must be produced. The evidence needed was the presence of additive structure - a concept comprehensively treated by the German mathematician Otto Hölder
Otto Hölder

Otto Ludwig H?lder was a Germany mathematician born in Stuttgart.H?lder first studied at the Polytechnikum and then in 1877 went to Berlin where he was a student of Leopold Kronecker, Karl Weierstra?, and Ernst Kummer....
 (Hölder, 1901). Given the physicist and measurement theorist Norman Robert Campbell dominated the Ferguson committee's deliberations, the committee concluded that real measurement in the social sciences was impossible due to the lack of concatenation operations. This conclusion was later rendered false by the discovery of the theory of conjoint measurement
Theory of conjoint measurement

In the theory of conjoint measurement, the quantitative structure of natural attributes can be discovered in the absence of natural Concatenation Operation ....
 by Debreu (1960) and independently by Luce & Tukey (1964). However, Stevens' reaction was not to conduct experiments to test for the presence of additive structure in sensations, but instead to render the conclusions of the Ferguson committee null and void by proposing a new theory of measurement:

Stevens was greatly influenced by the ideas of another Harvard academic, the Nobel laureate physicist Percy Bridgman (1927), whose doctrine of operationism Stevens used to define measurement. In Stevens' definition for example, it is the use of a tape measure that defines length (the object of measurement) as being measurable (and so by implication quantitative). However, this is the critical logical flaw within operationism, which is the mistake of confusing what is known (length) with how it is known (the use of a tape measure). In more formal terms, operationism confuses the relations between two objects or events for properties of one of those of objects or events (Hardcastle, 1995; Michell, 1999; Moyer, 1981a,b; Rogers, 1989). Despite this fatal logical flaw and its descrediting of Stevens' theory of measurement, the latter has remained entrenched in the behavioural sciences to an such extent that Stevens (1975) began to complain that his idea was not being sufficiently accredited to him.

The Canadian measurement theorist William Rozeboom (1966) was an early and trenchant critic of Stevens' theory of scale types. But it was not until much later with the work of mathematical psychologists Theodore Alper (1985, 1987), Louis Narens (1981a, b) and R. Duncan Luce
R. Duncan Luce

Robert Duncan Luce is the Distinguished Research Professor of Cognitive Science at the University of California, Irvine.Luce received a B.S. in Aeronautical Engineering from the Massachusetts Institute of Technology in 1945, and PhD in Mathematics from the same university in 1950....
 (1986, 1987, 2001) did the concept of scale types receive the mathematical rigour which it lacked at its inception. As Luce (1997, p.395) bluntly stated:

The theory of scale types


Stevens (1946, 1951) proposed that measurement was conducted using four different types of scales. These were:
  • nominal
  • ordinal
  • interval
  • ratio


Scale Type Permissable Statistics Admissible Scale Transformation Mathematical structure
nominal mode
Mode (statistics)

In statistics, the mode is the value that occurs the most frequently in a data set or a probability distribution. In some fields, notably education, sample data are often called scores, and the sample mode is known as the modal score....
, chi square
One to One (equality
Equality (mathematics)

Equality is the paradigmatic example of the more general concept of equivalence relations on a set: those binary relations which are reflexive relation, symmetric relation, and transitive relation....
 (=))
standard set structure (unordered)
ordinal median
Median

In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
, percentile
Percentile

A percentile is the value of a variable below which a certain percentage of observations fall. So the 20th percentile is the value below which 20 percent of the observations may be found....
Monotonic increasing (order
Total order

In mathematics and set theory, a total order, linear order, simple order, or ordering is a binary relation on some Set X....
 (<))
totally ordered set
interval mean
Mean

In statistics, mean has two related meanings:* the arithmetic mean .* the expected value of a random variable, which is also called the population mean....
, standard deviation
Standard deviation

In statistics, standard deviation is a simple measure of the variability or statistical dispersion of a data set. A low standard deviation indicates that all of the data points are very close to the same value , while high standard deviation indicates that the data are ?spread out? over a large range of values....
, correlation
Correlation

In probability theory and statistics, correlation indicates the strength and direction of a linear relationship between two random variables....
, regression
Regression

Regression could refer to:* Regression , a defensive reaction to some unaccepted impulses* Past life regression, a process claiming to retrieve memories of previous lives...
, analysis of variance
Analysis of variance

In statistics, analysis of variance is a collection of statistical models, and their associated procedures, in which the observed variance is partitioned into components due to different explanatory variables....
Positive linear (affine
Affine

Affine may refer to:*Affine cipher, a special case of the more general substitution cipher*Affine combination, a certain kind of constrained linear combination...
)
affine line
ratio All statistics permitted for interval scales plus the following: geometric mean
Geometric mean

The geometric mean, in mathematics, is a type of mean or average, which indicates the central tendency or typical value of a set of numbers. It is similar to the arithmetic mean, which is what most people think of with the word "average," except that instead of adding the set of numbers and then dividing the sum by the count of numbers in the...
, harmonic mean
Harmonic mean

In mathematics, the harmonic mean is one of several kinds of average. Typically, it is appropriate for situations when the average of Rate s is desired....
, coefficient of variation
Coefficient of variation

In probability theory and statistics, the coefficient of variation is a normalization measure of statistical dispersion of a probability distribution....
, logarithms
Positive similarities (multiplication
Multiplication

Multiplication is the Operation of scaling one number by another. It is one of the four basic operations in elementary arithmetic .Multiplication is defined for Natural number in terms of repeated addition; for example, 4 multiplied by 3 can be calculated by adding 3 copies of 4 together:...
)
field
Field (mathematics)

In abstract algebra, a field is an algebraic structure with notions of addition, subtraction, multiplication and division , satisfying certain axioms....


Nominal scale

Nominal scales are mere codes assigned to objects as labels, they are not measurements. For example, rocks can be generally categorized as (1) igneous, (2) sedimentary and (3) metamorphic
Metamorphic

The term Metamorphic can be associated with a number of meanings:*Metamorphic rock: The term for rocks that have been transformed by extreme heat and pressure....
. A code of "3" given to any particular stone observed does not mean that stone possesses more "rockness" than a stone coded as "1", anymore than a person with red hair does not possess more "hairness" than a person with blonde hair.

This is also called a categorical variable; see also categorical data.

Stevens (1946, p.679) must have known that claiming nominal scales to measure obviously non-quantitative things would have attracted criticism, so he invoked his theory of measurement to justify nominal scales as measurement:

The only kind of measure of central tendency that remains invariant under one-one transformations is the mode. The median and mean cannot be defined.

Ordinal scale


In this scale type, the numbers assigned to objects or events represent the rank order (1st, 2nd, 3rd etc.) of the entities assessed. An example of ordinal measurement is the results of a horse race, which say only which horses arrived first, second, third, etc. but include no information about times. Another is the Mohs scale of mineral hardness
Mohs scale of mineral hardness

Not to be confused with Siemens_#Mho, a unit of electric conductance.The Mohs scale of mineral hardness characterizes the scratch resistance of various minerals through the ability of a harder material to scratch a softer material....
, which characterizes the hardness of various minerals through the ability of a harder material to scratch a softer one, saying nothing about the actual hardness of any of them. Interestingly, Stevens' writings betrayed a critical view of psychometrics
Psychometrics

Psychometrics is the field of study concerned with the theory and technique of educational and psychological measurement, which includes the measurement of knowledge, abilities, attitudes, and Wiktionary:personality traits....
 as he argued:

Psychometricians like to theorise that psychometric tests produce interval scale measures of cognitive abilities (e.g. Lord & Novick, 1968; von Eye, 2005) but there is little prima facie evidence to suggest that such attributes are anything more than ordinal (Cliff & Keats, 2000; Michell, 2008).

The central tendency of an ordinal attribute can be represented by its mode or its median
Median

In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
, but the mean cannot be defined.

Interval scale


Quantitive attributes are all able to be measured on interval scales, as any difference between the levels of an attribute can be multiplied by any real number to exceed or equal another difference. A highly familiar example of interval scale measurement is temperature with the Celsius scale. In this particular scale, the unit of measurement is 1/100 of the difference between the melting temperature and the boiling temperature of water in atmospheric pressure. The "zero point" on an interval scale is arbitrary; and negative values can be used. The formal mathematical term is an affine space
Affine space

In mathematics, an affine space is a geometric structure that generalizes the affine geometry properties of Euclidean space. It can be thought of informally as a vector space where one has forgotten which point is the origin....
 (in this case an affine line). Variables measured at the interval level are called "interval variables" or sometimes "scaled variables" as they have units of measurement
Units of measurement

The definition, agreement and practical use of units of measurement have played a crucial role in human endeavour from early ages up to this day....
.

Ratio
Ratio

A ratio is an expression which compares quantities relative to each other. The most common examples involve two quantities, but in theory any number of quantities can be compared....
s between numbers on the scale are not meaningful, so operations such as multiplication and division cannot be carried out directly. But ratios of
differences can be expressed; for example, one difference can be twice another.

The central tendency of a variable measured at the interval level can be represented by its mode
Mode (statistics)

In statistics, the mode is the value that occurs the most frequently in a data set or a probability distribution. In some fields, notably education, sample data are often called scores, and the sample mode is known as the modal score....
, its median
Median

In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
, or its arithmetic mean
Arithmetic mean

In mathematics and statistics, the arithmetic mean of a list of numbers is the sum of all of the list divided by the number of items in the list....
. Statistical dispersion can be measured in most of the usual ways, which just involved differences or averaging, such as range
Range (statistics)

In descriptive statistics, the range is the length of the smallest interval which contains all the data. It is calculated by subtracting the smallest observation from the greatest and provides an indication of statistical dispersion....
, interquartile range
Interquartile range

In descriptive statistics, the interquartile range , also called the midspread, middle fifty and middle of the #s, is a measure of statistical dispersion, being equal to the difference between the third and first quartiles....
, and standard deviation
Standard deviation

In statistics, standard deviation is a simple measure of the variability or statistical dispersion of a data set. A low standard deviation indicates that all of the data points are very close to the same value , while high standard deviation indicates that the data are ?spread out? over a large range of values....
. Since one cannot divide, one cannot define measures that require a ratio, such as studentized range
Studentized range

In statistics, the studentized range computed from a list x1, ..., xn of numbers iswhereis the sample variance and...
 or coefficient of variation
Coefficient of variation

In probability theory and statistics, the coefficient of variation is a normalization measure of statistical dispersion of a probability distribution....
. More subtly, while one can define moments
Moment (mathematics)

The concept of moment in mathematics evolved from the concept of moment in physics. The nth moment of a real-valued function f of a real variable about a value c is...
 about the origin, only
central moments
Central moment

In probability theory and statistics, the kth moment about the mean of a real-valued random variable X is the quantity μk := E[k], where E is the expected value....
 are useful, since the choice of origin is arbitrary and not meaningful. One can define standardized moment
Standardized moment

In probability theory and statistics, the kthstandardized moment of a probability distribution is where is the kth moment about the mean and σ is the standard deviation....
s, since ratios of
differences are meaningful, but one cannot define coefficient of variation
Coefficient of variation

In probability theory and statistics, the coefficient of variation is a normalization measure of statistical dispersion of a probability distribution....
, since the mean is a moment about the origin, unlike the standard deviation, which is (the square root of) a central moment.

Ratio measurement


Most measurement in the physical sciences and engineering is done on ratio scales. Mass, length, time, plane angle, energy and electric charge are examples of physical measures that are ratio scales. The scale type takes its name from the fact that measurement is the estimation of the ratio between a magnitude of a continuous quantity and a unit magnitude of the same kind (Michell, 1997, 1999). Informally, the distinguishing feature of a ratio scale is the possession of a non-arbitrary zero value. For example, the Kelvin
Kelvin

The kelvin is a Units of measurement of temperature and is one of the seven SI base units. The Kelvin scale is a Thermodynamic temperature scale where absolute zero, the theoretical absence of all thermal energy, is zero ....
 temperature scale has a non-arbitrary zero point of absolute zero
Absolute zero

Absolute zero is a temperature marked by a 0 entropy configuration. It is the coldest temperature theoretically possible, and cannot be reached, by artificial or natural means....
, which is denoted 0K and is equal to -273.15 degrees Celsius. This zero point is non arbitrary as the particles which comprise matter at this temperature have zero kinetic energy.

Examples of ratio scale measurement in the behavioural sciences are all but non-existant. Luce (2000) argues that an example of ratio scale measurement in psychology can be found in rank and sign dependent expected utility theory.

All statistical measures can be used for a variable measured at the ratio level, as all necessary mathematical operations are defined. The central tendency of a variable measured at the ratio level can be represented by, in addition to its mode
Mode (statistics)

In statistics, the mode is the value that occurs the most frequently in a data set or a probability distribution. In some fields, notably education, sample data are often called scores, and the sample mode is known as the modal score....
, its median
Median

In probability theory and statistics, a median is described as the number separating the higher half of a sample, a population, or a probability distribution, from the lower half....
, or its arithmetic mean
Arithmetic mean

In mathematics and statistics, the arithmetic mean of a list of numbers is the sum of all of the list divided by the number of items in the list....
, also its geometric mean
Geometric mean

The geometric mean, in mathematics, is a type of mean or average, which indicates the central tendency or typical value of a set of numbers. It is similar to the arithmetic mean, which is what most people think of with the word "average," except that instead of adding the set of numbers and then dividing the sum by the count of numbers in the...
 or harmonic mean
Harmonic mean

In mathematics, the harmonic mean is one of several kinds of average. Typically, it is appropriate for situations when the average of Rate s is desired....
. In addition to the measures of statistical dispersion defined for interval variables, such as range
Range (statistics)

In descriptive statistics, the range is the length of the smallest interval which contains all the data. It is calculated by subtracting the smallest observation from the greatest and provides an indication of statistical dispersion....
 and standard deviation
Standard deviation

In statistics, standard deviation is a simple measure of the variability or statistical dispersion of a data set. A low standard deviation indicates that all of the data points are very close to the same value , while high standard deviation indicates that the data are ?spread out? over a large range of values....
, for ratio variables one can also define measures that require a ratio, such as studentized range
Studentized range

In statistics, the studentized range computed from a list x1, ..., xn of numbers iswhereis the sample variance and...
 or coefficient of variation
Coefficient of variation

In probability theory and statistics, the coefficient of variation is a normalization measure of statistical dispersion of a probability distribution....
.

Debate on classification scheme

There has been, and continues to be, debate about the merits of the classifications, particularly in the cases of the nominal and ordinal classifications (Michell, 1986). Thus, while Stevens' classification is widely adopted, it is by no means universally accepted (for example, Velleman & Wilkinson, 1993).

Duncan (1986) observed that Stevens' classification
nominal measurement is contrary to his own definition of measurement. Stevens (1975) said on his own definition of measurement that "the assignment can be any consistent rule. The only rule not allowed would be random assignment, for randomness amounts in effect to a nonrule". However, so-called nominal measurement involves arbitrary assignment, and the "permissible transformation" is any number for any other. This is one of the points made in Lord's (1953) satirical paper On the Statistical Treatment of Football Numbers.

Among those who accept the classification scheme, there is also some controversy in behavioural sciences over whether the mean is meaningful for ordinal measurement. In terms of measurement theory, it is not, because the arithmetic operations are not made on numbers that are measurements in units, and so the results of computations do not give numbers in units. However, many behavioural scientists use means for ordinal data anyway. This is often justified on the basis that ordinal scales in behavioural science are really somewhere between true ordinal and interval scales; although the interval difference between two ordinal ranks is not constant, it is often of the same order of magnitude. For example, applications of measurement models in educational contexts often indicate that total scores have a fairly linear relationship with measurements across a range of an assessment. Thus, some argue, that so long as the unknown interval difference between ordinal scale ranks is not too variable, interval scale statistics such as means can meaningfully be used on ordinal scale variables. Statistical analysis software such as PSPP
PSPP

PSPP is a free software application for analysis of sampled data. It has a graphical user interface and conventional command line interface. It is written in C , uses GNU Scientific Library for its mathematical routines, and plotutils for generating graphs....
 require the user to select the appropriate measurement class for each variable. This ensures that subsequent user errors cannot inadvertently perform meaningless analyses (for example correlation analysis with a variable on a nominal level).

L. L. Thurstone made progress toward developing a justification for obtaining interval-level measurements based on the law of comparative judgment
Law of comparative judgment

The law of comparative judgment was conceived by L. L. Thurstone. In modern day terminology, it is more aptly described as a model that is used to obtain measurements from any process of pairwise comparison....
. Further progress was made by Georg Rasch
Georg Rasch

Georg Rasch was a Danish mathematician, statistician, and psychometrician, most famous for the development of a class of measurement models known as Rasch models....
(1960), who developed the probabilistic Rasch model
Rasch model

Rasch models are used for analysing data from assessments to measure things such as abilities, attitudes, and personality traits. For example, they may be used to estimate a student's reading ability from answers to questions on a reading assessment, or the extremity of a person's attitude to capital punishment from responses on a questionnai...
 which provides a theoretical basis and justification for obtaining interval-level measurements from counts of observations such as total scores on assessments.

Another issue is derived from Nicholas R. Chrisman's article "Rethinking Levels of Measurement for Cartography", in which he introduces an expanded list of levels of measurement to account for various measurements that do not necessarily fit with the traditional notion of levels of measurement. Measurements bound to a range and repeat (like degrees in a circle, time, etc), graded membership categories, and other types of measurement do not fit to Steven's original work, leading to the introduction of 6 new levels of measurement leading to: (1) Nominal, (2) Graded membership, (3) Ordinal, (4) Interval, (5) Log-Interval, (6) Extensive Ratio, (7) Cyclical Ratio, (8) Derived Ratio, (9) Counts and finally (10) Absolute. The extended levels of measurement are rarely used outside of academic geography.

See also

  • Quantitative
    Quantitative

    A quantitative attribute is one that exists in a range of magnitudes, and can therefore be measurement. Measurements of any particular quantitative property are expressed as a specific quantity, referred to as a Unit of measurement, multiplied by a number....
  • Qualitative data


External links

  • [ftp://ftp.sas.com/pub/neural/measurement.html Measurement theory: Frequently asked questions]