All Topics  
Statistics

 
Statistics

   Email Print
   Bookmark   Link






 

Statistics



 
 
Statistics is a mathematical science
Mathematics

Mathematics is the study of quantity, structure, space, change, and related topics of pattern and form. Mathematicians seek out patterns whether found in numbers, space, natural science, computers, imaginary abstractions, or elsewhere....
 pertaining to the collection, analysis, interpretation or explanation, and presentation of data
DATA

Debt, AIDS, Trade in Africa is a multinational Non-governmental organization founded in January 2002 in London by U2's Bono along with Robert Sargent Shriver III and activists from the Jubilee 2000 Drop the Debt campaign....
. It also provides tools for prediction and forecasting based on data. It is applicable to a wide variety of academic disciplines, from the natural
Natural science

In science, the term natural science refers to a methodological naturalism approach to the study of the universe, which is understood as obeying rules or law of nature origin....
 and social sciences to the humanities
Humanities

The humanities are academic disciplines which study the human condition, using methods that are primarily analytic, critical, or speculative, as distinguished from the mainly empirical approaches of the natural science and social sciences....
, government and business.

Statistical methods can be used to summarize or describe a collection of data; this is called descriptive statistics
Descriptive statistics

Descriptive Statistics are used to describe the basic features of the data gathered from an experimental study in various ways. A descriptive Statistics is distinguished from inductive statistics....
.






Discussion
Ask a question about 'Statistics'
Start a new discussion about 'Statistics'
Answer questions from other users
Full Discussion Forum



Quotations


A new statistic proves that 73% of all statistics are pure inventions.

A single death is a tragedy; a million deaths is a statistic.

A statistician is someone who is good with numbers but lacks the personality to become an accountant.

He uses statistics as a drunken man uses lampposts - for support rather than illumination.

I could prove God statistically.

If your experiment needs statistics, you ought to have done a better experiment.






Encyclopedia


Statistics is a mathematical science
Mathematics

Mathematics is the study of quantity, structure, space, change, and related topics of pattern and form. Mathematicians seek out patterns whether found in numbers, space, natural science, computers, imaginary abstractions, or elsewhere....
 pertaining to the collection, analysis, interpretation or explanation, and presentation of data
DATA

Debt, AIDS, Trade in Africa is a multinational Non-governmental organization founded in January 2002 in London by U2's Bono along with Robert Sargent Shriver III and activists from the Jubilee 2000 Drop the Debt campaign....
. It also provides tools for prediction and forecasting based on data. It is applicable to a wide variety of academic disciplines, from the natural
Natural science

In science, the term natural science refers to a methodological naturalism approach to the study of the universe, which is understood as obeying rules or law of nature origin....
 and social sciences to the humanities
Humanities

The humanities are academic disciplines which study the human condition, using methods that are primarily analytic, critical, or speculative, as distinguished from the mainly empirical approaches of the natural science and social sciences....
, government and business.

Statistical methods can be used to summarize or describe a collection of data; this is called descriptive statistics
Descriptive statistics

Descriptive Statistics are used to describe the basic features of the data gathered from an experimental study in various ways. A descriptive Statistics is distinguished from inductive statistics....
. In addition, patterns in the data may be modeled
Mathematical model

A mathematical model uses mathematics language to describe a system. Mathematical models are used not only in the natural sciences and engineering disciplines but also in the social sciences ; physicists, engineers, computer sciences, and economists use mathematical models most extensively....
 in a way that accounts for randomness and uncertainty in the observations, and are then used to draw inferences about the process or population being studied; this is called inferential statistics. Descriptive, predictive, and inferential statistics comprise applied statistics.

There is also a discipline called mathematical statistics
Mathematical statistics

Mathematical statistics is the study of statistics from a purely mathematical standpoint, using probability theory as well as other branches of mathematics such as linear algebra and mathematical analysis....
, which is concerned with the theoretical basis of the subject. Moreover, there is a branch of statistics called exact statistics
Exact statistics

Exact statistics, such as that described in exact test, is a branch of statistics that was developed to provide more accurate results pertaining to statistical testing and interval estimation by eliminating procedures based on asymptotic analysis and approximate statistical methods....
 that is based on exact probability statements.

The word statistics can either be singular or plural. In its singular form, statistics refers to the mathematical science discussed in this article. In its plural form, statistics is the plural of the word statistic
Statistic

A statistic is the result of applying a function to a Data set.More formally, statistical theory defines a statistic as a function of a sample where the function itself is independent of the sample's distribution: the term is used both for the function and for the value of the function on a given sample....
, which refers to a quantity (such as a mean
Mean

In statistics, mean has two related meanings:* the arithmetic mean .* the expected value of a random variable, which is also called the population mean....
) calculated from a set of data.

History



"Five men, Conring
Hermann Conring

Hermann Conring was a German intellectual. He made significant contributions to the study of medicine, politics and law.Descended from Lutheran clergy on both sides of his family, second-youngest of ten children, Conring showed early promise as a student....
, Achenwall
Gottfried Achenwall

Gottfried Achenwall was a Germany philosophy and statistician. He is counted among the inventors of statistics.He was born in Elblag . Beginning in 1738 he studied in University of Jena, University of Halle, again Jena and University of Leipzig....
, Süssmilch
Johann Peter Süssmilch

Johann Peter S??milch was a Germany priest, statistician and demographer.He studied medicine and theology at University of Jena and University of Halle and in 1741 was an army chaplain in the Silesian Wars....
, Graunt
John Graunt

John Graunt was one of the first demographers, though by profession he was a haberdasher. Born in London, Graunt, along with William Petty, developed early human statistical and census methods that later provided a framework for modern demography....
 and Petty
William Petty

Sir William Petty was an England economist, scientist and philosopher. He first became prominent serving Oliver Cromwell and Commonwealth of England in Ireland....
 have been honored by different writers as the founder of statistics."
claims one source (Willcox, Walter (1938) The Founder of Statistics. Review of the International Statistical Institute
International Statistical Institute

The International Statistical Institute is a professional association of statisticians. It publishes a variety of books and journals, and holds an international conference every two years....
 5(4):321-328.)

Some scholars pinpoint the origin of statistics to 1662, with the publication of "Observations on the Bills of Mortality" by John Graunt
John Graunt

John Graunt was one of the first demographers, though by profession he was a haberdasher. Born in London, Graunt, along with William Petty, developed early human statistical and census methods that later provided a framework for modern demography....
. Early applications of statistical thinking revolved around the needs of states to base policy on demographic and economic data, hence its stat- etymology
History of statistics

Statistics arose, no later than the 18th century, from the need of states to collect data on their people and economies, in order to administer them. Its meaning broadened in the early 19th century to include the collection and analysis of data in general....
. The scope of the discipline of statistics broadened in the early 19th century to include the collection and analysis of data in general. Today, statistics is widely employed in government, business, and the natural and social sciences.

Because of its empirical roots and its applications, statistics is generally considered not to be a subfield of pure mathematics, but rather a distinct branch of applied mathematics. Its mathematical foundations were laid in the 17th century with the development of probability theory
Probability theory

Probability theory is the branch of mathematics concerned with analysis of Statistical randomness phenomena. The central objects of probability theory are random variables, stochastic processes, and event s: mathematical abstractions of determinism events or measured quantities that may either be single occurrences or evolve over time in an a...
 by Pascal
Pascal

Pascal or PASCAL may refer to:...
 and Fermat. Probability theory arose from the study of games of chance. The method of least squares was first described by Carl Friedrich Gauss
Carl Friedrich Gauss

Johann Carl Friedrich Gauss. was a Germans mathematician and scientist who contributed significantly to many fields, including number theory, statistics, mathematical analysis, Differential geometry and topology, geodesy, electrostatics, astronomy and optics....
 around 1794. The use of modern computer
Computer

A computer is a machine that manipulates Data according to a list of Code .The first devices that resemble modern computers date to the mid-20th century , although the computer concept and various machines similar to computers existed earlier....
s has expedited large-scale statistical computation, and has also made possible new methods that are impractical to perform manually.

Overview


In applying statistics to a scientific, industrial, or societal problem, it is necessary to begin with a process or population
Statistical population

In statistics, a statistical population is a Set of entities concerning which statistical inferences are to be drawn, often based on a random sample taken from the population....
 to be studied. This might be a population of people in a country, of crystal grains in a rock, or of goods manufactured by a particular factory during a given period. It may instead be a process observed at various times; data collected about this kind of "population" constitute what is called a time series
Time series

In statistics, signal processing, and many other fields, a time series is a sequence of data points, measured typically at successive times, spaced at time intervals....
.

For practical reasons, rather than compiling data about an entire population, a chosen subset of the population, called a sample
Sampling (statistics)

Sampling is that part of statistical practice concerned with the selection of individual observations intended to yield some knowledge about a population of concern, especially for the purposes of statistical inference....
, is studied. Data are collected about the sample in an observational or experiment
Experiment

In scientific inquiry, an experiment is a method of investigating causal relationships among variables. An experiment is a cornerstone of the empiricism approach to acquiring data about the world and is used in both natural sciences and social sciences....
al setting. The data are then subjected to statistical analysis, which serves two related purposes: description and inference.
  • Descriptive statistics
    Descriptive statistics

    Descriptive Statistics are used to describe the basic features of the data gathered from an experimental study in various ways. A descriptive Statistics is distinguished from inductive statistics....
     can be used to summarize the data, either numerically or graphically, to describe the sample. Basic examples of numerical descriptors include the mean
    Mean

    In statistics, mean has two related meanings:* the arithmetic mean .* the expected value of a random variable, which is also called the population mean....
     and standard deviation
    Standard deviation

    In statistics, standard deviation is a simple measure of the variability or statistical dispersion of a data set. A low standard deviation indicates that all of the data points are very close to the same value , while high standard deviation indicates that the data are ?spread out? over a large range of values....
    . Graphical summarizations include various kinds of charts and graphs.
  • Inferential statistics is used to model patterns in the data, accounting for randomness and drawing inferences about the larger population. These inferences may take the form of answers to yes/no questions (hypothesis testing), estimates of numerical characteristics (estimation
    Estimation

    Estimation is the calculation approximation of a result which is usable even if input data may be incomplete or uncertainty.In statistics, see estimation theory, estimator....
    ), descriptions of association (correlation
    Correlation

    In probability theory and statistics, correlation indicates the strength and direction of a linear relationship between two random variables....
    ), or modeling of relationships (regression
    Regression analysis

    In statistics, regression analysis is a collective name for techniques for the modeling and analysis of numerical data consisting of values of a dependent variable and of one or more independent variables ....
    ). Other modeling
    Mathematical model

    A mathematical model uses mathematics language to describe a system. Mathematical models are used not only in the natural sciences and engineering disciplines but also in the social sciences ; physicists, engineers, computer sciences, and economists use mathematical models most extensively....
     techniques include ANOVA, time series
    Time series

    In statistics, signal processing, and many other fields, a time series is a sequence of data points, measured typically at successive times, spaced at time intervals....
    , and data mining
    Data mining

    Data mining is the process of extracting hidden patterns from data. As more data is gathered, with the amount of data doubling every three years, data mining is becoming an increasingly important tool to transform this data into information....
    .


The concept of correlation is particularly noteworthy. Statistical analysis of a data set
Data set

A data set is a collection of data, usually presented in tabular form. Each column represents a particular variable. Each row corresponds to a given member of the data set in question....
 may reveal that two variables (that is, two properties of the population under consideration) tend to vary together, as if they are connected. For example, a study of annual income and age of death among people might find that poor people tend to have shorter lives than affluent people. The two variables are said to be correlated (which is a positive correlation in this case). However, one cannot immediately infer the existence of a causal relationship between the two variables. (See Correlation does not imply causation.) The correlated phenomena could be caused by a third, previously unconsidered phenomenon, called a lurking variable
Lurking variable

In statistics, a confounding variable is an extraneous variable in a statistical model that correlates with both the dependent variable and the independent variable....
 or confounding variable.

If the sample is representative of the population, then inferences and conclusions made from the sample can be extended to the population as a whole. A major problem lies in determining the extent to which the chosen sample is representative. Statistics offers methods to estimate and correct for randomness in the sample and in the data collection procedure, as well as methods for designing robust experiments in the first place. (See experimental design.)

The fundamental mathematical concept employed in understanding such randomness is probability
Probability

Probability, or wikt:chance, is a way of expressing knowledge or belief that an Event will occur or has occurred. In mathematics the concept has been given an exact meaning in probability theory, that is used extensively in such areas of study as mathematics, statistics, finance, gambling, science, and philosophy to draw conclusions about t...
. Mathematical statistics
Mathematical statistics

Mathematical statistics is the study of statistics from a purely mathematical standpoint, using probability theory as well as other branches of mathematics such as linear algebra and mathematical analysis....
 (also called statistical theory
Statistical theory

The theory of statistics includes a number of topics:Statistical models of the sources of data and typical problem formulation:#survey sampling from a finite population...
) is the branch of applied mathematics
Applied mathematics

Applied mathematics is a branch of mathematics that concerns itself with the mathematical techniques typically used in the application of mathematical knowledge to other domains....
 that uses probability theory and analysis
Mathematical analysis

Mathematical analysis, which mathematicians refer to simply as analysis, has its beginnings in the rigorous formulation of calculus. It is the branch of mathematics most explicitly concerned with the notion of a limit , whether the limit of a sequence or the limit of a function....
 to examine the theoretical basis of statistics.

The use of any statistical method is valid only when the system or population under consideration satisfies the basic mathematical assumptions of the method. Misuse of statistics
Misuse of statistics

A misuse of statistics occurs when a statistical argument asserts a falsehood. In some cases, the misuse may be accidental. In others, it is purposeful and for the gain of the perpetrator....
 can produce subtle but serious errors in description and interpretation — subtle in the sense that even experienced professionals sometimes make such errors, serious in the sense that they may affect, for instance, social policy, medical practice and the reliability of structures such as bridges. Even when statistics is correctly applied, the results can be difficult for the non-expert to interpret. For example, the statistical significance
Statistical significance

In statistics, a result is called statistically significant if it is unlikely to have occurred by chance. "A statistically significant difference" simply means there is statistical evidence that there is a difference; it does not mean the difference is necessarily large, important, or significant in the common meaning of the word....
 of a trend in the data, which measures the extent to which the trend could be caused by random variation in the sample, may not agree with one's intuitive sense of its significance. The set of basic statistical skills (and skepticism) needed by people to deal with information in their everyday lives is referred to as statistical literacy
Statistical literacy

Statistical literacy is a term used to describe an individual's or group's ability to understand statistics. Statistical literacy is necessary for citizens to understand material presented in publications such as newspapers, television, the internet....
.

Statistical methods


Experimental and observational studies

A common goal for a statistical research project is to investigate causality
Causality

Causality denotes a necessary relationship between one event and another event which is the direct consequence of the first.While this informal understanding suffices in everyday use, the Philosophy analysis of how best to characterize causality extends over millennia....
, and in particular to draw a conclusion on the effect of changes in the values of predictors or independent variable
Independent variable

The terms "dependent variable" and "independent variable" are used in similar but subtly different ways in mathematics and statistics as part of the standard terminology in those subjects....
s on dependent variables or response. There are two major types of causal statistical studies: experimental studies and observational studies. In both types of studies, the effect of differences of an independent variable (or variables) on the behavior of the dependent variable are observed. The difference between the two types lies in how the study is actually conducted. Each can be very effective.

An experimental study involves taking measurements of the system under study, manipulating the system, and then taking additional measurements using the same procedure to determine if the manipulation has modified the values of the measurements. In contrast, an observational study does not involve experimental manipulation. Instead, data are gathered and correlations between predictors and response are investigated.

An example of an experimental study is the famous Hawthorne studies, which attempted to test the changes to the working environment at the Hawthorne plant of the Western Electric Company. The researchers were interested in determining whether increased illumination would increase the productivity of the assembly line
Assembly line

An assembly line is a manufacturing process in which parts are added to a product in a sequential manner using optimally planned logistics to create a finished product much faster than with handcrafting-type methods....
 workers. The researchers first measured the productivity in the plant, then modified the illumination in an area of the plant and checked if the changes in illumination affected the productivity. It turned out that the productivity indeed improved (under the experimental conditions). (See Hawthorne effect
Hawthorne effect

The Hawthorne effect is a form of reactivity ,The term was coined in 1955 by Henry A. Landsberger when analyzing older experiments from 1924-1932 at the Hawthorne Works ....
.) However, the study is heavily criticized today for errors in experimental procedures, specifically for the lack of a control group and blindness
Double-blind

The blind method is a part of the scientific method, used to prevent research outcomes from being influenced by either the placebo effect or the observer bias....
.

An example of an observational study is a study which explores the correlation between smoking and lung cancer. This type of study typically uses a survey to collect observations about the area of interest and then performs statistical analysis. In this case, the researchers would collect observations of both smokers and non-smokers, perhaps through a case-control study, and then look for the number of cases of lung cancer in each group.

The basic steps of an experiment are;
  1. Planning the research, including determining information sources, research subject selection, and ethical
    Ethics

    Ethics is a word for a philosophy that encompasses proper conduct and good living. It is significantly broader than the common conception of ethics as the analyzing of right and wrong....
     considerations for the proposed research and method.
  2. Design of experiments
    Design of experiments

    Design of experiments, or experimental design, is the design of all information-gathering exercises where variation is present, whether under the full control of the experimenter or not....
    , concentrating on the system model and the interaction of independent and dependent variables.
  3. Summarizing a collection of observations
    Summary statistics

    File:Michelsonmorley-boxplot.svgIn descriptive statistics, summary statistics are used to summarize a set of observations, in order to communicate the largest amount as simply as possible....
     to feature their commonality by suppressing details. (Descriptive statistics
    Descriptive statistics

    Descriptive Statistics are used to describe the basic features of the data gathered from an experimental study in various ways. A descriptive Statistics is distinguished from inductive statistics....
    )
  4. Reaching consensus about what the observations tell
    Statistical inference

    Inferential statistics or statistical induction comprises the use of statistics to make inferences concerning some unknown aspect of a population....
     about the world being observed. (Statistical inference
    Statistical inference

    Inferential statistics or statistical induction comprises the use of statistics to make inferences concerning some unknown aspect of a population....
    )
  5. Documenting / presenting the results of the study.


Levels of measurement

See: Stanley Stevens' "Scales of measurement" (1946): nominal, ordinal, interval, ratio
There are four types of measurements or levels of measurement
Level of measurement

The "levels of measurement" is an expression which typically refers to the theory of scale types developed by the Harvard psychologist Stanley Smith Stevens....
 or measurement scales used in statistics: nominal, ordinal, interval, and ratio. They have different degrees of usefulness in statistical research
Research

Research is defined as human activity based on intellectual application in the investigation of matter. The primary purpose for applied research is discovery , interpretation , and the development of methods and systems for the advancement of human knowledge on a wide variety of scientific matters of our world and the universe....
. Ratio measurements have both a zero value defined and the distances between different measurements defined; they provide the greatest flexibility in statistical methods that can be used for analyzing the data. Interval measurements have meaningful distances between measurements defined, but have no meaningful zero value defined (as in the case with IQ measurements or with temperature measurements in Fahrenheit
Fahrenheit

Fahrenheit is a temperature scale named after the physicist Daniel Gabriel Fahrenheit , who proposed it in 1724. Today, the scale has largely been replaced by the Celsius scale; it is still in use for non-scientific purposes in the United States and a few other countries such as Belize....
). Ordinal measurements have imprecise differences between consecutive values, but have a meaningful order to those values. Nominal measurements have no meaningful rank order among values.

Since variables conforming only to nominal or ordinal measurements cannot be reasonably measured numerically, sometimes they are called together as categorical variables, whereas ratio and interval measurements are grouped together as quantitative or continuous variables due to their numerical nature.

Statistical techniques

Some well known statistical test
Statistical hypothesis testing

A statistical hypothesis test is a method of making statistical decisions using experimental data. It is sometimes called confirmatory data analysis, in contrast to exploratory data analysis....
s and procedure
Procedure

A procedure is a specified series of actions, acts or operations which have to be executed in the same manner in order to always obtain the same result under the same circumstances ....
s are:

Specialized disciplines

Some fields of inquiry use applied statistics so extensively that they have specialized terminology. These disciplines include:

Statistics form a key basis tool in business and manufacturing as well. It is used to understand measurement systems variability, control processes (as in statistical process control
Statistical process control

Statistical Process Control is an effective method of monitoring a process through the use of control charts. Control charts enable the use of objective criteria for distinguishing background variation from events of significance based on statistical techniques....
 or SPC), for summarizing data, and to make data-driven decisions. In these roles, it is a key tool, and perhaps the only reliable tool.

Statistical computing

Gretl Screenshot
The rapid and sustained increases in computing power starting from the second half of the 20th century have had a substantial impact on the practice of statistical science. Early statistical models were almost always from the class of linear model
Linear model

Disambiguation : go here for the Linear model of innovationIn statistics, given a sample the most general form of linear model is formulated as...
s, but powerful computers, coupled with suitable numerical algorithms, caused an increased interest in nonlinear models
Nonlinear regression

In statistics, nonlinear regression is a form of regression analysis in which observational data are modeled by a function which is a nonlinear combination of the model parameters and depends on one or more independent variables....
 (especially neural networks
Neural Networks

Neural Networks is the official journal of the three oldest societies dedicated to research in neural networks: International Neural Network Society, European Neural Network Society and Japanese Neural Network Society, published by Elsevier....
) as well as the creation of new types, such as generalised linear model
Generalized linear model

In statistics, the generalized linear model is a flexible generalization of ordinary linear regression. It relates the random distribution of the measured variable of the experiment to the systematic portion of the experiment through a function called the link function....
s and multilevel model
Multilevel model

Multilevel models are statistical models of parameters that vary at more than one level. These models can be seen as generalizations of linear models, although they can also extend non-linear models....
s.

Increased computing power has also led to the growing popularity of computationally-intensive methods based on resampling
Resampling (statistics)

In statistics, resampling is any of a variety of methods for doing one of the following:# Estimating the precision of sample statistics by using subsets of available data or drawing randomly with replacement from a set of data points ...
, such as permutation tests and the bootstrap
Bootstrapping (statistics)

In statistics, bootstrapping is a modern, computer-intensive, general purpose approach to statistical inference, falling within a broader class of Resampling methods....
, while techniques such as Gibbs sampling
Gibbs sampling

In mathematics and physics, Gibbs sampling is an algorithm to generate a sequence of samples from the joint probability of two or more random variables....
 have made use of Bayesian models more feasible. The computer revolution has implications for the future of statistics with new emphasis on "experimental" and "empirical" statistics. A large number of both general and special purpose statistical software
List of statistical packages

A statistical package is a suite of computer programs that are specialised for statistics. It enables people to obtain the results of standard statistical procedures and statistical significance tests, without requiring low-level numerical programming....
 are now available.

Misuse



There is a general perception that statistical knowledge is all-too-frequently intentionally misused
Misuse of statistics

A misuse of statistics occurs when a statistical argument asserts a falsehood. In some cases, the misuse may be accidental. In others, it is purposeful and for the gain of the perpetrator....
 by finding ways to interpret only the data that are favorable to the presenter. A famous saying attributed to Benjamin Disraeli is, "There are three kinds of lies: lies, damned lies, and statistics
Lies, damned lies, and statistics

"Lies, damned lies, and statistics" is part of a phrase attributed to Benjamin Disraeli and popularised in the United States by Mark Twain: "There are three kinds of lies: lies, damned lies, and statistics." The statement refers to the persuasive power of numbers, the use of statistics to bolster weak arguments, and the tendency of people to...
". Harvard President Lawrence Lowell wrote in 1909 that statistics, "...like veal pies, are good if you know the person that made them, and are sure of the ingredients".

If various studies appear to contradict one another, then the public may come to distrust such studies. For example, one study may suggest that a given diet or activity raises blood pressure
Blood pressure

Blood pressure is the pressure exerted by circulating blood on the walls of blood vessels, and constitutes one of the principal vital signs. The pressure of the circulating blood decreases as it moves away from the heart through artery and capillary, and toward the heart through veins....
, while another may suggest that it lowers blood pressure. The discrepancy can arise from subtle variations in experimental design, such as differences in the patient groups or research protocols, that are not easily understood by the non-expert. (Media reports usually omit this vital contextual information entirely, because of its complexity.)

By choosing (or rejecting, or modifying) a certain sample, results can be manipulated. Such manipulations need not be malicious or devious; they can arise from unintentional biases of the researcher. The graphs used to summarize data can also be misleading.

Deeper criticisms come from the fact that the hypothesis testing approach, widely used and in many cases required by law or regulation, forces one hypothesis (the null hypothesis
Null hypothesis

In statistics, a null hypothesis is a concept which arises in the context of statistical hypothesis testing. A common convention is to use the symbol H0 to denote the null hypothesis....
) to be "favored", and can also seem to exaggerate the importance of minor differences in large studies. A difference that is highly statistically significant can still be of no practical significance. (See criticism of hypothesis testing and controversy over the null hypothesis
Null hypothesis

In statistics, a null hypothesis is a concept which arises in the context of statistical hypothesis testing. A common convention is to use the symbol H0 to denote the null hypothesis....
.)

One response is by giving a greater emphasis on the p-value
P-value

In statistics hypothesis testing, the p-value is the probability of obtaining a result at least as extreme as the one that was actually observed, assuming that the null hypothesis is true....
 than simply reporting whether a hypothesis is rejected at the given level of significance. The p-value, however, does not indicate the size of the effect. Another increasingly common approach is to report confidence interval
Confidence interval

In statistics, a confidence interval is an interval estimation of a population parameter. Instead of estimating the parameter by a single value, an interval likely to include the parameter is given....
s. Although these are produced from the same calculations as those of hypothesis tests or p-values, they describe both the size of the effect and the uncertainty surrounding it.

Statistics applied to mathematics or the arts

Traditionally, Statistics was concerned with drawing inferences using a semi standardized methodology that was required learning in most sciences. This has changed with use of Statistics in non-inferential contexts. What was considered to be a dry subject, taken only as a requirement for degrees in many fields, is now viewed enthusiastically. What was derided by some mathematical purists is now considered essential methodology in some areas.
  • Scatter plots of data generated by a distribution function may be transformed with familiar tools used in Statistics to reveal underlying patterns, which may lead to hypotheses in number theory
    Number theory

    Number theory is the branch of pure mathematics concerned with the properties of numbers in general, and integers in particular, as well as the wider classes of problems that arise from their study....
    .
  • Methods of Statistics including predictive methods in forecasting
    Forecasting

    Forecasting is the process of estimation in unknown situations. Prediction is a similar, but more general term. Both can refer to estimation of time series, cross-sectional data or longitudinal study data....
    , are combined with chaos theory
    Chaos theory

    In mathematics, chaos theory describes the behavior of certain dynamical system s ? that is, systems whose states evolve with time ? that may exhibit dynamics that are highly sensitive to initial conditions ....
     and fractal geometry to create video works considered to be of beauty. The process art
    Process art

    Process art is an artistic movement as well as a creative sentiment and world view where the end product of art and craft, the :wikt:objet d?art, is not the principal focus....
     of Jackson Pollock
    Jackson Pollock

    Paul Jackson Pollock was an influential American painter and a major force in the abstract expressionism movement. In October 1945, he married the artist Lee Krasner....
     relied on artistic experiments whereby underlying distributions in nature were artistically revealed. With the advent of computers, methods of Statistics were applied to formalize such distribution driven natural processes, in order to make and analyze moving video art.
  • Methods of Statistics may be used predicatively, not inferentially in performance art
    Performance art

    Performance art is art in which the actions of an individual or a group at a particular place and in a particular time constitute the work. It can happen anywhere, at any time, or for any length of time....
    , as in a card trick based on a markov process
    Markov process

    A Markov process, named after the Russian mathematician Andrey Markov, is a mathematical model for the random evolution of a memoryless system, that is, one for which the likelihood of a given future state, at any given moment, depends only on its present state, and not on any past states....
     that only works some of the time, predicted using statistical methodology.
  • Statistics is used to predicatively create art, for example in applications of Statistical mechanics
    Statistical mechanics

    Statistical mechanics is the application of probability theory, which includes Mathematics tools for dealing with large populations, to the field of mechanics, which is concerned with the motion of particles or objects when subjected to a force....
     with the Statistical or Stochastic music invented by Iannis Xenakis
    Iannis Xenakis

    Iannis Xenakis was a Greeks modernist composer, musical theoretician, and architect. He is regarded as an important and influential composer of the twentieth century....
    , where the music is performance specific, and does not always come out as expected, but does within a range predicted using Statistics.


See also

  • Glossary of probability and statistics
    Glossary of probability and statistics

    The following is a glossary of terms. It is not intended to be all-inclusive....
  • List of academic statistical associations
    List of academic statistical associations

    International statistical societies*Institute of Mathematical Statistics*International Biometric Society*International Chinese Statistical Association...
  • List of basic statistics topics
    List of basic statistics topics

    Statistics is a Mathematics pertaining to the collection, analysis, interpretation, and presentation of data. It is applicable to a wide variety of academic disciplines, from the physical and social sciences to the humanities; it is also used and Misuse of statistics for making informed decisions in all areas of business and government....
  • List of national and international statistical services
    List of national and international statistical services

    National statistical services ...
  • List of publications in statistics
    List of publications in statistics

    Probability'The Doctrine of Chances':'Author:' Abraham de Moivre:'Publication data:' 1738 :'Online version:' ?'Th?orie analytique des probabilit?s':'Author:' Pierre-Simon Laplace:'Publication data:' 1820 :'Online version:'; , with more accurate character recognition; , complete PDF and PDFs by section...
  • List of statistical packages
    List of statistical packages

    A statistical package is a suite of computer programs that are specialised for statistics. It enables people to obtain the results of standard statistical procedures and statistical significance tests, without requiring low-level numerical programming....
  • List of statistical topics
    List of statistical topics

    Please add any Wikipedia articles related to statistics that are not already on this list.The "Related changes" link in the margin of this page leads to a list of the most recent changes to the articles listed below....
  • List of statisticians
    List of statisticians

    Statisticians or people who made notable contributions to the theories of statistics, or related aspects of probability, or machine learning....
  • Notation in probability and statistics
  • Forecasting
    Forecasting

    Forecasting is the process of estimation in unknown situations. Prediction is a similar, but more general term. Both can refer to estimation of time series, cross-sectional data or longitudinal study data....
  • Foundations of statistics
    Foundations of statistics

    Foundations of statistics is the usual name for the epistemology debate over how one should conduct inductive inference from data. Among issues considered are the question of Bayesian inference versus frequentist inference, the distinction between Ronald Fisher's "significance testing" and Jerzy Neyman-Egon Pearson "hypothesis testing", and...
  • Multivariate statistics
    Multivariate statistics

    Multivariate statistics or multivariate analysis in statistics describes a collection of procedures which involve observation and statistical analysis of more than one statistical variable at a time....
  • Official statistics
    Official statistics

    Official statistics are related directly to the field of statistics and cover all major areas of citizens' lives, such as economic and social development, living conditions , health , education , and the environment ....
  • Regression analysis
    Regression analysis

    In statistics, regression analysis is a collective name for techniques for the modeling and analysis of numerical data consisting of values of a dependent variable and of one or more independent variables ....
  • Statistical consultant
    Statistical consultant

    A statistical consultant provides statistical and research advice and guidance to clients in business, medicine, psychology, law, industry. The role of the statistical consultant varies from project to project, and can include any or all of the following:...
    s
  • Statistician
    Statistician

    Statisticians work with theoretical and applied statistics in both the private and public sectors. The core of that work is to measure, interpret, and describe the world and human activity patterns within it....
  • Structural equation modeling
    Structural equation modeling

    Structural equation modeling is a statisticaltechnique for testing and estimating causal relationshipsusing a combination of statistical data and qualitative causal...


External links


Online non-commercial textbooks

  • , by Will G. Hopkins, AUT University
  • , by U.S. National Institute of Standards and Technology
    National Institute of Standards and Technology

    The National Institute of Standards and Technology , known between 1901 and 1988 as the National Bureau of Standards , is a measurement standards laboratory which is a non-regulatory agency of the United States Department of Commerce....
     and SEMATECH
    SEMATECH

    SEMATECH is a non-profit consortium that performs basic research into Semiconductor device manufacturing. It was conceived of in 1986, formed in 1987, and began operating in 1988 as a partnership between the United States government and 14 U.S.-based semiconductor manufacturers to solve common manufacturing problems and regain competitivene...
  • , by David Lane, Joan Lu, Camille Peres, Emily Zitek, et al.
  • , by , Tufts University
    Tufts University

    Tufts University is a private research university in Medford, Massachusetts/Somerville, Massachusetts, near Boston, Massachusetts, United States....
  • , by


Other non-commercial resources

  • (Carleton College
    Carleton College

    Carleton College is an independent Sectarianism, coeducational, Liberal arts colleges in the United States in Northfield, Minnesota, Minnesota, United States....
    )
  • (ERIC)
  • (Rice University
    Rice University

    William Marsh Rice University is a private university research university located in Houston, Texas, Texas, United States. The campus is located near the Houston Museum District and adjacent to the Texas Medical Center....
    )
  • (University of Melbourne
    University of Melbourne

    The University of Melbourne is a public university located in Melbourne, Victoria . The second oldest university in Australia, and the oldest in Victoria, its main campus is in Parkville, Victoria, an inner suburb just north of the Melbourne CBD....
    )
  • (Carnegie Mellon University
    Carnegie Mellon University

    Carnegie Mellon University is a top private university research university in Pittsburgh. Since its inception, Carnegie Mellon has grown into a world-renowned institution, with numerous programs that are frequently college and university rankings among the best in the world....
    )