All Topics  
Null hypothesis

 

   Email Print
   Bookmark   Link






 

Null hypothesis



 
 
In statistics
Statistics

Statistics is a Mathematics pertaining to the collection, analysis, interpretation or explanation, and presentation of data. It also provides tools for prediction and forecasting based on data....
, a null hypothesis (H0) is a concept which arises in the context of statistical hypothesis testing
Statistical hypothesis testing

A statistical hypothesis test is a method of making statistical decisions using experimental data. It is sometimes called confirmatory data analysis, in contrast to exploratory data analysis....
. A common convention is to use the symbol H0 to denote the null hypothesis. The null hypothesis describes in a formal way some aspect of the statistical behaviour of a set of data and this description is treated as valid unless the actual behaviour of the data contradicts this assumption. Statistical hypothesis testing
Statistical hypothesis testing

A statistical hypothesis test is a method of making statistical decisions using experimental data. It is sometimes called confirmatory data analysis, in contrast to exploratory data analysis....
 is used to make a decision about whether the data does contradict the null hypothesis: this is also called significance testing.






Discussion
Ask a question about 'Null hypothesis'
Start a new discussion about 'Null hypothesis'
Answer questions from other users
Full Discussion Forum



Recent Posts









Encyclopedia


In statistics
Statistics

Statistics is a Mathematics pertaining to the collection, analysis, interpretation or explanation, and presentation of data. It also provides tools for prediction and forecasting based on data....
, a null hypothesis (H0) is a concept which arises in the context of statistical hypothesis testing
Statistical hypothesis testing

A statistical hypothesis test is a method of making statistical decisions using experimental data. It is sometimes called confirmatory data analysis, in contrast to exploratory data analysis....
. A common convention is to use the symbol H0 to denote the null hypothesis. The null hypothesis describes in a formal way some aspect of the statistical behaviour of a set of data and this description is treated as valid unless the actual behaviour of the data contradicts this assumption. Statistical hypothesis testing
Statistical hypothesis testing

A statistical hypothesis test is a method of making statistical decisions using experimental data. It is sometimes called confirmatory data analysis, in contrast to exploratory data analysis....
 is used to make a decision about whether the data does contradict the null hypothesis: this is also called significance testing. A null hypothesis is never proven by such methods, as the absence of evidence against the null hypothesis does not establish the truth of the null hypothesis. Failing to reject H0 says that there is no strong reason to change any decisions or procedures predicated on its truth, but it also allows for the possibility of obtaining further data and then re-examining the same hypothesis.

The term was coined by the English
England

native_name =|conventional_long_name = England|common_name = England|image_flag = Flag of England.svg|image_coat = England COA.svg|symbol_type = Royal Coat of Arms...
 geneticist
Geneticist

A geneticist is a scientist who studies genetics, the science of heredity and genetic variation of organisms. A geneticist can be employed as a researcher or lecturer....
 and statistician Ronald Fisher
Ronald Fisher

Sir Ronald Aylmer Fisher, Fellow of the Royal Society was an England statistician, evolutionary biologist, and genetics. He was described by Anders Hald as "a genius who almost single-handedly created the foundations for modern statistical science" and Richard Dawkins described him as "the greatest of Charles Darwin successors"....
.

Notionally, the null hypothesis set out for a particular significance test always occurs in conjunction with an alternative hypothesis. Although in some cases it may seem reasonable to consider the alternative hypothesis as simply the negation of the null hypothesis, this would be misleading. In fact, significance testing and statements about hypotheses always take place within the context of a set of assumptions (which may unfortunately be unstated). This provides a way of considering alternative hypotheses which are the negation of the null hypothesis within the context of the overall assumptions. However not all alternative hypotheses are of this "negation type": the simplest cases are directional hypotheses. An important case arises in testing for differences across a number of different groups, where the null hypothesis may be "no difference across groups" with the alternative hypothesis being that the mean values for the groups would be in a certain pre-specified order. In the theory of statistical hypothesis testing
Statistical hypothesis testing

A statistical hypothesis test is a method of making statistical decisions using experimental data. It is sometimes called confirmatory data analysis, in contrast to exploratory data analysis....
, the triple of "assumptions", "null hypothesis" and "alternative hypothesis" provides the basis for choosing an appropriate test statistic.

Testing for differences


In scientific and medical applications, the null hypothesis plays a major role in testing the significance of differences in treatment and control
Scientific control

Scientific controls are a vital part of the scientific method, since they can eliminate or minimise unintended influences such as researcher bias, environmental changes and biological variation....
 groups. This use, while widespread, is criticized on a number of grounds.

The assumption at the outset of the experiment is that no difference exists between the two groups (for the variable being compared): this is the null hypothesis in this instance. Examples of other types of null hypotheses are:
  • that values in samples from a given population can be modelled using a certain family of statistical distributions.
  • that the variability of data in different groups is the same, although they may be centred around different values.


Example

For example, one may want to compare the test scores of two random samples of men and women, and ask whether or not one group (population) has a mean score (which really is) different from the other. A null hypothesis would be that the mean score of the male population was the same as the mean score of the female population:

H0 : μ1 = μ2


where:

H0 = the null hypothesis
μ1 = the mean of population 1, and
μ2 = the mean of population 2.


Alternatively, the null hypothesis can postulate (suggest) that the two samples are drawn from the same population, so that the variance and shape of the distributions are equal, as well as the means.

Formulation of the null hypothesis is a vital step in testing statistical significance
Statistical significance

In statistics, a result is called statistically significant if it is unlikely to have occurred by chance. "A statistically significant difference" simply means there is statistical evidence that there is a difference; it does not mean the difference is necessarily large, important, or significant in the common meaning of the word....
. One can then establish the probability of observing the obtained data (or data more different from the prediction of the null hypothesis) if the null hypothesis is true. That probability is what is commonly called the "significance level" of the results.

That is, in scientific experimental design, we may predict that a particular factor will produce an effect on our dependent variable — this is our alternative hypothesis. We then consider how often we would expect to observe our experimental results, or results even more extreme, if we were to take many samples from a population where there was no effect (i.e. we test against our null hypothesis). If we find that this happens rarely (up to, say, 5% of the time), we can conclude that our results support our experimental prediction — we reject our null hypothesis.

Directionality


In many statements of null hypotheses there is no appearance that these can have a "directionality", in that the statement says that values are identical. However, null hypotheses can and do have "direction" - in many of these instances statistical theory allows the formulation of the test procedure to be simplified so that the test is equivalent to testing for an exact identity. For instance, if we formulate a one-tailed alternative hypothesis, application of Drug A will lead to increased growth in patients, then the true null hypothesis is the opposite of the alternative hypothesis — that is, application of Drug A will not lead to increased growth in patients. The effective null hypothesis will be application of Drug A will have no effect on growth in patients.

To understand why the effective null hypothesis is valid, it is instructive to consider the nature of the hypotheses outlined above. We are predicting that patients exposed to Drug A will see increased growth compared to a control group who do not receive the drug. That is,

H1: μdrug > μcontrol,


where:

μ = the patients' mean growth.


The effective null hypothesis is H0: µdrug = µcontrol.

The true null hypothesis is HT: µdrug = µcontrol.

The reduction occurs because, in order to gauge support for the alternative hypothesis, classical hypothesis testing requires us to calculate how often we would have obtained results as or more extreme than our experimental observations. In order to do this, we need first to define the probability of rejecting the null hypothesis for each possibility included in the null hypothesis and second to ensure that these probabilities are all less than or equal to the quoted significance level of the test. For any reasonable test procedure the largest of all these probabilities will occur on the boundary of the region HT, specifically for the cases included in H0 only. Thus the test procedure can be defined (that is the critical values can be defined) for testing the null hypothesis HT exactly as if the null hypothesis of interest was the reduced version H0.

Note that there are some who argue that the null hypothesis cannot be as general as indicated above: as Fisher, who first coined the term "null hypothesis" said, "the null hypothesis must be exact, that is free of vagueness and ambiguity, because it must supply the basis of the 'problem of distribution,' of which the test of significance is the solution." Thus according to this view, the null hypothesis must be numerically exact — it must state that a particular quantity or difference is equal to a particular number. In classical science, it is most typically the statement that there is no effect of a particular treatment; in observations, it is typically that there is no difference between the value of a particular measured variable and that of a prediction. The usefulness of this viewpoint must be queried - one can note that the majority of null hypotheses test in practice do not meet this criterion of being "exact". For example, consider the usual test that two means are equal where the true values of the variances are unknown - exact values of the variances are not specified.

Most statisticians believe that it is valid to state direction as a part of null hypothesis, or as part of a null hypothesis/alternative hypothesis pair (for example see http://davidmlane.com/hyperstat/A73079.html). The logic is quite simple: if the direction is omitted, then if the null hypothesis is not rejected it is quite confusing to interpret the conclusion. Say, the null is that the population mean = 10, and the one-tailed alternative: mean > 10. If the sample evidence obtained through x-bar equals -200 and the corresponding t-test statistic equals -50, what is the conclusion? Not enough evidence to reject the null hypothesis? Surely not! But we cannot accept the one-sided alternative in this case. Therefore, to overcome this ambiguity, it is better to include the direction of the effect if the test is one-sided. The statistical theory required to deal with the simple cases dealt with here, and more complicated ones, makes use of the concept of an unbiased test.

Limitations


A test of a null hypothesis is useful because it sets a limit on the probability of observing a data set
Data set

A data set is a collection of data, usually presented in tabular form. Each column represents a particular variable. Each row corresponds to a given member of the data set in question....
 as or more extreme than that observed if the null hypothesis is true. In general it is much harder to be precise about the corresponding probability if the alternative hypothesis is true.

If experimental observations contradict the prediction of the null hypothesis, it means that either the null hypothesis is false, or the event under observation occurs very improbably. This gives us high confidence in the falsehood of the null hypothesis, which can be improved in proportion to the number of trials conducted. However, accepting the alternative hypothesis only commits us to a difference in observed parameters; it does not prove that the theory or principles that predicted such a difference is true, since it is always possible that the difference could be due to additional factors not recognized by the theory.

For example, rejection of a null hypothesis that predicts that the rates of symptom relief in a sample of patients who received a placebo
Placebo

The placebo effect is a phenomenon in medicine where the results of a medical treatment are affected by their symbolism, and not just their medical value....
 and a sample who received a medicinal drug will be equal allows us to make a non-null statement (that the rates differed); it does not prove that the drug relieved the symptoms, though it gives us more confidence in that hypothesis.

The formulation, testing, and rejection of null hypotheses is methodologically consistent with the falsifiability
Falsifiability

Falsifiability is the logical possibility that an assertion can be shown false by an observation or a physical experiment. That something is "falsifiable" does not mean it is false; rather, that if it is false, then this can be shown by observation or experiment....
 model of scientific discovery
Science

In its broadest sense, science refers to any systematic knowledge or practice. In its more usual restricted sense, science refers to a system of acquiring knowledge based on scientific method, as well as to the organized body of knowledge gained through such research....
 formulated by Karl Popper
Karl Popper

Knight Bachelor Karl Raimund Popper Order of the Companions of Honour, Fellow of the Royal Society, Fellow of the British Academy was an Austrian and British philosopher and a professor at the London School of Economics....
 and widely believed to apply to most kinds of empirical research
Empirical research

Empirical research is any research that bases its findings on direct or indirect observation as its test of reality. Such research may also be conducted according to Hypothetico deductive model procedures, such as those developed from the work of Ronald Fisher....
. However, concerns regarding the high power
Statistical power

The power of aStatistical hypothesis testing is the probability that the test will reject a false null hypothesis . As power increases, the chances of a Type II error decrease....
 of statistical tests
Statistical hypothesis testing

A statistical hypothesis test is a method of making statistical decisions using experimental data. It is sometimes called confirmatory data analysis, in contrast to exploratory data analysis....
 to detect differences in large samples have led to suggestions for re-defining the null hypothesis, for example as a hypothesis that an effect falls within a range considered negligible. This is an attempt to address the confusion among non-statisticians between significant and substantial, since large enough samples are likely to be able to indicate differences however minor.

The theory underlying the idea of a null hypothesis is closely associated with the frequency
Frequency probability

Frequency probability is the Probability interpretations that defines an event's probability as the limit of its relative frequency in a large number of trials....
 theory of probability, in which probabilistic statements can only be made about the relative frequencies of events in arbitrarily large samples. One way in which a failure to reject the null hypothesis is meaningful is in relation to an arbitrarily large population from which the observed sample is supposed to be drawn. A second way in which it is meaningful is from approach where both an experiment and all details of the statistical analysis are decided before doing the experiment. The significance level of a test is then conceptually identical to the probability of incorrectly rejecting the null hypothesis judged at a pre-experiment stage, where this probability need not be a frequency-based/large-sample one.

Controversy


As with statistical hypothesis testing
Statistical hypothesis testing

A statistical hypothesis test is a method of making statistical decisions using experimental data. It is sometimes called confirmatory data analysis, in contrast to exploratory data analysis....
, the use of null hypothesis testing is criticized on a number of grounds.

Straw man

Null hypothesis testing is controversial when the alternative hypothesis is suspected to be true at the outset of the experiment, making the null hypothesis the reverse of what the experimenter actually believes; it is put forward as a straw man
Straw man

A straw man logical argument is an informal fallacy based on misrepresentation of an opponent's position. To "attack a straw man" is to create the illusion of having refuted a proposition by substituting a superficially similar proposition , and refuting it, without ever having actually refuted the original position....
 only to allow the data to contradict it. Many statisticians have pointed out that rejecting the null hypothesis says nothing or very little about the likelihood that the null is true. Under traditional null hypothesis testing, the null is rejected when the conditional probability
Conditional probability

Conditional probability is the probability of some event A, given the occurrence of some other event B. Conditional probability is written P, and is read "the probability of A, given B"....
 P(Data as or more extreme than observed | Null) is very small, say 0.05. However, some say researchers are really interested in the probability P(Null | Data as actually observed) which cannot be inferred from a p-value
P-value

In statistics hypothesis testing, the p-value is the probability of obtaining a result at least as extreme as the one that was actually observed, assuming that the null hypothesis is true....
: some like to present these as inverses of each other but the events "Data as or more extreme than observed" and "Data as actually observed" are very different. In some cases, P(Null | Data) approaches 1 while P(Data as or more extreme than observed | Null) approaches 0, in other words, we can reject the null when it's virtually certain to be true. For this and other reasons, Gerd Gigerenzer
Gerd Gigerenzer

Gerd Gigerenzer is a Germany psychologist who has studied the use of bounded rationality and heuristics in decision making, especially in medicine....
 has called null hypothesis testing "mindless statistics" while Jacob Cohen
Jacob Cohen

Jacob Cohen was a US statistician and psychologist best known for his work on statistical power, where he helped to lay foundations for current statistical meta-analysis....
 described it as a ritual conducted to convince ourselves that we have the evidence needed to confirm our theories.

Bayesian criticism

Bayesian
Bayesian

Bayesian refers to methods in probability and statistics named after the Reverend Thomas Bayes , in particular methods related to:* the degree-of-belief interpretation of probability, as opposed to frequency or proportion or propensity interpretations; or...
 statisticians normally reject the idea of null hypothesis testing, instead using various techniques in Bayesian inference
Bayesian inference

Bayesian inference is statistical inference in which evidence or observations are used to update or to newly infer the probability that a hypothesis may be true....
. Given a prior probability distribution for one or more parameters, sample evidence can be used to generate an updated posterior distribution. In this framework, but not in the null hypothesis testing framework, it is meaningful to make statements of the general form "the probability that the true value of the parameter is greater than 0 is p". According to Bayes Theorem, we have:

thus P(Null | Data) approaches 1 while P(Data | Null) approaches 0 exactly when P(Null)/P(Data) approaches 1, i.e. (for instance) when the a priori probability of the null hypothesis, P(Null), is also approaching 1, while P(Data) approaches 0: then P(Data | Null) is low because we have extremely unlikely data, but the Null hypothesis is extremely likely to be true.

Publication bias


In 2002, a group of psychologists launched a new journal dedicated to experimental studies in psychology
Psychology

Psychology is an academic and applied science discipline involving the science study of human mental functions and behavior. Occasionally it also relies on symbolic hermeneutics and critical theory, although these traditions are less pronounced than in other social sciences such as sociology....
 which support the null hypothesis. The Journal of Articles in Support of the Null Hypothesis (JASNH) was founded to address a scientific publishing bias against such articles. According to the editors,

"other journals and reviewers have exhibited a bias against articles that did not reject the null hypothesis. We plan to change that by offering an outlet for experiments that do not reach the traditional significance levels (p < 0.05). Thus, reducing the file drawer problem
File drawer problem

The file drawer problem is that many studies in a given area of research may be conducted but never reported, and those that are not reported may on average report different results from those that are reported....
, and reducing the bias in psychological literature. Without such a resource researchers could be wasting their time examining empirical questions that have already been examined. We collect these articles and provide them to the scientific community free of cost."


The "File Drawer problem" is a problem that exists due to the fact that academics tend not to publish results that indicate the null hypothesis could not be rejected. This does not mean that the relationship they were looking for did not exist, but it means they couldn't prove it. Even though these papers can often be interesting, they tend to end up unpublished, in "file drawers."

Ioannidis has inventoried factors that should alert readers to risks of publication bias.

See also

  • Counternull
    Counternull

    In statistics, and especially in the statistical analysis of psychology data, the counternull is a statistic used to aid the understanding and presentation of research results....
  • Dream argument
    Dream argument

    The "dream argument" is the postulation that the act of dreaming provides preliminary evidence that the senses we trust to distinguish reality from illusion should not be fully trusted, and therefore any state that is dependent on our senses should at the very least be carefully examined and rigorously tested to determine if it is in fact "re...
  • Straw man
    Straw man

    A straw man logical argument is an informal fallacy based on misrepresentation of an opponent's position. To "attack a straw man" is to create the illusion of having refuted a proposition by substituting a superficially similar proposition , and refuting it, without ever having actually refuted the original position....
  • P-value
    P-value

    In statistics hypothesis testing, the p-value is the probability of obtaining a result at least as extreme as the one that was actually observed, assuming that the null hypothesis is true....
  • Publication bias
    Publication bias

    Publication bias arises from the tendency for researchers, editors, and pharmaceutical companies to handle experimental results that are positive differently from results that are negative or inconclusive....
  • Statistical hypothesis testing
    Statistical hypothesis testing

    A statistical hypothesis test is a method of making statistical decisions using experimental data. It is sometimes called confirmatory data analysis, in contrast to exploratory data analysis....
  • Null Hypothesis: The Journal of Unlikely Science
    Null Hypothesis: The Journal of Unlikely Science

    Null Hypothesis: The Journal of Unlikely Science is an online Satire science website, which casts a wry eye over the world of science and technology....


External links