Decision Rule
Critical value approach
In hypothesis testing, a critical value is a point on the test distribution that is compared to the test statistic to determine whether to reject the null hypothesis. If the absolute value of your test statistic is greater than the critical value, you can declare statistical significance and reject the null hypothesis. Critical values correspond to , so their values become fixed when you choose the test's .
Here we have an example concerning the mean grade point average, suppose we take a random sample of students majoring in mathematics. Since , our test statistic has degrees of freedom. Suppose we set our significance level at 0.05, so that we have only a chance of making a Type I error.
Examples for critical value approach
- Critical value for right-tailed test
- The critical value for conducting the right-tailed test : versus : is the t-value, denoted , such that the probability to the right of it is . It can be shown using either statistical software or a t-table that the critical value is 1.7613. That is, we would reject the null hypothesis : = 3 in favor of the alternative hypothesis : > 3 if the test statistic is greater than 1.7613.
- Critical value for right-tailed test
- The critical value for conducting the left-tailed test : versus : is the t-value, denoted , such that the probability to the left of it is α. It can be shown using either statistical software or a t-table that the critical value is -1.7613. That is, we would reject the null hypothesis : = 3 in favor of the alternative hypothesis : < 3 if the test statistic is less than -1.7613.
- Critical value for right-tailed test
- There are two critical values for the two-tailed test : versus : — one for the left-tail denoted and one for the right-tail denoted . The value is the t-value such that the probability to the left of it is , and the value is the t-value such that the probability to the right of it is . It can be shown using either statistical software or a t-table that the critical value is -2.1448 and the critical value is 2.1448. That is, we would reject the null hypothesis : in favor of the alternative hypothesis : if the test statistic is less than -2.1448 or greater than 2.1448:
P-value approach
The P-value approach involves determining "likely" or "unlikely" by determining the probability — assuming the null hypothesis were true — of observing a more extreme test statistic in the direction of the alternative hypothesis than the one observed.
- If the P-value is small, say less than (or equal to) , then it is "unlikely."
- If the P-value is large, say more than , then it is "likely."
If the P-value is less than (or equal to) , then the null hypothesis is rejected in favor of the alternative hypothesis. And, if the P-value is greater than , then the null hypothesis is not rejected.
Here, we have four steps to use the P-value approach to make the decision for hypothesis test.
- Specify the null and alternative hypotheses.
Using the sample data and assuming the null hypothesis is true, calculate the value of the test statistic. Again, to conduct the hypothesis test for the population mean , we use the t-statistic which follows a t-distribution with degrees of freedom.
Using the known distribution of the test statistic, calculate the P-value.
- If the null hypothesis is true, what is the probability that we will observe a more extreme test statistic in the direction of the alternative hypothesis than we did?
- Set the significance level, .
- is the probability of making a Type I error to be small and the common value for are 0.01, 0.05, or 0.10.
- Compare the P-value to .
- If the P-value is less than (or equal to) , reject the null hypothesis in favor of the alternative hypothesis. If the P-value is greater than , do not reject the null hypothesis.
Examples for p-value approach
In our example concerning the mean grade point average, suppose that our random sample of students majoring in mathematics yields a test statistic equaling 2.5. Since , our test statistic has degrees of freedom. Also, suppose we set our significance level at 0.05, so that we have only a 5% chance of making a Type I error.
- P-value approach for right-tailed test
- The P-value for conducting the right-tailed test versus is the probability that we would observe a test statistic greater than if the population mean μμ really were 3. Recall that probability equals the area under the probability curve. The P-value is therefore the area under a curve and to the right of the test statistic . It can be shown using statistical software that the P-value is 0.0127.
- The P-value, 0.0127, tells us it is "unlikely" that we would observe such an extreme test statistic in the direction of if the null hypothesis were true. Therefore, our initial assumption that the null hypothesis is true must be incorrect. That is, since the P-value, 0.0127, is less than = 0.05, we reject the null hypothesis in favor of the alternative hypothesis .
- Note that we would not reject in favor of if we lowered our willingness to make a Type I error to instead, as the P-value, 0.0127, is then greater than = 0.01.
- P-value approach for left-tailed test
- The P-value for conducting the left-tailed test versus is the probability that we would observe a test statistic less than if the population mean really were 3. The P-value is therefore the area under a curve and to the left of the test statistic . It can be shown using statistical software that the P-value is 0.0127.
- The P-value, 0.0127, tells us it is "unlikely" that we would observe such an extreme test statistic in the direction of if the null hypothesis were true. Therefore, our initial assumption that the null hypothesis is true must be incorrect. That is, since the P-value, 0.0127, is less than , we reject the null hypothesis in favor of the alternative hypothesis . *Note that we would not reject in favor of if we lowered our willingness to make a Type I error to instead, as the P-value, 0.0127, is then greater than .
- P-value approach for two-tailed test
- The P-value for conducting the two-tailed test versus is the probability that we would observe a test statistic less than -2.5 or greater than 2.5 if the population mean really were 3. That is, the two-tailed test requires taking into account the possibility that the test statistic could fall into either tail (and hence the name "two-tailed" test). The P-value is therefore the area under a curve to the left of -2.5 and to the right of the 2.5. It can be shown using statistical software that the P-value is 0.0127 + 0.0127, or 0.0254.
- Note that the P-value for a two-tailed test is always two times the P-value for either of the one-tailed tests. The P-value, 0.0254, tells us it is "unlikely" that we would observe such an extreme test statistic in the direction of if the null hypothesis were true. Therefore, our initial assumption that the null hypothesis is true must be incorrect. That is, since the P-value, 0.0254, is less than = 0.05, we reject the null hypothesis in favor of the alternative hypothesis .
- Note that we would not reject in favor of if we lowered our willingness to make a Type I error to instead, as the P-value, 0.0254, is then greater than = 0.01.