7. Hypothesis Testing: Introduction to Hypothesis Testing
Statistical Power
The Statistical Power of a test is the probability of rejecting the null hypothesis when it is false.
Statistical Power can be thought of as the sensitivity of the hypothesis test.
Ideally, you want to have high power to detect practically significant effects, but low power for detecting effects which are not of practical significance. High power can be achieved by using the most powerful statistical procedure available, and then choosing a sufficiently large sample size.
Researchers generally choose a sample size based on the goal of achieving a desired Power level for detecting a particular effect size. For example, a researcher may want at least a specified level of Power for detecting that the true mean is more than a given number of units larger than the hypothesized mean $\mu_0$.
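As a rough illustration of choosing a sample size from a target Power, the sketch below uses the standard closed-form result for a one-sided one-sample Z-test with known standard deviation. The function name `required_sample_size` and all input values (an effect of 5 units, a standard deviation of 15, a 0.05 significance level, a 0.80 target Power) are hypothetical and are not taken from the text.

```python
import math
from statistics import NormalDist


def required_sample_size(delta, sigma, alpha=0.05, power=0.80):
    """Smallest n giving at least `power` for a one-sided one-sample Z-test.

    delta: effect size to detect (distance between true and hypothesized mean), > 0
    sigma: known population standard deviation
    """
    z = NormalDist()                 # standard normal distribution
    z_alpha = z.inv_cdf(1 - alpha)   # critical z-value for the test
    z_power = z.inv_cdf(power)       # z-value matching the target Power
    n = ((z_alpha + z_power) * sigma / delta) ** 2
    return math.ceil(n)              # round up to a whole observation


# Hypothetical inputs, chosen only for illustration
n_needed = required_sample_size(delta=5, sigma=15, alpha=0.05, power=0.80)
print(n_needed)
```

Rounding up rather than to the nearest integer keeps the achieved Power at or above the target.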
Power Calculation for a One-sample Z-test with Statistical Software
The Power of a hypothesis test can be calculated in either Excel or R.
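Consider a left-tailed test of the null hypothesis $H_0: \mu = \mu_0$ against the alternative hypothesis $H_a: \mu < \mu_0$ at a significance level $\alpha$, based on a sample of size $n$ from a population with known standard deviation $\sigma$.
Suppose the true value of the population mean is $\mu_1$, where $\mu_1 < \mu_0$. Then we can compute the power of this test by completing the following two steps. In the formulas, alpha, mu0, mu1, sigma, n, and xcrit stand for the corresponding numeric values (or cell references in Excel).
1. Compute how small $\bar{x}$ must be to cause $H_0$ to be rejected at a significance level $\alpha$. Call this value $\bar{x}_{crit}$, where $\bar{x}_{crit} = \mu_0 + \Phi^{-1}(\alpha)\,\sigma/\sqrt{n}$ and $\Phi^{-1}$ is the standard normal quantile function:
Excel
=NORM.INV(alpha, mu0, sigma/SQRT(n))
R
qnorm(alpha, mean = mu0, sd = sigma/sqrt(n))
2. Compute the corresponding power, $P(\bar{X} \le \bar{x}_{crit})$ when $\mu = \mu_1$:
Excel
=NORM.DIST(xcrit, mu1, sigma/SQRT(n), TRUE)
R
pnorm(xcrit, mean = mu1, sd = sigma/sqrt(n))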
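The two steps for the left-tailed test can also be sketched in Python. Python is not one of the tools the text names and is used here only for illustration, via the standard library's `statistics.NormalDist`. The function name `power_left_tailed` and the input values (hypothesized mean 100, true mean 95, standard deviation 15, sample size 36, significance level 0.05) are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def power_left_tailed(mu0, mu1, sigma, n, alpha):
    """Return (critical sample mean, power) for a left-tailed one-sample Z-test."""
    se = sigma / sqrt(n)                          # standard error of the sample mean
    # Step 1: largest sample mean that still rejects H0 (alpha quantile under H0)
    x_crit = NormalDist(mu0, se).inv_cdf(alpha)
    # Step 2: probability of falling at or below x_crit when the true mean is mu1
    power = NormalDist(mu1, se).cdf(x_crit)
    return x_crit, power


# Hypothetical inputs, chosen only for illustration
x_crit, power = power_left_tailed(mu0=100, mu1=95, sigma=15, n=36, alpha=0.05)
print(f"critical value: {x_crit:.3f}, power: {power:.3f}")
```

Working on the scale of the sample mean, rather than standardizing to a z-score first, mirrors the "how small must the sample mean be" wording of step 1.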
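Consider a right-tailed test of the null hypothesis $H_0: \mu = \mu_0$ against the alternative hypothesis $H_a: \mu > \mu_0$ at a significance level $\alpha$, based on a sample of size $n$ from a population with known standard deviation $\sigma$.
Suppose the true value of the population mean is $\mu_1$, where $\mu_1 > \mu_0$. Then we can compute the power of this test by completing the following two steps. In the formulas, alpha, mu0, mu1, sigma, n, and xcrit stand for the corresponding numeric values (or cell references in Excel).
1. Compute how large $\bar{x}$ must be to cause $H_0$ to be rejected at a significance level $\alpha$. Call this value $\bar{x}_{crit}$, where $\bar{x}_{crit} = \mu_0 + \Phi^{-1}(1-\alpha)\,\sigma/\sqrt{n}$ and $\Phi^{-1}$ is the standard normal quantile function:
Excel
=NORM.INV(1-alpha, mu0, sigma/SQRT(n))
R
qnorm(1-alpha, mean = mu0, sd = sigma/sqrt(n))
2. Compute the corresponding power, $P(\bar{X} \ge \bar{x}_{crit})$ when $\mu = \mu_1$:
Excel
=1-NORM.DIST(xcrit, mu1, sigma/SQRT(n), TRUE)
R
1-pnorm(xcrit, mean = mu1, sd = sigma/sqrt(n))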
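A matching Python sketch for the right-tailed case follows, again using only the standard library. The function name `power_right_tailed_test` and the input values (hypothesized mean 100, true mean 105, standard deviation 15, sample size 36, significance level 0.05) are hypothetical. The only changes from a left-tailed calculation are the quantile used for the critical value and the tail used for the power.

```python
from math import sqrt
from statistics import NormalDist


def power_right_tailed_test(mu0, mu1, sigma, n, alpha):
    """Return (critical sample mean, power) for a right-tailed one-sample Z-test."""
    se = sigma / sqrt(n)                          # standard error of the sample mean
    # Step 1: smallest sample mean that rejects H0 (1 - alpha quantile under H0)
    x_crit = NormalDist(mu0, se).inv_cdf(1 - alpha)
    # Step 2: probability of exceeding x_crit when the true mean is mu1
    power = 1 - NormalDist(mu1, se).cdf(x_crit)
    return x_crit, power


# Hypothetical inputs, chosen only for illustration
x_crit_right, power_right = power_right_tailed_test(
    mu0=100, mu1=105, sigma=15, n=36, alpha=0.05
)
print(f"critical value: {x_crit_right:.3f}, power: {power_right:.3f}")
```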
There are a number of different ways we can calculate the Power of the test. Each follows the same two steps. Let $\mu_1$ denote the true population mean. First compute how small the sample mean $\bar{x}$ must be to cause the null hypothesis $H_0$ to be rejected at the chosen level of significance, and then compute the Power.
Power Curve
Often, a Power curve is used to investigate the relationship between Power and effect size for a specified sample size, significance level, or standard deviation.
For example, consider the Power curve for a test of the null hypothesis $H_0$ against the alternative $H_a$ at a fixed level of significance $\alpha$, with the population standard deviation $\sigma$ and the sample size $n$ held fixed. Here, $\mu_1$ denotes the true value of the population mean $\mu$, and the effect size is the difference between $\mu_1$ and the hypothesized mean $\mu_0$.
Note that for an effect size of $0$, the Power always equals the significance level $\alpha$.
The Power steadily increases as the effect size increases until it reaches the target Power.
The Power then levels off towards the maximum Power of $1$ for larger effect sizes.
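The shape described above can be checked numerically. The sketch below tabulates a Power curve for a right-tailed one-sample Z-test over a grid of effect sizes. The choice of a right-tailed test, the function name `power_for_effect`, and the values of the standard deviation (15), sample size (36), and significance level (0.05) are all assumptions made for illustration, not the settings behind the original curve.

```python
from math import sqrt
from statistics import NormalDist


def power_for_effect(effect, sigma, n, alpha):
    """Power of a right-tailed one-sample Z-test for effect = mu1 - mu0."""
    z_crit = NormalDist().inv_cdf(1 - alpha)      # rejection cutoff on the z scale
    return 1 - NormalDist().cdf(z_crit - effect * sqrt(n) / sigma)


# Hypothetical settings, chosen only for illustration
SIGMA, N, ALPHA = 15, 36, 0.05
effects = [0, 2.5, 5, 7.5, 10, 12.5, 15]
curve = [power_for_effect(e, SIGMA, N, ALPHA) for e in effects]

for e, p in zip(effects, curve):
    print(f"effect size {e:5.1f}  power {p:.3f}")
```

The first row reproduces the significance level, and the later rows climb towards 1.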
Modifying the sample size $n$, the significance level $\alpha$, or the population standard deviation $\sigma$ will change the shape of the Power curve. For a given effect size, the Power will be higher if:
- The sample size increases.
- The significance level increases.
- The population standard deviation decreases.
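The three conditions listed above can be illustrated by changing one input at a time and comparing against a baseline. The sketch assumes a right-tailed one-sample Z-test. The function name `z_test_power` and every numeric value are hypothetical.

```python
from math import sqrt
from statistics import NormalDist


def z_test_power(effect, sigma, n, alpha):
    """Power of a right-tailed one-sample Z-test for effect = mu1 - mu0."""
    z_crit = NormalDist().inv_cdf(1 - alpha)
    return 1 - NormalDist().cdf(z_crit - effect * sqrt(n) / sigma)


# Hypothetical baseline: effect 5, sigma 15, n 36, alpha 0.05
baseline = z_test_power(effect=5, sigma=15, n=36, alpha=0.05)

# Change one input at a time, keeping the effect size fixed
larger_n = z_test_power(effect=5, sigma=15, n=64, alpha=0.05)
larger_alpha = z_test_power(effect=5, sigma=15, n=36, alpha=0.10)
smaller_sigma = z_test_power(effect=5, sigma=10, n=36, alpha=0.05)

for label, value in [
    ("baseline", baseline),
    ("larger sample size", larger_n),
    ("larger significance level", larger_alpha),
    ("smaller standard deviation", smaller_sigma),
]:
    print(f"{label:28s} power {value:.3f}")
```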