5. Sampling: Sampling Distributions
Sampling Distribution of the Sample Proportion
A type of variable which is commonly studied in statistics is the binary variable.
Binary Variable
Definition
A binary or dichotomous variable is a categorical variable that can only take on possible values.
Examples
- True/false
- Success/failure
- Yes/no
- On/off
The mean of a binary variable is mathematically equivalent to the proportion.
Proportion
Definition
In statistics, a proportion refers to the fraction of a group that possesses a particular characteristic.
The population and sample proportion are denoted and , respectively.
Formula
When using a sample proportion to estimate a population proportion, the sampling distribution of the sample proportion can be used to determine how much estimation error is reasonable to expect.
Sampling Distribution of the Sample Proportion
The sampling distribution of the sample proportion is the probability distribution of the sample proportions of every possible sample of a particular size that can be drawn from a population.
The mean of the distribution of sample proportions is called the expected value of the sample proportion and is denoted .
The standard deviation of the distribution of sample proportions is called the standard error of the sample proportion and is denoted . The standard error is a measure of how much discrepancy to expect between a sample proportion and the population proportion .
Conditions for Normality
For any population of which a proportion possesses a particular characteristic, the sampling distribution of the sample proportion for samples of size may be considered approximately normal if both of the following conditions are satisfied:
- We expect there to be at least positive cases:
- We expect there to be at least negative cases:
If both these conditions are satisfied, the sampling distribution of the sample proportions may be considered approximately normal with parameters:
Investigate whether the sampling distribution of the sample proportion may be considered approximately normal:
Since both conditions are satisfied, the expected value of the sample proportion, , is equal to the population proportion :