1. Descriptive Statistics: Frequency Distributions
Measures of Location I: Quantiles
Besides describing the characteristics of a distribution as a whole, descriptive statistics can also be used to provide more information about individual scores. One particularly useful piece of information is the location of a score relative to all other scores within the distribution.
Knowing a score's location relative to the other scores can, for instance, help you judge whether a particular score should be considered high, low, or average. Raw scores are, by themselves, not very informative in this regard.
One way to express a score's location within a distribution is to calculate its percentile rank.
Percentile Rank
The percentile rank of a score is the percentage of scores in the distribution that are equal to or lower than it.
A student scored points at an exam and would like to know how well she did compared to her classmates. The scores of the entire class are as follows:
To calculate the percentile rank of , first count the number of scores that are equal to or lower than , which in this case is .
Next, divide that number by the total number of scores, which in this case is , and multiply by .
So the percentile rank of is .
When a score is identified by its percentile rank, the score is called a percentile.
Percentiles
Definition
Percentiles are the values that divide a distribution of scores into one hundred equal parts.
The percentile of distribution is the value such that percent of the scores are equal to or below it.
Formula
The index of the percentile of a distribution is:
where is the number of scores and is a value between 1 and 99.
Percentile Calculation
The calculation of the percentile starts by ordering the scores in the distribution from smallest to largest. Next, to find the index of the percentile, use the following formula:
where the total number of scores in the distribution.
It is important to note that formula above is used to determine the location of the percentile and not the value associated with it.
If is an integer, then the percentile is the score located at the position of the ordered data.
Whenever is not an integer, linear interpolation is used to calculate the percentile:
- Find the two integers closest to by rounding up and down. These indices are denoted by and , respectively.
- Determine the values located at these positions. These values are denoted by and , respectively.
- Calculate the percentile with the following formula:
There are a number of different ways we can calculate the percentile. Click on one of the panels to toggle a specific solution.
Next, to find the index of the percentile (), use the following formula:
Since is an integer, the percentile is the score located at the position of the ordered data:
Assuming the sample scores are located in cells A1 through A21, the Excel command to calculate the percentile is:PERCENTILE(array, k)
- array: The array or cell range of numeric values for which you want the percentile value.
- k: The percentile value in the range , inclusive.
This gives:
Thus, to calculate the percentile, run the following command:quantile(x, probs)
- x: The numeric vector whose sample quantiles are wanted.
- probs: The numeric vector of probabilities with values in the range .
This gives:
Percentiles divide a distribution into equal parts. It is possible, however, to divide a distribution into any arbitrary number of equal parts. When dividing a distribution of scores into equal parts, the dividing values are called quantiles.
Quantiles
If you divide the data set into equal parts, you call the dividing values -quantiles and there are always quantiles.
If you divide a distribution of scores into four equal parts, the dividing values are referred to as quartiles.
Quartiles
Definition
Quartiles are the values that divide a distribution of scores into four equal parts.
The first (), second (), and third () quartiles are equal to the 25th, 50th, and 75th percentile, respectively.
The second quartile is also called the median.
Formula
The index of the quartile of a distribution is:
where is the number of scores and is a value between 1 and 3.
Quartile Calculation
The calculation of quartiles starts by ordering the scores in the distribution from smallest to largest. Next, to find the index of the quartile, use the following formula:
where is the total number of scores in the dataset.
It is important to note that formula above is used to determine the location of the quartile and not the value associated with it.
If is an integer, then the quartile is the score located at the position of the ordered data.
Whenever is not an integer, linear interpolation is used to calculate the quartile:
- Find the two integers closest to by rounding up and down. These indices are denoted by and , respectively.
- Determine the values located at these positions. These values are denoted by and , respectively.
- Calculate the quartile with the following formula:
There are a number of different ways we can calculate the quartile. Click on one of the panels to toggle a specific solution.
Next, to find the index of the quartile (), use the following formula:
Since is an integer, the quartile is the score located at the position of the ordered data:
Assuming the sample scores are located in cells A1 through A21, the Excel command to calculate the quartile is:QUARTILE(array, quart)
- array: The array or cell range of numeric values for which you want the quartile value.
- quart: Indicates which quartile to return.
This gives:
Thus, to calculate the quartile, run the following command:quantile(x)
- x: The numeric vector whose sample quantiles are wanted.
Looking at the output generated by R, under we find: