How do we find the standard deviation of a sample? Why is it important to know the standard deviation of a sample?
Standard Deviation of a Sample
The standard deviation of a sample is a measure of the amount of variation or dispersion of a set of values. It is calculated as the square root of the variance, which is the average of the squared differences from the mean.
Here’s the formula for standard deviation (σ):
Where:
- Σ is the sum of…
- (xi - μ)^2: the squared difference between each data point (xi) and the mean (μ)
- N is the number of data points in the sample
Knowing the standard deviation of a sample is important because it gives us an understanding of how spread out the values in our data are around the mean. A low standard deviation means that the values tend to be close to the mean, while a high standard deviation means that the values are spread out over a wider range.
It’s a key concept in statistics and is used in many areas, including finance, physics, and machine learning, to name a few.
Example
Let’s consider a simple example with a sample of five values: 2, 4, 6, 8, 10.
First, we calculate the mean (μ):
Next, we calculate the squared differences from the mean:
- (2 - 6)^2 = 16
- (4 - 6)^2 = 4
- (6 - 6)^2 = 0
- (8 - 6)^2 = 4
- (10 - 6)^2 = 16
Then, we calculate the average of these squared differences (this is the variance):
Finally, we take the square root of the variance to get the standard deviation:
So, the standard deviation of our sample is approximately 2.83.
Variance
Variance is a statistical measurement that describes the spread of data points in a data sample. It measures how far each number in the set is from the mean (average) and thus from every other number in the set. Variance is often denoted by the symbol σ².
Here’s the formula for variance:
Where:
- Σ is the sum of…
- (xi - μ)^2: the squared difference between each data point (xi) and the mean (μ)
- N is the number of data points in the sample
Variance is used to see how individual numbers relate to each other within a data set, rather than using broader mathematical techniques such as arranging data into quartiles. It’s a key concept in many areas of statistics, including finance, physics, and machine learning.
Example
Continuing with our previous example where we had a sample of five values: 2, 4, 6, 8, 10. We calculated the variance as:
So, the variance of our sample is 8.
Calculating Variance, Standard Deviation, and more using the TI 84 Calculator
In 1 Variable Stats On Calculator, I discuss how you can use the 1-Var Stats function on the TI-84 Plus CE to calculate the mean, sum of x, sum of x squared, sample standard deviation, population standard deviation, and more for a single-variable data set. This is a useful tool for quickly obtaining these statistics for your data.