Piqosity Logo
  • Leaders
  • Forum
  • Login

Lessons

Video Transcript (0:07) Standard deviation is a statistical measure of how spread out data is. Specifically, we use standard deviation to see how much our data varies. The lowercase Greek letter sigma, , represents the standard deviation. (0:22) The idea of how much our data varies is specified by variance, which is actually how we define standard deviation. (0:30) Variance is defined as the average of squared differences from the mean of the data set and is represented by the lowercase greek letter . (0:40) Consider a set of data that represents the number of siblings that each student in a class has. It could look something like {1, 0, 2, 1, 0, 1, 3, 3, 1, 2, 1, 0, 0}. (0:50) This means that to calculate variance, there are three steps: Calculate the mean of the data Calculate the sum of squared differences from the mean. Find the average of the value found in step 2. (1:05) First we calculate the mean. Add up all of the values and divide by the total number of values to find that the mean is . This means that on average, there is 1 sibling for each student. (1:20) Second, we calculate the sum of squared differences from the mean. This means that we subtract each data point from the mean, square it, and then add all of these values together. To illustrate this process, we will simply move through our original data set left to right and apply that process to each term. (1:45) The total sum of all of our squared differences from the mean is 13. (1:50) Now, we must tackle the final step of averaging this. However, it is not as straightforward as just dividing the sum of squared differences by 13! We must take into consideration whether we are dealing with the data from either: A population A sample (2:06) A population would be if we collected data from every person that we are interested in. This is like the U.S. Census, which aims to collect data about every single person in the United States. (2:18) Sample data is data that is gained from a smaller group of the population. For example, a sample of people from Texas would not necessarily represent the whole United States if that’s what we’re trying to measure. (2:32) If we have the population data, we could simply divide the variance by the total number of data points. For instance, in the class of 13 students that we’re dealing with, if we are only interested in describing the data from that class, we’d have the entire population. However, if we were trying to generalize the data to all of the classes in that school, this group of 13 students would just be a sample! (2:56) In a sample, we actually divide by 1 less than the number of data points. This is an attempt to “correct” our data since the sample is likely to be a little bit off from the actual population value. For our example, we’d divide the sum of our squared differences by 12 instead of 13. (3:15) This means that if the 13 students represent a population, the variance is  and if the students represent a sample, the variance is . (3:30) You may be wondering why on earth we’ve spent all of this time calculating variance for a lesson on standard deviation! This is because just like how standard deviation is represented by sigma, the variance is represented by sigma squared. So, to calculate the standard deviation, we just take the square root of variance! (3:48) For our population data, this is  and we have a standard deviation of 1. This means that on average, we could expect many of the values in our data set to be 1 value away from the mean. Since our mean is 1, this means that we expect many of the values to be between 0 and 2, since this is subtracting or adding 1 to the mean. This is exactly what our data looks like with many students having 0, 1, or 2 siblings and only one student having 3 siblings! (4:29) Calculating standard deviation is a nightmarish experience for most students, but thankfully calculators can usually do most of the work. The simpler skill to develop is determining how data is spread out on a bell curve. (4:41) A normal distribution looks like this: (4:45) The mean is the value in the middle of the curve and the data is spread out underneath the curve evenly. How large the standard deviation is will affect how spread out our curve becomes! (4:55) If the standard deviation is large, then the data will become very spread out and our curve will look a bit more flat: (5:02) If the standard deviation is low, then the data is not too spread out and will be concentrated around the center: (5:09) Being able to simply look at a graph of data and determine if it has a large or small standard deviation is integral to understanding the main idea of standard deviation, which is simply: how spread out is our data? Show Edit Destroy
Standard deviation is part of statistics and studies how data deviates or changes from the data’s mean (average). Basically, it looks how the data spreads out in increments from the mean—a low standard deviation means that most of the numbers are close to the average, and a high standard deviation means that the numbers are more spread out. For example, imagine a teacher gives her 26 students a pre-assessment for a math class to learn how prepared her students are. If she scores all the pre-assessments and calculates the standard deviation of the scores, what would the result tell her? If the test scores have a low standard deviation, it means most of the students have a similar knowledge base, and it would be easier to tailor her curriculum. However, if the standard deviation is very high, it means there is a large variance in what the students know, and she may have to consider splitting the students into two groups based on knowledge level.   Calculating Standard Deviation The formula to calculate standard deviation is: To calculate standard deviation, follow these steps: Find the mean of the values in the population Subtract the mean from each value in the population Square each difference Find the sum of the squared differences Divide the sum of the squared differences by the number of population values in the population (mean) Take the square root of the final mean to find the standard deviation Population vs. Sample The above equation is how you calculate standard deviation for a population. However, if the data is a sample (a selection from the total population), then the calculation changes.  Still, the steps remain very similar. To calculate sample standard deviation: Find the mean of the sample values Subtract the mean from each sample value  Square each difference Find the sum of the squared differences Divide the sum of the squared differences by the number of sample values minus one (this is the only step that differs!) Take the square root of the above result to find the sample standard deviation  Examine the following practice problems to compare the population standard deviation and the sample standard deviation. Take the following data set:  1, 2, 4, 5, 8 Let’s first assume that these values comprise the total population. Now, here’s how to solve the problem if these values are just a sample of the total population. We can see that the values are slightly different. In particular, the sample standard deviation is slightly larger than the population standard deviation. If you are only working with a sample, you are less confident of the overall variation, thus we choose to estimate that the standard deviation is a bit higher. Standard Deviation Practice Problems Answers to Practice Problems Mean = $3,995.24. Standard deviation = 264.53. Set B has a larger standard deviation. When this group is treated as a population, standard deviation = 6.02. As a sample, standard deviation = 6.25. D A Show Edit Destroy

New Lesson
Piqosity © 2021 | Get Technical Support
Piqosity offers personalized practice for the ACT, P/SAT, and Upper Level ISEE.
Create a free account below to start practicing nearly 7,000 adaptive questions.

 

 



By clicking “Start Practicing,” you agree to our Terms and that you have read our Privacy Policy.