What percentage of cases fall between approximately 1 and +1 standard deviations on the normal curve?

Last updated
Save as PDF

Page ID7099

University of Missouri-St. Louis, Rice University, & University of Houston, Downtown Campus via University of Missouri’s Affordable and Open Access Educational Resources Initiative

\(Z\)-scores and the standard normal distribution go hand-in-hand. A \(z\)-score will tell you exactly where in the standard normal distribution a value is located, and any normal distribution can be converted into a standard normal distribution by converting all of the scores in the distribution into \(z\)-scores, a process known as standardization.

We saw in the previous chapter that standard deviations can be used to divide the normal distribution: 68% of the distribution falls within 1 standard deviation of the mean, 95% within (roughly) 2 standard deviations, and 99.7% within 3 standard deviations. Because \(z\)-scores are in units of standard deviations, this means that 68% of scores fall between \(z\) = -1.0 and \(z\) = 1.0 and so on. We call this 68% (or any percentage we have based on our \(z\)-scores) the proportion of the area under the curve. Any area under the curve is bounded by (defined by, delineated by, etc.) by a single \(z\)-score or pair of \(z\)-scores.

An important property to point out here is that, by virtue of the fact that the total area under the curve of a distribution is always equal to 1.0 (see section on Normal Distributions at the beginning of this chapter), these areas under the curve can be added together or subtracted from 1 to find the proportion in other areas. For example, we know that the area between \(z\) = -1.0 and \(z\) = 1.0 (i.e. within one standard deviation of the mean) contains 68% of the area under the curve, which can be represented in decimal form at 0.6800 (to change a percentage to a decimal, simply move the decimal point 2 places to the left). Because the total area under the curve is equal to 1.0, that means that the proportion of the area outside \(z\)= -1.0 and \(z\) = 1.0 is equal to 1.0 – 0.6800 = 0.3200 or 32% (see Figure \(\PageIndex{1}\) below). This area is called the area in the tails of the distribution. Because this area is split between two tails and because the normal distribution is symmetrical, each tail has exactly one-half, or 16%, of the area under the curve.

Figure \(\PageIndex{1}\): Shaded areas represent the area under the curve in the tails

We will have much more to say about this concept in the coming chapters. As it turns out, this is a quite powerful idea that enables us to make statements about how likely an outcome is and what that means for research questions we would like to answer and hypotheses we would like to test. But first, we need to make a brief foray into some ideas about probability.

The standard normal distribution is a normal distribution with a mean of zero and standard deviation of 1. The standard normal distribution is centered at zero and the degree to which a given measurement deviates from the mean is given by the standard deviation. For the standard normal distribution, 68% of the observations lie within 1 standard deviation of the mean; 95% lie within two standard deviation of the mean; and 99.9% lie within 3 standard deviations of the mean. To this point, we have been using "X" to denote the variable of interest (e.g., X=BMI, X=height, X=weight). However, when using a standard normal distribution, we will use "Z" to refer to a variable in the context of a standard normal distribution. After standarization, the BMI=30 discussed on the previous page is shown below lying 0.16667 units above the mean of 0 on the standard normal distribution on the right.

====

Since the area under the standard curve = 1, we can begin to more precisely define the probabilities of specific observation. For any given Z-score we can compute the area under the curve to the left of that Z-score. The table in the frame below shows the probabilities for the standard normal distribution. Examine the table and note that a "Z" score of 0.0 lists a probability of 0.50 or 50%, and a "Z" score of 1, meaning one standard deviation above the mean, lists a probability of 0.8413 or 84%. That is because one standard deviation above and below the mean encompasses about 68% of the area, so one standard deviation above the mean represents half of that of 34%. So, the 50% below the mean plus the 34% above the mean gives us 84%.

Probabilities of the Standard Normal Distribution Z

This table is organized to provide the area under the curve to the left of or less of a specified value or "Z value". In this case, because the mean is zero and the standard deviation is 1, the Z value is the number of standard deviation units away from the mean, and the area is the probability of observing a value less than that particular Z value. Note also that the table shows probabilities to two decimal places of Z. The units place and the first decimal place are shown in the left hand column, and the second decimal place is displayed across the top row.

But let's get back to the question about the probability that the BMI is less than 30, i.e., P(X<30). We can answer this question using the standard normal distribution. The figures below show the distributions of BMI for men aged 60 and the standard normal distribution side-by-side.

Distribution of BMI and Standard Normal Distribution

====

The area under each curve is one but the scaling of the X axis is different. Note, however, that the areas to the left of the dashed line are the same. The BMI distribution ranges from 11 to 47, while the standardized normal distribution, Z, ranges from -3 to 3. We want to compute P(X < 30). To do this we can determine the Z value that corresponds to X = 30 and then use the standard normal distribution table above to find the probability or area under the curve. The following formula converts an X value into a Z score, also called a standardized score:

where μ is the mean and σ is the standard deviation of the variable X.

In order to compute P(X < 30) we convert the X=30 to its corresponding Z score (this is called standardizing):

Thus, P(X < 30) = P(Z < 0.17). We can then look up the corresponding probability for this Z score from the standard normal distribution table, which shows that P(X < 30) = P(Z < 0.17) = 0.5675. Thus, the probability that a male aged 60 has BMI less than 30 is 56.75%.

Another Example

Using the same distribution for BMI, what is the probability that a male aged 60 has BMI exceeding 35? In other words, what is P(X > 35)? Again we standardize:

We now go to the standard normal distribution table to look up P(Z>1) and for Z=1.00 we find that P(Z<1.00) = 0.8413. Note, however, that the table always gives the probability that Z is less than the specified value, i.e., it gives us P(Z<1)=0.8413.

Therefore, P(Z>1)=1-0.8413=0.1587. Interpretation: Almost 16% of men aged 60 have BMI over 35.

Normal Probability Calculator

Z-Scores with R

As an alternative to looking up normal probabilities in the table or using Excel, we can use R to compute probabilities. For example,

> pnorm(0)

[1] 0.5

A Z-score of 0 (the mean of any distribution) has 50% of the area to the left. What is the probability that a 60 year old man in the population above has a BMI less than 29 (the mean)? The Z-score would be 0, and pnorm(0)=0.5 or 50%.

What is the probability that a 60 year old man will have a BMI less than 30? The Z-score was 0.16667.

> pnorm(0.16667)

[1] 0.5661851

So, the probabilty is 56.6%.

What is the probability that a 60 year old man will have a BMI greater than 35?

35-29=6, which is one standard deviation above the mean. So we can compute the area to the left

> pnorm(1)

[1] 0.8413447

and then subtract the result from 1.0.

1-0.8413447= 0.1586553

So the probability of a 60 year ld man having a BMI greater than 35 is 15.8%.

Or, we can use R to compute the entire thing in a single step as follows:

> 1-pnorm(1)

[1] 0.1586553

Probability for a Range of Values

What is the probability that a male aged 60 has BMI between 30 and 35? Note that this is the same as asking what proportion of men aged 60 have BMI between 30 and 35. Specifically, we want P(30 < X < 35)? We previously computed P(30<X) and P(X<35); how can these two results be used to compute the probability that BMI will be between 30 and 35? Try to formulate and answer on your own before looking at the explanation below.

Answer

Now consider BMI in women. What is the probability that a female aged 60 has BMI less than 30? We use the same approach, but for women aged 60 the mean is 28 and the standard deviation is 7.

Answer

What is the probability that a female aged 60 has BMI exceeding 40? Specifically, what is P(X > 40)?

Answer

return to top | previous page | next page

What percentage of cases fall between 1 and 1 standard deviation in the normalized distribution?

Empirical Rule or 68-95-99.7% Rule Approximately 68% of the data fall within one standard deviation of the mean. Approximately 95% of the data fall within two standard deviations of the mean. Approximately 99.7% of the data fall within three standard deviations of the mean.

What percentage of cases are between 1 and standard deviation?

In a normal distribution, 68% of cases fall within one standard deviation of the mean and 95% of cases fall within two standard deviations.

What percentage of data falls within the 1 standard deviation of a normal distribution?

Normal distributions come up time and time again in statistics. A normal distribution has some interesting properties: it has a bell shape, the mean and median are equal, and 68% of the data falls within 1 standard deviation.

What is 1 standard deviation on a normal curve?

A Standard Normal Distribution is a type of normal distribution with a mean of 0 and a standard deviation of 1. This means that the normal distribution has its center at 0 and intervals that increase by 1.

When the percentage change in the supply of a good is more than the percentage change in its price the elasticity of supply will be?

What analysis expresses each item on the income statement as a percentage of a specified base?

What percentage of eligible employees must be covered by non contributory group insurance plans

In which of the following contracts does the buyer pay the supplier for allowable performance costs along with a predetermined percentage based on total costs?

What percentage of cases fall between approximately 1 and +1 standard deviations on the normal curve?

Probabilities of the Standard Normal Distribution Z

Normal Probability Calculator

Z-Scores with R

Probability for a Range of Values

What percentage of cases fall between 1 and 1 standard deviation in the normalized distribution?

What percentage of cases are between 1 and standard deviation?

What percentage of data falls within the 1 standard deviation of a normal distribution?

What is 1 standard deviation on a normal curve?

zusammenhängende Posts

Online shopping differs from retail shopping because a larger percentage of it happens during the

What is the percentage of teenagers who experience major depression in which their psychological disorder?

Worldwide small and medium sized companies SMEs account for about what percentage of firms

What percentage of naval tonnage did the united states destroy under the london naval treaty?

What method of analyzing financial statement shows an account as a percentage of the whole?

When the percentage of changes in price results in the exact percentage change in supply supply is said to be?

When the percentage change in the supply of a good is more than the percentage change in its price the elasticity of supply will be?

What analysis expresses each item on the income statement as a percentage of a specified base?

What percentage of eligible employees must be covered by non contributory group insurance plans

In which of the following contracts does the buyer pay the supplier for allowable performance costs along with a predetermined percentage based on total costs?

Toplist

Neuester Beitrag

Stichworte