How will you find the interval estimate of the population mean if the population standard deviation is known?

You are estimating the population mean, mu, not the sample mean, x bar.

Population Standard Deviation Known

If the population standard deviation, sigma is known, then the mean has a normal (Z) distribution.

The maximum error of the estimate is given by the formula for E shown

. The Z here is the z-score obtained from the normal table, or the bottom of the t-table as explained in the introduction to estimation. The z-score is a factor of the level of confidence, so you may get in the habit of writing it next to the level of confidence.

Once you have computed E, I suggest you save it to the memory on your calculator. On the TI-82, a good choice would be the letter E. The reason for this is that the limits for the confidence interval are now found by subtracting and adding the maximum error of the estimate from/to the sample mean.

Student's t Distribution

When the population standard deviation is unknown, the mean has a Student's t distribution. The Student's t distribution was created by William T. Gosset, an Irish brewery worker. The brewery wouldn't allow him to publish his work under his name, so he used the pseudonym "Student".

The Student's t distribution is very similar to the standard normal distribution.

It is symmetric about its mean
It has a mean of zero
It has a standard deviation and variance greater than 1.
There are actually many t distributions, one for each degree of freedom
As the sample size increases, the t distribution approaches the normal distribution.
It is bell shaped.
The t-scores can be negative or positive, but the probabilities are always positive.

Degrees of Freedom

A degree of freedom occurs for every data value which is allowed to vary once a statistic has been fixed. For a single mean, there are n-1 degrees of freedom. This value will change depending on the statistic being used.

Population Standard Deviation Unknown

If the population standard deviation, sigma is unknown, then the mean has a student's t (t) distribution and the sample standard deviation is used instead of the population standard deviation.

The maximum error of the estimate is given by the formula for E shown

. The t here is the t-score obtained from the Student's t table. The t-score is a factor of the level of confidence and the sample size.

Notice the formula is the same as for a population mean when the population standard deviation is known. The only thing that has changed is the formula for the maximum error of the estimate.

Table of Contents

Here we consider the joint estimation of a multivariate set of population means. That is, we have observed a set of p X-variables and may wish to estimate the population mean for each variable. In some instances, we may also want to estimate one or more linear combinations of population means. Our basic tool for estimating the unknown value of a population parameter is a confidence interval, an interval of values that is likely to include the unknown value of the parameter.

General Format for a Confidence Interval

The general format of a confidence interval estimate of a population mean is:

\(\text{Sample mean} \pm \text{Multiplier × Standard error of mean}\)

For variable \(X_{j}\), a confidence interval estimate of its population mean \(\mu_{j}\) is

\(\bar{x}_j \pm \text{Multiplier}\dfrac{s_j}{\sqrt{n}}\)

In this formula, \(\bar{x}_{j}\) is the sample mean, \(s_{j}\) is the sample standard deviation and n is the sample size. The multiplier value is a function of the confidence level, the sample size, and the strategy used for dealing with the multiple inference issue.

Strategies for Determining the Multiplier Section

The following list covers some common strategies:

One at a Time Confidence Intervals: This strategy essentially considers each mean separately and uses the desired confidence level (usually 95%) for each single interval.
Bonferroni Method: With this method we set a family wide error rate and then divide this family error rate by the number of intervals to be computed to determine the error rate (and hence confidence level) for each individual interval.
Simultaneous Confidence Region: This strategy uses properties of the multivariate normal distribution to define joint confidence intervals. The multiplier for this method is conservative because the family error rate applies to the family of all possible linear combinations of population means.

One at a Time Intervals Section

For a \(1 - \alpha\) confidence interval, the “one at a time” multiplier is the t-value such that the probability is \(1 - \alpha\) between –t and +t under a t-distribution with n - 1 degrees of freedom. Said another way, the value of t is such that the probability greater than +t is \(\alpha/2\).

Notationally, the one at a time multiplier is:

\(\text{Multiplier} = t_{n-1}(\alpha/2)\)

With this notation, a confidence interval for \(\mu_{j}\) is computed as:

\(\bar{x}_j \pm t_{n-1}(\alpha/2)\frac{s_j}{\sqrt{n}}\)

Note! The notation for the t-multiplier can be confusing because it varies between textbooks and statistical software. For instance, Excel’s command to determine the p-value requires that you give the value of α whereas SAS requires that you give the cumulative probability \(1 - \alpha / 2\) for the desired t-value.

Example 5-1: One at a Time Intervals

Suppose that the sample size is n = 25 and we want a 95% confidence interval for the population mean. Thus \(\alpha = 0.05\). Our textbook would write the multiplier as \(t_{24}(.025)\). In Excel, the command =TINV(.05,24) will give the multiplier (value = 2.064). In SAS, a command such as t1=tinv(.975,24) will make the variable t1 that contains the desired multiplier.

Bonferroni Method Multiplier Section

When we determine confidence intervals for the population means of several variables, we are creating a family of confidence intervals. The family-wide error rate is the probability that at least one of the confidence intervals in the family will not capture the population mean. The family-wide confidence level = 1 – family-wide error rate.

Suppose that we have a family of p confidence intervals and the error rates for the individual intervals are \(\alpha _ { 1 } , \alpha _ { 2 } , \dots , \alpha _ { p }\). The Bonferroni Inequality states that the family wide-error rate is less than or equal to the sum of \(\alpha _ { 1 } , \alpha _ { 2 } , \dots , \alpha _ { p }\). That is family-wide error rate \(\leq \Sigma \alpha _ { i }\). In terms of the family-wide confidence that all intervals capture their population means, we can write this as \(1 - \Sigma \alpha _ { i } \leq\) family-wide confidence level.

Most often, we divide the desired family-wide error rate equally across the intervals that we will compute. If we are computing p confidence intervals with a desired family wide confidence level of \(\alpha\), we use an error rate of \(\alpha / p\) (so confidence \(= 1 - (\alpha / p)\) for each individual interval. This guarantees that the family wide confidence level will be greater than or equal to \(1 - \alpha\).

Suppose that we are calculating p intervals with a family error rate equal to \(\alpha\).

Notationally, the Bonferroni method multiplier is:

\(\text{Multiplier} = t_{n-1}(\alpha/2p)\)

A confidence interval for\(\mu_{j}\) is computed as:

\(\bar{x}_j \pm t_{n-1}(\alpha/2p)\frac{s_j}{\sqrt{n}}\)

Example 5-2: Bonferroni Method Multiplier

Suppose that n = 25. The family wide error = 5% for a family confidence = 95%. We are computing intervals for p = 5 means. The error rate for each interval will be .05/5 = 1%. We might use the Excel command = TINV(.01,24) to find that the multiplier = 2.797. In SAS, we use the cumulative probability \(= 1- \alpha /2p\) so the command for finding the t-multiplier in this instance is something like t1=tinv(.995, 24).

Simultaneous Confidence Region Multiplier Section

This method is derived from properties of the multivariate normal distribution. The multiplier applies to the family of all possible linear combinations of the population means considered, including the individual means. It is conservative (meaning that the multiplier tends to be larger than absolutely necessary). When family confidence is used, compare the value of this multiplier to the Bonferroni method multiplier and use the smaller of the two.

Notationally, the simultaneous confidence region multiplier is:

\(\text{Multiplier}=\sqrt{\frac{p(n-1)}{n-p}F_{p,n-p}(\alpha)}\)

\(F _ { p , n - p } ( \alpha )\) represents a value of F such that the probability greater than this value is α under an F-distribution with p and n - p degrees of freedom.

Example 5-3: Simultaneous Confidence Region Multiplier

Suppose that we have a sample size of n = 25 and we have p = 3 variables. With a 5% family error rate (and 95% family confidence), the F-value can be found in Excel using = FINV(.05, 3, 22) = 3.049. SAS uses cumulative probabilities so in this case, a command like f1= FINV(.95,3, 22) would make f1 be the F-value. The multiplier in this example is

\(\sqrt{\frac{3(25-1)}{25-3}3.049}=3.159\)

This multiplier could be used for all confidence intervals for parameters that are linear combinations of the three population means (and for the three individual means).

Summary of Multipliers Section

The following table summarizes the three different multipliers and gives notes about using Excel and SAS.

MethodTextbook notation for multiplierExcel notesSAS notes

One at a time: Confidence = (\(1 - \alpha)\) for each interval

\(t _ { n - 1 } ( \alpha / 2 )\)

To determine the t -value, enter the equation TINV(\(\alpha, \text{df}\))

To determine the t-value, create t1= tinv(\(1 - \alpha/2,\ n-1\))

Bonferroni Method: Confidence = \(1 - \alpha)\) for whole family

\(t _ { n - 1 } ( \alpha / 2 p )\)

To determine the t -value, enter the equation TINV(α / p, df)

To determine the t- value, create t2= tinv(\(1 - \alpha / 2p,\ n-1\))

Multivariate Simultaneous Intervals

\(\sqrt{\frac{p(n-1)}{n-p}F_{p,n-p}(\alpha)}\)

To determine the F value, enter the equation FINV(\(\alpha,\text{num df, denom df})\)

To determine the F- value, create

F= finv(\(1 - \alpha, \text{p, n-p})\)

Example 5-4 Section

This example uses the dataset that includes mineral content measurements at three different arm bone locations for n = 25 women . We’ll determine confidence intervals for the three different population means. Sample means and standard deviations for the three variables are:

Simple StatisticsVariableNMeanStd Dev

domradius	25	0.84380	0.11402
domhumerus	25	1.79268	0.28347
domulna	25	0.70440	0.10756

Click to expand the solution using each method.

We’ll use a .95 confidence level for each interval. With n = 25, df = 24 and \(t _ { 24 } ( .025 ) = 2.064\). This can found in Excel as =TINV(.05,24).

The confidence intervals have the form \(\bar{x}_j \pm 2.064\dfrac{s_j}{\sqrt{n}}\). Intervals are the following.

For dominant radius:
\(0.84380 \pm 2.064 \dfrac{0.11402}{\sqrt{25}}\) which is 0.797 to 0.891
For dominant humerus:
\(1.79268 \pm 2.064 \dfrac{0.28347}{\sqrt{25}}\) which is 1.676 to 1.910
For dominant ulna:
\(0.70440 \pm 2.064\dfrac{0.10576}{\sqrt{25}}\) which is 0.660 to 0.749

We’ll use a .95 confidence family-wide level so the family error = .05. For each interval, the error rate = .05/3 = 0.16666… The multiplier is \(t _ { 24 } ( .008333 ) = 2.574\) which can be found in Excel as =TINV(.05/3,24).

The confidence intervals have the form \(\bar{x}_j \pm 2.574\dfrac{s_j}{\sqrt{n}}\). Intervals are the following.

For dominant radius:
\(0.84380 \pm 2.574 \dfrac{0.11402}{\sqrt{25}}\) which is 0.785 to 0.903
For dominant humerus:
\(1.79268 \pm 2.574 \dfrac{0.28347}{\sqrt{25}}\) which is 1.647 to 1.939
For dominant ulna:
\(0.70440 \pm 2.574 \dfrac{0.10576}{\sqrt{25}}\) which is 0.649 to 0.760

The necessary F value is \(\sqrt{\dfrac{3(25-1)}{25-3}3.049} = 3.159\). (See Example 3 above for details)

The confidence intervals have the form \(\bar{x}_j \pm 3.159 \dfrac{s_j}{\sqrt{n}}\). Intervals are the following.

For dominant radius:
\(0.84380 \pm 3.159 \dfrac{0.11402}{\sqrt{25}}\) which is 0.772 to 0.916
For dominant humerus:
\(1.79268 \pm 3.159 \dfrac{0.28347}{\sqrt{25}}\) which is 1.614to 1.972
For dominant ulna:
\(0.70440 \pm 3.159 \dfrac{0.10576}{\sqrt{25}}\) which is 0.636 to 0.773

Minitab

Using SAS

Steve Rathbun, formerly of Penn State, wrote the following SAS code (download below) to generate confidence intervals for population means using the three methods discussed here. The code reads a dataset, reshapes it to have a data line for each variable value, determines means and standard deviations and then calculates and prints the three types of intervals. To use this code for different situations, you need only to change the third line where the value of p is set and the data step where the data set is read and reshaped.

Download the SAS program here: CI_pop_means.sas

The output for the program just given is below. It includes the sample mean and variance for each variable and the three confidence intervals. Limits for the one at a time intervals are given as loone and upone. Limits for the Bonferroni method are given as lobon and upbon. Limits for the simultaneous confidence region method are given as losim and upsim.

Example: Nutrient Intake Data - Descriptive Statistics

The MEANS Procedure

VariableNMeanStd DevMinimumMaximum

Calcium	737	624.0492537	397.2775401	7.4400000	2866.44
Iron	737	11.1298996	5.9841905	0	58.6680000
Protein	737	65.8034410	30.5757564	0	251.0120000
A	737	839.3653460	1633.54	0	34434.27
C	737	78.9284464	73.5952721	0	433.3390000

Using Minitab

Click on the video below to get walk throughs of the three methods as they are presented below: the one-at-a-time confidence interval, the Bonferroni method and the multivariate simultaneous interval method, all the Minitab statistical software application.

Previous5.1 - Distribution of Sample Mean Vector
Next5.3 - Inferences for Correlations

How do you find the interval with mean and standard deviation?

Since 95% of values fall within two standard deviations of the mean according to the 68-95-99.7 Rule, simply add and subtract two standard deviations from the mean in order to obtain the 95% confidence interval.

How do you find the interval estimate of the population mean?

In the large-sample case, a 95% confidence interval estimate for the population mean is given by x̄ ± 1.96σ/ √n. When the population standard deviation, σ, is unknown, the sample standard deviation is used to estimate σ in the confidence interval formula.

How do you find the population mean from the population standard deviation?

It is calculated by dividing the standard deviation by the square root of the sample size ( ), and so it gets smaller as the sample size gets bigger. In other words, with a very large N, the sample mean approaches the population mean.

How do you find the standard deviation of an interval estimate?

The standard deviation for each group is obtained by dividing the length of the confidence interval by 3.92, and then multiplying by the square root of the sample size: For 90% confidence intervals 3.92 should be replaced by 3.29, and for 99% confidence intervals it should be replaced by 5.15.

How will you find the interval estimate of the population mean if the population standard deviation is known?

Population Standard Deviation Known

Student's t Distribution

Degrees of Freedom

Population Standard Deviation Unknown

Strategies for Determining the Multiplier Section

One at a Time Intervals Section

Example 5-1: One at a Time Intervals

Bonferroni Method Multiplier Section

Example 5-2: Bonferroni Method Multiplier

Simultaneous Confidence Region Multiplier Section

Example 5-3: Simultaneous Confidence Region Multiplier

Summary of Multipliers Section

Example 5-4 Section

Using SAS

Using Minitab

How do you find the interval with mean and standard deviation?

How do you find the interval estimate of the population mean?

How do you find the population mean from the population standard deviation?

How do you find the standard deviation of an interval estimate?

zusammenhängende Posts

Where can you find a credible or reliable source of information to get accurate details?

Find compound interest on Rs 50000 for 2 years at rate 8 p.a. compounded half-yearly

Find the amount and the compound interest on rupees 11000 at 8 per annum for 2 years

How do you find the sample standard deviation from the population standard deviation?

Find compound interest on Rs 50000 at 12% per annum for 6 months compounded quarterly

Which of the following conflict handling intentions might involve attempting to find a win/win solution?

Which of the following conflict-handling intentions involves attempting to find a win-win solution

Wie finde ich den Besitzer von AirPods?

A sum of money triples itself at 10% per annum at simple interest find the number of years

A sum of money triples itself in 12 years at simple interest find the rate of interest

Toplist

Neuester Beitrag

Stichworte