A measurement follows the normal distribution with a standard deviation of 15 and an unknown expectation μ. You can consider that measurement to be the “original” distribution.
Two statisticians propose two distinct ways to estimate the unknown quantity μ with the aid of a sample of size 36.
They will do that by evaluating two different SAMPLING DISTRIBUTIONS to determine which method is better.
There are two statisticians involved in this task: statistician “A” and statistician “B.”
Statistician A proposes to use the sampling distribution of the sample average as an estimate.
Statistician B proposes to use the sampling distribution of the sample median instead.
In order to choose between the two options, they agree to prefer the statistic that has a smaller variance (with respect to the sampling distribution).
Tasks 1-9 refer to this problem of comparing the two statistics to each other.
Assume that the actual expectation of the measurement is equal to 5 (μ=5). Then the expectation of the statistic that was proposed by Statistician A is equal to: ?
Assume that the actual expectation of the measurement is equal to 5 (μ=5). Then the standard deviation of the statistic that was proposed by Statistician A is equal to: ?
the measurement is equal to 5 (μ=5). Then the expectation of the statistic that was proposed by Statistician B is equal to: ?
Assume that the actual expectation of the measurement is equal to 5 (μ=5). Then the standard deviation of the statistic that was proposed by Statistician B is equal to: ?
Assume that the actual expectation of the measurement is equal to 2.3 (μ=2.3). Then the expectation of the statistic that was proposed by Statistician A is equal to: ?
Assume that the actual expectation of the measurement is equal to 2.3 (μ=2.3). Then the standard deviation of the statistic that was proposed by Statistician A is equal to: ?
Assume that the actual expectation of the measurement is equal to 2.3 (μ=2.3). Then the expectation of the statistic that was proposed by Statistician B is equal to: ?
Assume that the actual expectation of the measurement is equal to 2.3 (μ=2.3). Then the standard deviation of the statistic that was proposed by Statistician B is equal to: ?
Based on the information collected in Tasks 1-8, which of the two statistics produces values which tends to be more concentrated about the expectation of the measurement?
# N ~ (m, 15)n=36N=10^5# population size (assumed)# m = mu.X = ? # mean of the populationsigma.X=15# sd of the populationvar.X=(15)^2# variance of the populationm1=5XA=rep(0,N)XB=rep(0,N)for (iin1:N){X.samp=rnorm(n,m1,sd.X)# generate samples of size n=36XA[i]=mean(X.samp)# XA: the meanXB[i]=median(X.samp)# XB: the median}### 1XA.mu=mean(XA)# mean of XA, or: XA.mu = mu.X = m1 = 5### 2XA.var=var(XA)# varianceXA.sigma=sqrt(XA.var)# sd of XA### 3XB.mu=mean(XB)# mean of XB, or XB.mu = mu.X = m1 = 5### 4XB.var=var(XB)XB.sigma=sqrt(XB.var)m2=2.3YA=rep(0,N)YB=rep(0,N)for (iin1:N){Y.samp=rnorm(n,m2,sd.X)# generate samples of size n=36YA[i]=mean(Y.samp)# YA: the meanYB[i]=median(Y.samp)# YB: the median}### 5YA.mu=mean(YA)# mean of YA### 6YA.var=var(YA)# varianceYA.sigma=sqrt(YA.var)# sd of YA### 7YB.mu=mean(YB)# mean of YB,### 8YB.var=var(YB)YB.sigma=sqrt(YB.var)### 9# we see that Statistician A has smaller variance in both cases, so he is more accurate.
2.Normal Approximation of the Sampling Distribution of a Sum¶
# Poisson ~ (lambda = 5.35)lambda=5.35mu.X.sec=lambda=5.35var.X.sec=5.35sigma.X.sec=sqrt(var.X)mu.X.minute=lambda*60var.X.minute=lambda*60sigma.X.minute=sqrt(var.X.minute)### 1lambda*60# number of phone calls per minute### 2qnorm(0.80,mu.X.minute,sigma.X.minute)### 3# P (x <= 300)pnorm(300,mu.X.minute,sigma.X.minute)
Recall that the population average of the heights in the file “pop1.csv” is μ = 170.035. Using simulations we found that the probability of the sample average of the height falling within 1 centimeter of the population average is approximately equal to 0.626.
From the simulations we also got that the standard deviation of the sample average is (approximately) equal to 1.122.
In the next 3 questions you are asked to apply the Normal approximation to the distribution of the sample average using this information. The answer may be rounded up to 3 decimal places of the actual value.
Using the Normal approximation, the probability that sample average of the heights falls within 1 centimeter of the population average is
Using the Normal approximation we get that the central region that contains 95% of the distribution of the sample average is of the form 170.035 ± z · 1.122. The value of z is
Using the Normal approximation, the probability that sample average of the heights is less than 168 is
According to the Internal Revenue Service, the average length of time for an individual to complete (record keep, learn, prepare, copy, assemble and send) IRS Form 1040 is 10.53 hours (without any attached schedules).
The distribution is unknown. Let us assume that the standard deviation is 2 hours.
Suppose we randomly sample 36 taxpayers and compute their average time to completing the forms.
Then the probability that the average is more than 11 hours is approximately equal to (The answer may be rounded up to 3 decimal places of the actual value.) ?
Suppose that a category of world class runners are known to run a marathon (26 miles) in an expectation of 145 minutes with a standard deviation of 14 minutes.
Consider 49 of the races. In the next 3 questions you are asked to apply the Normal approximation to the distribution of the sample average using this information. The answer may be rounded up to 3 decimal places of the actual value.
A switching board receives a random number of phone calls. The expected number of calls is 5.3 per minute. Assume that the distribution of the number of calls is Poisson.
The average number of calls per minute is recorded by counting the total number of calls received in one hour, divided by 60, the number of minutes in an hour.
In the next 4 questions you are asked to apply the Normal approximation to the distribution of the sample average using this information. The answer may be rounded up to 3 decimal places of the actual value.
It is claimed that the expected length of time some computer part may work before requiring a reboot is 2 months. In order to examine this claim 80 identical parts are set to work.
Assume that the distribution of the length of time the part can work (in months) is Exponential.
In the next 4 questions you are asked to apply the Normal approximation to the distribution of the average of the 80 parts that are examined.
The answer may be rounded up to 3 decimal places of the actual value.
The central region that contains 90% of the distribution of the average is of the form E(X) ± c, where E(X) is the expectation of the sample average. The value of c is
The probability that the average is more than 2.5 months is