1) – Side effects of Lipitor
The drug Lipitor is meant to reduce total cholesterol and LDLcholesterol. In clinical trials, 19 out of 863 patients taking 10 mg of Lipitor daily complained of flulike symptoms. Suppose that it is known that 1.9% of patients taking competing drugs complain of flulike symptoms.
Part I  Is there significant evidence to support the claim that more than 1.9% of Lipitor users experience flulike symptoms as a side effect at the α = 0.01 level of significance?
Part II – Construct a 98% confidence interval estimate for the proportion of all patients who experience flulike symptoms when taking 10 mg Lipitor daily. (Go to page 8 to do this)
Part I  Is there significant evidence to support the claim that more than 1.9% of Lipitor users experience flulike symptoms as a side effect at the α = 0.01 level of significance?
a) Describe in words the population and success attribute
People taking 10 mg of Lipitor daily
Experience flulike symptoms
b) List all relevant statistics
x = 19, n = 863, From this we get phat = 19/863 = .022
α = 0.01, p = .019 (proportion of people who take other drug and have flu like symptoms)
Note: the proportion experiencing flulike symptoms is higher in the sample of Lipitor users; is that due to chance or could it be because of the drug?
c) Verify assumptions
We are going to assume that for patients taking 10 mg of Lipitor daily the proportion experiencing flu like symptoms is the same as the one for the competing drug; that is 1.9%
Using p = 0.019, we check the assumptions:
Np = 863 (.019) = 16.4, and n(1p) = 863(.981) = 846.6
Since both are greater than 10, the distribution of phat (for samples of size 863) is approximately normal
The mean is p = 0.019 and the standard deviation of phat is SQRT( 0.019*0.981/863) = 0.004647
d) Set both hypothesis
H0: p = .019
Ha: p > .019 right tailed test
e) Sketch graph, shade rejection region, label, and indicate possible locations of the point estimate in the graph.
You do this. The point estimate is phat = .022
****You should be wondering: Is phat =_0.022__ higher than p = _.019_ by chance, or is it significantly higher? The pvalue found below will help you in answering this.
f) Run the test: 1propZTest: and obtain
prop > 0.019, test statistic z = 0.65,
pvalue = P(phat > 0.022) = 0.258 > 0.01
If the true proportion is 0.019, it is very likely to observe a sample proportion phat of 0.022 or a more extreme one in samples of size 863. Phat of 0.022 is higher than 0.019 by chance.
At the 1% significance level, the sample data do not provide enough evidence to support the claim that more than 1.9% of Lipitor users experience flulike symptoms as a side effect.
Notice that the conclusion will be the same at any significance level because the pvalue is very high.
g) Use formulas to find the teststatistic
(use the ztable to find the area to the left of z = 0.65 which is 0.7422)
h) Find the pvalue showing all steps
P(phat > .022) = P(z > .65) = 1  .7422 = .2578 > significance level
Part II – Construct a 98% confidence interval estimate for the proportion of all patients who experience flulike symptoms when taking 10 mg Lipitor daily.
a) Verify assumptions
For confidence intervals the assumptions are that np and n(1p) are both greater than 15. This holds on true in this case
Np = 863 (.019) = 16.4, and n(1p) = 863(.981) = 846.6
b) Construct the interval with the calculator:
Use 1propZInterval and get (.0104 , .03364)
We are 98% confident that the proportion of Lipitor users experiencing flulike symptoms is somewhere between 0.0104 and 0.03364. Since the interval contains .019, it’s possible for p to be 0.019. With 98% confidence we can say that we don’t have enough evidence to support the claim that more than 1.9% of Lipitor users experience flulike symptoms.
This conclusion agrees with the hypothesis testing conclusion.
c) Now with the formulas:
.0104 < p < .0336 (notice the margin of error = 0.0116 ~ 0.012 = 1.2%)
d) Complete the following:

We are __98___% confident that the percentage of patients taking 10 mg of Lipitor daily complaining of flulike symptoms is between ___1.0% and __3.4%

With __98% confidence we can say that ___2.2% of patients who take 10 mg of Lipitor daily complain of flu like symptoms, with a margin of error of ____1.2%

The statement “98% confident” means that, if 100 samples of size __863___ were taken and the corresponding 98% confidence intervals constructed around the sample proportion, about __98___ of the intervals will contain the parameter p and about ___2 will not.

For 98% of such intervals, the sample proportion would not differ from the actual population proportion by more than ___.0116___
2) CLINICAL TRIAL OF TAMIFLU
Clinical trials involved treating flu patients with Tamiflu, which is a medicine intended to attack the influenza virus and stop it from causing flu symptoms. Among 724 patients treated with Tamiflu, 72 experienced nausea as an adverse reaction.
Part I – Use a 0.01% significance level to test the claim that the rate of nausea is greater than the 6% rate experienced by flu patients given a placebo. Does nausea appear to be a concern for those given the Tamiflu treatment?
Part II Construct a 98% confidence interval estimate for the proportion of patients treated with Tamiflu who experience nausea as an adverse reaction.
PART I
a) Describe in words the population and success attribute
Flu patients treated with Tamiflu
Experience nausea as an adverse reaction
b) List all relevant statistics
x = 72, n = 724, From this we get phat = 72/724 = 0.099 (9.9%)
α = 0.01, p = .06 (proportion experiencing nausea in the group receiving a placebo)
Note: the proportion experiencing nausea is higher in the sample of Tamiflu users; is that due to chance or could it be because of the drug?
c) Verify assumptions
We are going to assume that patients taking Tamiflu experience nausea at the same rate as the ones taking the placebo, which is 6%
Using p = 0.06, we check the assumptions:
Np = 724*0.06 = 43.4, and n(1p) = 724(.94) = 680.6
Since both are greater than 10, the distribution of phat (for samples of size 724) is approximately normal
The mean is p = 0.06 and the standard deviation of phat is SQRT( 0.06*0.94/724) = 0.008826
d) Set both hypothesis
H0: p = .06
Ha: p > .06 right tailed test
e) Sketch graph, shade rejection region, label, and indicate possible locations of the point estimate in the graph.
You do this. The point estimate is phat = .099
****You should be wondering: Is phat =_0.099__ higher than p = _.06_ by chance, or is it significantly higher? The pvalue found below will help you in answering this.
f) Run the test: 1propZTest: and obtain
prop > 0.06, test statistic z = 4.47,
pvalue = P(phat > 0.099) = 0.0000039 < 0.01 (very strong evidence against the null hypothesis and in favor of the alternative hypothesis)
If the true proportion experiencing nausea is 6% it is very unlikely to observe 9.9% of the people in the sample experiencing nausea. Phat of 0.099 is significantly higher than 0.06.
At the 1% significance level, there is strong evidence to support the claim that the proportion experiencing nausea is much higher in the group receiving Tamiflu. Nausea appears to be a concern for those taking Tamiflu.
Notice that the conclusion will be the same at any significance level because the pvalue is very small.
g) Use formulas to find the teststatistic
(Go to the ztable and get the area to the left of 4.47 which is almost 1)
h) Find the pvalue showing all steps
P(phat > .099) = P(z >4.47) = 1 – (almost 1) = almost zero < any significance level
Part II Construct a 98% confidence interval estimate for the proportion of patients treated with Tamiflu who experience nausea as an adverse reaction.
a) Verify assumptions
For confidence intervals the assumptions are that np and n(1p) are both greater than 15. This holds on true in this case
Np = 724*0.06 = 43.4, and n(1p) = 724(.94) = 680.6
b) Construct the interval with the calculator:
Use 1propZInterval and get (.07357, .12532)
We are 98% confident that the proportion of Tamiflu patients experiencing nausea is between 0.074 and 0.125. Since all values in the interval are higher than 0.06, with 98% confidence we can say that the rate at of nausea is greater in the Tamiflu group than in the placebo group.
This conclusion agrees with the hypothesis testing conclusion.
c) Now with the formulas:
.0731 < p < .1249 (notice the margin of error = 0.0259 ~ 0.026 = 2.6%)
d) Complete the following:

We are __98___% confident that the percentage of patients taking Tamiflu and experiencing nausea is between ___7.3% and __12.5%

With __98% confidence we can say that ___9.9% of patients who take Tamiflu experience nausea, with a margin of error of __2.6 %

The statement “98% confident” means that, if 100 samples of size __724___ were taken and the corresponding 98% confidence intervals were constructed around the sample proportion, about __98___ of the intervals will contain the parameter p and about ___2 will not.

For 98% of such intervals, the sample proportion would not differ from the actual population proportion by more than ___.0259___
3) SMOKING AND COLLEGE EDUCATION
A survey showed that among 785 randomly selected subjects who completed four years of college, 18.3% smoke.
Part I – Use a 1% significance level to test the claim that the rate of smoking among those with four years of college is less than the 27% rate for the general population.
Part II – Construct a 98% confidence interval estimate for the proportion of smokers among students who completed four years of college.
DONE IN CLASS
4)  Nasonex
In clinical trials of Nasonex, 3774 adult adolescent allergy patients (patients 12 years and older) were randomly divided into two groups. The patients in group 1 (experimental group) received 200 mcg of Nasonex, while the patients in Group 2 (control group) received a placebo. Of the 2103 patients in the experimental group, 547 reported headaches as side effect. Of the 1671 patients in the control group, 368 reported headaches as a side effect.
Part 1  Is there significant evidence to support the claim that the proportion of Nasonex users that experienced headaches as a side effect is greater than the proportion in the control group at the 0.05 significance level?
Part 2 – Use a feature of your calculator to construct a 90% confidence interval estimate for the difference between the two population proportions. What is the interval suggesting?
Population 1: Experimental group = allergy patients (12years and older) who received 200 mcg of Nasonex Population 2: Placebo group = allergy patients (12years and older) who received a placebo
Success attribute: experience headache
p1 is the proportion of all Nasonex users who experience headaches)
p2 is the proportion of all placebo (NonNasonex) users who experience headaches)
First: Verify assumptions
Check that in each population np and nq are both 10 or more
Part 1  Is there significant evidence to support the claim that the proportion of Nasonex users that experienced headaches as a side effect is greater than the proportion in the control group at the 0.05 significance level?
a) Set both hypothesis
This is a right tailed test
b) Sketch graph, shade rejection region, label, and indicate possible locations of the point estimate in the graph.
You do this
The point estimate is
****You should be wondering: Is the proportion that experience headaches in the experimental group larger than the one in the control group by chance, or is it significantly higher? The pvalue found below will help you in answering this.
c) Use a feature of the calculator to test the hypothesis. Indicate the feature used and the results:
Use 2PropZTest and get
Test statistic z = 2.84
P() = P(z > 2.84) = pvalue = .0023 < .05 (significance level)
The pvalue is very small. It is very unlikely (p = 0.023) to observe such a difference between sample proportions (p1 – p2 = 0.04) when the proportion experiencing headaches is the same in both populations. Such a difference between the sample proportions is significantly larger than zero and it is more likely to be observed when p1 is larger than p2.
At the 5% significance level we support the claim that proportion of Nasonex users that experienced headaches as a side effect is greater than the proportion in the control group
Part 2 – Use a feature of your calculator to construct a 90% confidence interval estimate for the difference between the two population proportions. What is the interval suggesting?
The point estimate is
Is by chance or significantly lower?
Note: In the experimental group, a higher percentage experience headache, could that be because of the drug?
Construct the interval by using 2PropZInterval and get .01695 < < .0628
We are 90% confident that the difference between the proportion experiencing headaches in the Nasonex and the placebo groups is between 0.01695 and 0.0628. Since the interval is completely above zero, it suggests that > 0 which means that . This is the same as the conclusion to the hypothesis testing part.
What is the margin of error?
0.0628 – 0.04 = 0.0228
(2) In view of your answers to part 1, could you reasonably conclude that Nasonex causes headaches as a side effect?
This was an experiment and in the case of an experimental study we could interpret statistical significance as a causal relationship.
In this case, the difference of 4% is statistically significant, but it may not have any practical significance. Most parents would be willing to accept the additional “risk” of a headache in order to relieve their child’s allergy symptoms.
5) – Vasectomies and Prostate Cancer
Approximately 450,000 vasectomies are performed each year in the U.S. In this surgical procedure for contraception, the tube carrying sperm from the testicles is cut and tied. Several studies have been conducted to analyze the relationship between vasectomies and prostate cancer. The results of one such study by E. Giovannucci et al. appeared in the paper “A Retrospective Cohort Study of Vasectomy and Prostate Cancer in U.S. Men”. Of 21,300 men who had not had a vasectomy, 69 were found to have prostate cancer; of 22,000 men who had had a vasectomy, 113 were found to have prostate cancer.
Part 1  At the 1% significance level, do the data provide sufficient evidence to conclude that men who have had a vasectomy are at greater risk of having prostate cancer?
Part 2 – Use the calculator to determine a 98% confidence interval for the difference between the prostate cancer rates of men who have had a vasectomy and those who have not.
Population 1: men without vasectomy
Population 2: men with vasectomy
Success attribute: have prostate cancer
Let p1 be the proportion of men without a vasectomy who have prostate cancer
Let p2 be the proportion of men with a vasectomy who have prostate cancer
NO VASECTOMY VASECTOMY
phat(1) = 0.0032 is our first estimate for p1
phat (2) = 0.0051 is our first estimate for p2
phat(1) – phat(2) = 0.0032 – 0.0051= 0.0019 is our first estimate for p1 – p2
0.32% of men in the NOvasectomy group and 0.51% in the vasectomy group had prostate cancer. Is the percent higher in the vasectomy group by chance or is it significantly higher? Is the proportion of men with prostate cancer lower in the group of men without a vasectomy?
We are going to assume that all assumptions are met and we can perform a test with the methods learned in class.
Part 1  At the 1% significance level, do the data provide sufficient evidence to conclude that men who have had a vasectomy are at greater risk of having prostate cancer?
a) Set both hypothesis
This is a left tailed test
b) Sketch graph, shade rejection region, label, and indicate possible locations of the point estimate in the graph.
You do this.
The point estimate is
****You should be wondering: Is the proportion of men with prostate cancer in the group without vasectomy lower than in the group with vasectomy by chance, or is it significantly lower? The pvalue found below will help you in answering this.
c) Use a feature of the calculator to test the hypothesis. Indicate the feature used and the results:
Use 2PropZTest and get
Test statistic z =  3.05
P() = P(z <  3.05) = pvalue = .001 < .01 (significance level)
It is unlikely to observe such a difference (0.0019) or a more extreme one between the sample proportions when you select samples from two populations that have the same proportions?
The proportion of men with prostate cancer in the group without vasectomy is significantly lower than the proportion in the group with vasectomy.
At the 1% significance level, the data provide sufficient evidence to conclude that men who have had a vasectomy are at greater risk of having prostate cancer.
Part 2 – Use the calculator to determine a 98% confidence interval for the difference between the prostate cancer rates of men who have had a vasectomy and those who have not. Are the results consistent with the results of the hypothesis test? Explain.
The point estimate is
Is by chance or significantly lower?
Construct the interval by using 2PropZInterval and get .0033 < < .0005
Since the interval is completely below zero, it suggests that < 0 which means
This is the same as. We get the same conclusion as in the hypothesis testing part.
With 98% confidence the data provide sufficient evidence to conclude that men who have had a vasectomy are at greater risk of having prostate cancer.
Some other related questions:
(1) Is this study a designed experiment or an observational study?
Observational
(2) In view of your answers to part 1, could you reasonably conclude that having a vasectomy causes an increased risk of prostate cancer?
No, for an observational study, it is not reasonable to interpret statistical significance as a causal relationship. In the case of an experimental study we could interpret statistical significance as a causal relationship. 