Math 210
Laboratory 8
Chi-Square Tests: Heart disease, baldness, and
Yahtzee
On January 24, 2000 the Holland Sentinel reported on a study that
found
the greater the hair loss on top of a man's head, the greater the risk
for heart disease. This study, co-authored by Dr. JoAnn Manson
from
Harvard's Brigham and Women's Hospital in Boston, analyzed baldness
patterns
of 22,000 male doctors who were 40 to 84 years old when enrolled in the
Physician's Health Study. Eleven years into the study, the
researcher
asked the doctors to describe their patterns of baldness at age 45.
- In another study researchers selected a sample of 663 heart
disease
patients
(male) and a control group of 772 males not suffering from heart
disease.
Each was asked to classify their degree of baldness on a 5-point scale.
The results are given in the following table. Use these data to
answer
the following questions.
|
Baldness:
|
None
|
Little
|
Some
|
Much
|
Extreme
|
|
Heart_Disease
|
251
|
165
|
195
|
50
|
2
|
|
Control
|
331
|
221
|
185
|
34
|
1
|
- Of those in the control group, what percent claimed to have:
- Little or no baldness?
- Some, much or extreme baldness?
- Of those with heart disease, what percent claimed to have:
- Little or no baldness?
- Some, much or extreme baldness?
- Lets now make a bar chart of our data. To do this, put
the data in Minitab in the same order as shown in the table
above. Select Graph > Bar
Chart Under Bars
Represent select Values from a
table and choose Two-Way Table
and Cluster.
Under graph variable put the
none to extreme columns. Under Row
labels put the column that has heart disease and baldness.
Click on Columns are outermost
categories and rows are innermost. Click on Chart Options and then select Show Y as a Percent and under Take Percent and/or Accumulate click
on Within categories at 1 level..
Click OK and OK. Edit your axis labels
appropriately. If there is no relationship between heart disease
and bladness, each part of the graph should look the same.
- At this stage in the investigation, state whether you think
there is a
relationship between heart disease and baldness. Explain your answer.
- We are now going to run a test of significance on this.
- State the null and alternative hypotheses you would use to
test whether
there is a relationship between heart disease and baldness.
- Complete a chi-square test to determine if there is a
relationship
between
heart disease and baldness. Use Stat
> Tables > Chi-square Test.) Make sure you give
the chi-square
statistic, the P-value, and a conclusion.
- Briefly discuss whether your results necessarily mean that
heart
disease
is caused by baldness or baldness is caused by heart disease.
In a traditional Yahtzee game, five dice are tossed and the player
tries
to complete certain categories some of which are similar to poker like
three
of a kind, full house, or a straight. If the player gets all five
dice
to be all the same number, it is called a Yahtzee. There are hand
held
electronic versions of this game. In this game, dice are not
actually
tossed - the built in computer simulates this. This
might
make one wonder if these simulated dice are really random. To
answer
this question, a hand held electronic Yahtzee was used to simulate
drawing
five dice 100 times. The results are found below. We are
going to conduct a chi-square goodness of fit test on these data to see
if the simulated die does not give an even
distribution
of outcomes.
number
on the dice
|
1
|
2
|
3
|
4
|
5
|
6
|
observed frequency
|
72
|
79
|
82
|
80
|
95
|
92
|
- Another type of chi-square test is a
goodness-of-fit
test. Instead of testing the entire two-way table for
independence
as done in question 1, this test can determine whether each die is
gives
an even distribution or not.
- We want to perform a chi-square goodness of fit test to
determine if the numbers on the simulated dice are not distributed
evenly. Write down the
null
and alternative hypotheses for this kind of test.
- If the numbers on the dice are distributed evenly, what should
each excepted frequency be?
- Make a bar graph of the data. To do this, number on the
dice in column 1 and the observed frequency in column 2. Select Graph > Bar Chart > Simple.
Under Bars Represent select Values from a table. Your
graph variable should be C2
and the Categorical variable should be C1. Edit your axis labels
appropriately. Do not leave them as C1 and C2. Do you think there is
enough difference in your sample proportions to say that the numbers
are not distributed evenly for the population?
- To complete the chi-square test put the expected frequencies in
column 1. Now select Stat > Tables > Chi-Square
Goodness-of-Fit Test (One Variable). In Observed Counts,
enter C2. Under Test, choose Equal proportions.
Click OK. Report your test statistic and P-value.
- What is your conclusion for your goodness-of-fit
test?