Chi-Square Tests: Yahtzee
In a traditional Yahtzee game, five dice are tossed and the player
tries
to complete certain categories some of which are similar to poker like
three
of a kind, full house, or a straight. If the player gets all five
dice
to be all the same number, it is called a Yahtzee. There are hand
held
electronic versions of this game. In this game, dice are not
actually
tossed - the built in computer simulates this. This
might
make one wonder if these simulated dice are really random. To
answer
this question, a hand held electronic Yahtzee was used to simulate
drawing
five dice 100 times. The results can be found here. We are going to conduct two
types
of chi-square tests on this data set. The first will be to see if
there
is a relationship between the simulated die and the number shown on the
die.
The second will be to see if each simulated die gives an even
distribution
of outcomes in the long run.
- We want to perform a chi-square goodness of fit test on each column to determine if each is not distributed evenly. Write down the null and alternative hypotheses for this kind of test.
- The expected frequencies for each outcome is 100/6 or approximately 16.67. Use this to calculate the chi-square test statistic for Die_A. To do this, choose Calc > Calculator. In Store result in variable:, enter the name of an open column. In Expression:, enter SUM((Die_A-16.67)**2/16.67). Click OK. Repeat this for the other four dice. Report your five chi-square test statistics.
- We now need to calculate the P-values for each of our tests. To do this select Calc > Probability Distributions > Chi-Square. In the window that pops up click on Cumulative Probability, leave 0.0 as the Noncentrality parameter, put in 4 for your Degrees of freedom, click on Input constant, input your chi-square statistic for Die_A from part (b), and click OK. The value that Minitab gives you is one minus the P-value, so subtract your output from one to get the P-value. Repeat this for the other four dice.
- What are your conclusions for each of your five goodness-of-fit tests?
- The mean of a chi-square distribution is equal to its degrees of freedom. This means that if the null hypothesis is correct, we complete many different chi-square tests, we would expect the mean of our test statistics to be approximately the same as the degrees of freedom. What is the mean of your chi-square test statistics from part (c)? How does this compare with the number of degrees of freedom?