Math 210
Laboratory 8

Coffee and Smoking

In doing a project for a statistics class, a group of students wanted to see if there was a relationship between drinking coffee and smoking cigarettes.  The data found here, is simulated data based on the students findings.  Copy this data set into Minitab and answer the following questions.

  1. Let’s first do some preliminary work to see if our data make sense and that their seems to be a representative sample of all Hope students.
    1. What are the counts and proportions of males and females?  Does this seem reasonable for a random sample of Hope Students?  Why or why not?

    1.  Make a pie chart for the percent of males and females.

    1. What are the counts and proportions for each of the classes (F/So/J/Sr)?  Does this seem reasonable for a random sample of Hope students?  Why or why not?
    2. Make a bar graph for the number of students in each class.
    1.  Run some descriptive statistics on each of the quantitative variables (Cups_Coffee/Day and Cigarettes/Day) and make histograms for them as well.  Stats > Basic Stats >Display Descriptive Statistics then click on Graphs and check the box Histogram of data.  Look at the values of the mean, standard deviation, the five number summaries, and the histograms.  You should find one data value that is not right.  A piece of data was entered incorrectly.  What is it?  How should you fix it?  Adjust this entry appropriately before going on to the next question. (You do NOT need to report the descriptive statistics or put the histograms in your report.)

  1.  The students doing this survey were interested in whether there was an association between drinking coffee and smoking cigarettes.  They felt that if a student drank coffee, he or she would be more likely to smoke. 
    1. To investigate their hypothesis, we need to create a two-way table from this data.  Using Coffee drinking Yes and Coffee Drinking No as to two row headings and Cigarette Smoking Yes and Cigarette Smoking No as the two Column Headings.  Each (of the four) cells in the table will contain a count.  Create the two-way table.    
    1. What percent of all the students drink coffee?
    2. What percent of all the students smoke?  
    3. What percent of the students who drink coffee also smoke?
    4. What percent of students who do not drink coffee smoke?
    5. Based on these percents, do you think the student’s hypothesis that there is an association between smoking and coffee drinking is correct?  We will be able to answer this question more precisely after we study the sections on statistical tests, for now, just give your opinion on what the data show.

  1. So far we have just investigated the relationship between coffee drinking and smoking categorically.  We will now do this quantitatively.  
    1. Find the means and standard deviations for the number of cigarettes smokers smoke based on whether or not they drink coffee.

    1. Based on your means and standard deviations, do you think that coffee drinkers are more likely to smoke more?