The data set shown below gives the calories and salt content in 17 brands of meat hot dogs.
• Enter the data set into excel. Using excel perform a least squares regression on the data set. Use Sodium as the y variable and Calories as the x variable. Make a scatter plot with the regression line plotted as well. Using excel compute the correlation coefficient.
• Insert the plot your created into a word document. Discuss the scatter plot and regression line. Are their outliers? Does your correlation coefficient support your analysis? Does it tell you anything else?
• If there is an outlier remove the outlier from the data set and repeat the previous two parts.
• Can you conclude that hot dogs with more calories will have more sodium (i.e. discuss correlation and causation)?
| Brand |
Calories |
Sodium (mg) |
| 1 |
173 |
458 |
| 2 |
191 |
506 |
| 3 |
182 |
473 |
| 4 |
190 |
545 |
| 5 |
172 |
496 |
| 6 |
147 |
360 |
| 7 |
146 |
387 |
| 8 |
139 |
386 |
| 9 |
175 |
507 |
| 10 |
136 |
393 |
| 11 |
179 |
405 |
| 12 |
153 |
372 |
| 13 |
107 |
144 |
| 14 |
195 |
511 |
| 15 |
135 |
405 |
| 16 |
140 |
428 |
| 17 |
138 |
339 |