What is the size of your data set - what level scale of


Problem I - Write your first name, middle name, and last name in capital letters. The letters in your full name make up your data set. If you do not have a middle name, or do not want to use your real middle name, make one up. Then, do the following:

1. Write your data in order from A to Z and double check. For example, the student whose complete name is First Middle Last would have

A  D  D  E  F  I  I  L  L  M  R  S  S  T  T
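The ordering in this example can be double-checked with a short Python sketch. The variable names are illustrative only, and the name used is the placeholder "First Middle Last" from the example, not a real student's name.

```python
# Placeholder name from the example; replace it with your own full name.
full_name = "First Middle Last"

# Keep the letters only, convert them to capitals, and sort from A to Z.
letters = sorted(ch for ch in full_name.upper() if ch.isalpha())

print(" ".join(letters))
print(len(letters))  # how many letters the data set contains
```

Repeated letters stay in the list, which is what "with existing repetitions" asks for.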

Your full name:....................

Letters in order with existing repetitions:

2. What is the type of your data? Circle, or list, all that apply:

Numerical, continuous, discrete, categorical, non-numerical, quantitative, qualitative

3. What is the size of your data set?

4. What level (scale) of measurement is applicable to your data (nominal, ordinal, interval, ratio)? Support your answer briefly.

5. Is the word "range," with its actual definition in statistics, applicable to your data set? How can you say something about your data involving "range" in your statement, anyway?

6. Is your data set a sample or a population? Support your answer briefly.

7. Depending on your answer to Question 6 above, and recalling what we said in class, what is the correct notation to show the size of your data set in statistics?

8. What is (are) the mode(s) within your data set, if any? Is your data set unimodal, bimodal, trimodal, ...?

9. What is the frequency of the mode(s)?

10. Recalling the example discussed in class or provided in your eTextbooks, construct a frequency distribution table with three frequency columns in addition to the category column. You should choose a title for your table and use the following headings for your table columns:

Letter, Frequency (F), Relative Frequency (RF), and Cumulative Frequency (CF).

11. Using the frequency table created in Step 10 above and, preferably, hand drawing on graph paper (show at least some work, in case you use technology),

(a) Construct a bar chart for the F distribution, preferably a Pareto Bar Chart (See NOTE below)

(b) Construct a bar chart for the RF distribution, preferably a Pareto Bar Chart; you may present your relative frequencies in percent. (See NOTE below)

(c) Compare your F distribution with your RF distribution. Briefly explain your finding(s).

NOTE: You may do Parts (a) and (b) displaying the categories from highest F (or RF) to lowest F (or RF) from left to right; the resulting bar chart is called a "Pareto Bar Chart."

Please note that each bar chart must have a descriptive title, and the x and y axes must have descriptive labels.

12. Construct a pie chart to graphically display the relative frequencies in percentages; show the basis for the "exact" size of each "slice" of the "pie," noting that a full circle is 360 degrees. (In case you use technology to produce the pie chart, show some calculations to demonstrate that you know what is involved in finding the share of each category from the whole circle.)
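The slice calculation amounts to multiplying each relative frequency by 360 degrees. The following Python sketch illustrates it for the placeholder name "First Middle Last"; the variable names are assumptions made for this illustration.

```python
from collections import Counter

# Letters of the placeholder name "First Middle Last", in capitals.
letters = "FIRSTMIDDLELAST"
counts = Counter(letters)
n = len(letters)

# For each letter: (share in percent, central angle in degrees).
slices = {}
for letter, f in sorted(counts.items()):
    rf = f / n
    slices[letter] = (rf * 100, rf * 360)

for letter, (pct, deg) in slices.items():
    print(f"{letter}: {pct:5.1f}%  {deg:6.1f} degrees")
```

The angles should add up to 360 degrees and the percentages to 100, which makes a convenient check on hand calculations.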

Problem II - Choose and write down 10 distinct (different) whole numbers (no modes) less than 100 in a way that your data set would have a range of 87, a mean of 57, and a median of 50.

(a) In order not to treat the data as an abstract set, state what your data might represent with an applicable unit.

(b) Show your data set and your work to demonstrate that your data set does have the statistical characteristics mentioned.
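A candidate data set can be tested against the stated requirements with a small Python sketch. The function name check_data_set and the candidate list are hypothetical, introduced only for illustration; the candidate is one of many possible data sets and does not replace showing your own work.

```python
from statistics import mean, median

def check_data_set(data):
    """Report whether each Problem II requirement holds for the data."""
    return {
        "ten values": len(data) == 10,
        "distinct": len(set(data)) == len(data),
        "whole numbers below 100": all(
            isinstance(x, int) and 0 <= x < 100 for x in data
        ),
        "range is 87": max(data) - min(data) == 87,
        "mean is 57": mean(data) == 57,
        "median is 50": median(data) == 50,
    }

# Hypothetical candidate, used only to exercise the checks.
candidate = [12, 20, 26, 30, 49, 51, 90, 95, 98, 99]

for requirement, ok in check_data_set(candidate).items():
    print(f"{requirement}: {ok}")
```

With ten values, the median is the average of the fifth and sixth ordered values, so those two values must add up to 100, and the ten values together must add up to 570.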

(c) Calculate the midrange.

(d) Estimate the standard deviation using the "range rule of thumb," which is based on the fact that four standard deviations practically cover the span of the data (about 95% for normal distributions), when rank ordered. Do not calculate the actual standard deviation value.
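Parts (c) and (d) each reduce to one line of arithmetic: the midrange is the average of the smallest and largest values, and the range rule of thumb takes the standard deviation to be roughly one fourth of the range. A minimal Python sketch, assuming a hypothetical data set chosen only for illustration:

```python
# Hypothetical data set, used only to illustrate the arithmetic.
data = [12, 20, 26, 30, 49, 51, 90, 95, 98, 99]

data_range = max(data) - min(data)      # largest value minus smallest value
midrange = (max(data) + min(data)) / 2  # average of the two extremes
s_estimate = data_range / 4             # range rule of thumb

print(f"range = {data_range}, midrange = {midrange}, s is about {s_estimate}")
```

Because any data set that meets the requirements has a range of 87, the rule-of-thumb estimate is the same for every valid answer; the midrange depends on which minimum you chose.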

(e) Determine the 34th percentile.

(f) Determine the Interquartile Range (IQR).

(g) Are your verified/computed values "statistics" or "parameters"? Explain your answer briefly.

(h) Construct a boxplot for your data set. Any outliers?
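The percentile, quartile, and outlier calculations in Parts (e), (f), and (h) can be cross-checked with Python's standard statistics module. This is only a sketch under stated assumptions: the data set is hypothetical, the "exclusive" method of statistics.quantiles is one of several percentile conventions and may not match the method taught in class or in the eTextbook, and the 1.5 x IQR fences are the common convention for flagging outliers.

```python
from statistics import quantiles

# Hypothetical data set, used only for illustration.
data = [12, 20, 26, 30, 49, 51, 90, 95, 98, 99]

# Quartiles under the "exclusive" convention (positions based on n + 1).
q1, q2, q3 = quantiles(data, n=4, method="exclusive")
iqr = q3 - q1

# 34th percentile: the 34th of the 99 cut points that split the data in 100.
p34 = quantiles(data, n=100, method="exclusive")[33]

# Fences for flagging outliers on a boxplot (assumed 1.5 x IQR convention).
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr
outliers = [x for x in data if x < lower_fence or x > upper_fence]

print(f"Q1 = {q1}, median = {q2}, Q3 = {q3}, IQR = {iqr}")
print(f"P34 = {p34}")
print(f"fences = ({lower_fence}, {upper_fence}), outliers = {outliers}")
```

If your hand-computed quartiles differ slightly from the output, check first whether the difference comes from the percentile convention rather than from an arithmetic slip.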

(i) Based on the appearance of your boxplot, is the distribution of your data set normal, close-to-normal, left skewed, or right skewed?

(j) Using your estimate of the standard deviation, determine what percentage of the data points fall within one standard deviation of the mean. Briefly explain why your computed percentage is close to, or far from, what the "Empirical Rule" says.
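Part (j) is a counting exercise: find the interval from the mean minus s to the mean plus s, then count the data points inside it. A minimal Python sketch, assuming a hypothetical data set and the range-rule-of-thumb estimate of s:

```python
from statistics import mean

# Hypothetical data set, used only for illustration.
data = [12, 20, 26, 30, 49, 51, 90, 95, 98, 99]

x_bar = mean(data)
s_estimate = (max(data) - min(data)) / 4  # range rule of thumb

# Interval covering one estimated standard deviation on each side of the mean.
low, high = x_bar - s_estimate, x_bar + s_estimate
inside = [x for x in data if low <= x <= high]
percent_inside = 100 * len(inside) / len(data)

print(f"interval = ({low}, {high})")
print(f"{len(inside)} of {len(data)} values inside: {percent_inside}%")
```

The Empirical Rule predicts about 68% for bell-shaped data, so a result far from 68% is itself evidence about the shape of the distribution.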

Problem III - In problem 84 of Chapter 1 of Illowsky's eTextbook (Table 1.37), which was one of your homework problems, the class intervals or bins have been listed under "Age." We are interested in knowing the mean age of the chief executive officers (CEOs) involved in the study.

(a) Can we calculate the exact mean age of the CEOs studied, based on the information provided in the table? Briefly explain your answer.

(b) To estimate the mean age of the CEOs studied, we can resort to the class intervals under Age (the left-most column of the table); see Section 2.5 of Illowsky's eTextbook. Consider the midpoint (midrange) of each class interval to represent the age of each CEO in that class interval. For example, the three CEOs in the class interval "40-44" are considered to be (40+44)/2 = 42 years of age each. Then, three values of our data set would be 42, 42, 42, noting that the class frequency is 3. Find the midpoints of the remaining class intervals and note their corresponding class frequencies to complete your data set; you may devote one column to the midpoint values. Then, find the estimated mean CEO age and report the value to two decimal places.
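The midpoint method can be sketched in Python as a frequency-weighted average. Only the "40-44" class with frequency 3 comes from the text above; the other two rows are made-up placeholders, NOT the values of Table 1.37, so substitute the actual table before relying on any result.

```python
# Each entry is (lower limit, upper limit, class frequency).
# Only the first row is quoted in the problem; the rest are placeholders.
classes = [
    (40, 44, 3),
    (45, 49, 5),  # placeholder frequency, not from Table 1.37
    (50, 54, 2),  # placeholder frequency, not from Table 1.37
]

# Midpoint of each class: (lower limit + upper limit) / 2.
midpoints = [(lo + hi) / 2 for lo, hi, _ in classes]
freqs = [f for _, _, f in classes]
n = sum(freqs)

# Estimated mean: each midpoint weighted by its class frequency.
est_mean = sum(m * f for m, f in zip(midpoints, freqs)) / n

print(f"midpoints = {midpoints}")
print(f"estimated mean = {est_mean:.2f}")
```

The result is an estimate because every CEO in a class is treated as if aged exactly at the class midpoint.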

(c) Find an estimate for the median age. Support your answer by relating your answer to the actual definition of median.

(d) Having estimated the mean and median in Parts (b) and (c) above, respectively, estimate the "midrange value" for the data represented in the table, as the third measure of the center of the data.

(e) Among the estimates found in Parts (b), (c), and (d) above, which one is relatively more accurate than others? Support your answer by a brief explanation.

(f) Draw a histogram based on the information provided in the table.

(g) Based on the appearance of your histogram (drawn for Part (f) above), state whether the frequency distribution is almost normal, skewed to the left, or skewed to the right; explain your answer briefly.

(h) Estimate the standard deviation to accompany the estimated mean, as the mean and standard deviation go hand in hand. Hand calculation would be easy in this case and is highly recommended for this quiz; please see the formulas provided in Section 2.7 of Illowsky's eTextbook. Standard deviation is simply the square root of the variance.

In case you use technology to do the calculation, you should show some hand calculations to demonstrate that you know how it is done manually; otherwise, you will not earn full credit.
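The grouped-data calculation can be laid out step by step in Python, mirroring the columns of a hand calculation. This sketch assumes the sample form of the variance (dividing by n - 1); check Section 2.7 and your answer about sample versus population to decide which divisor applies. Only the "40-44" class with frequency 3 comes from the problem text; the other rows are made-up placeholders, NOT the values of Table 1.37.

```python
from math import sqrt

# Each entry is (class midpoint, class frequency).
# Only the first row is quoted in the problem; the rest are placeholders.
grouped = [(42, 3), (47, 5), (52, 2)]

n = sum(f for _, f in grouped)
est_mean = sum(m * f for m, f in grouped) / n

# Column of f * (m - mean)^2, one entry per class, as in a hand calculation.
weighted_sq_dev = [f * (m - est_mean) ** 2 for m, f in grouped]

variance = sum(weighted_sq_dev) / (n - 1)  # assumed sample form (n - 1)
s = sqrt(variance)                         # standard deviation

print(f"estimated mean = {est_mean:.2f}")
print(f"f * (m - mean)^2 column = {weighted_sq_dev}")
print(f"variance = {variance:.4f}, standard deviation = {s:.4f}")
```

Writing out the intermediate column is what a grader typically expects as evidence of the manual method, even when the final square root is taken with a calculator.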

(i) As a "sanity check," show that your result for Part (h) is somewhere between one fourth of the range and one sixth of the range; how does it compare with the mean of the two bounds?

Problem IV - Briefly explain, with reason, the level (scale) of measurement applicable to numerical grades on an academic exam.
