Estimate the least square regression lines and explain the


There are 2 excel assignments that are to be used for the assignment. first attachment (data set for assignment for 1 and 2)- questions 1-5. second attachment(extra file for assignment 2) question 6-7.

The Data Set for Assignments in file DataSetForAss presents Weekly Income (WI), Weekly Expenditure on Food (WEF), Highest Level of Education (HLE)1, Family Size (FS), and Gender of the Head of Household (GHH) for the a population of 1000 households.

1. Column A consists of Households are named by number from 1 to 1000.

2. Columns B to F record these households' WI, WEF, HLE, FS and GHH, respectively.

3. Column I presents 50 samples of households as your sample, consisting:

o 1st household is based on the first 3 digit of your student ID
o 2nd household is based on the last 3 digit of your student ID
o 3rd - 5th is based on the day, month and the last 2 digits year of your birthday, respectively.
o 6th - 50th is randomly chosen.

Example: if your student ID is: 181XX728 and you were born in 31st of March 1985

1st household is no. 181
2nd household is no. 728
3rd household is no. 31,
4th is no. 03, and
5th is no. 85
6th to 50th is randomly chosen

1. Provide 95% confidence interval estimates for the mean FS (µFS) WEF (µWEF) and WI (µWI) for the population of 1000 households based on your selected sample of size 50 households.

2.Test the following Hypothesis that the population average

(i) Family Size is three. (ii) WI is at least $500.

(iii) WEF is greater than $300

Against a suitable alternative at the 10% level of significance. What can you conclude? (15 Marks)

3. Estimate the following least square regression lines and explain the meaning of the y-intercept (b0) term and the estimate of slope coefficients (b1) in (i) and (ii) below but (b1 & b2) in (iii) :
(i) WEF on WI

(ii) WEF on FS

(iii) WEF on WI and FS

4. Construct confidence intervals for the estimates of the slope coefficients in (i) to (iii) of Question 3. Explain the meaning of these confidence intervals and comment on your results about the width of the confidence intervals, if sample size increases from 50 to 100.

5. Find the values of the correlation coefficients and the coefficients of determination, and explain their meaning for the 50 pairs of:

(i) WI and WEF values

(ii) WEF and FS values

For question 6 and 7 below please refer to the excel files (extra file for assignment 2 on LMS)

6. In that excel file (Oslo tab), there is 50 sample data of Weekly Income (WI), Weekly Expenditure on Food (WEF) of one of the most prosper city in the world, Oslo, Norway. Prime Minister of Norway Erna Solberg claimed that:

(i) The population average of Weekly Income (WI) for her city is higher compared to your sample of 50 households.

(ii) The population mean of Family Size (FS) for her city is at least the same with your sample of 50 households.

Based on the Erna Solberg's statement, perform the analysis on hypothesis testing with level of significance of 5%. Do you think Erna Solberg's statement is true?

There are few assumptions that you may need to put in your mind, when you perform this test:

A. Populations for both of your sample data and OSLO are normally distributed and samples are independent.

B. Population variances for Weekly Income (WI) are unknown and unequal.

C. Population variances for FS are unknown and equal.

7. As one of the largest city in USA, New York also known as the food city. In this city people spend so much money in food, and Bill de Blasio, Mayor of New York believe, that the average amount of weekly food expenditure (WEF) spent by households is equal with your sample data. In order to prove that he collects a random sample of 50 households data of his city. (The data is attached on extra file for assignment 2, New York tab).

Based on the Bill de Blasio's statement, perform the analysis on hypothesis testing with level of significance of 5%. Do you think Bill de Blasio's statement is correct?

You may consider the following assumptions while performing this test:

A. Populations for both of your sample data and New York are normally distributed and samples are not independent.

B. Population variances of Weekly Food on Expenditure (WEF) are unknown and unequal.

Attachment:- Assignment.rar

Solution Preview :

Prepared by a verified Expert
Applied Statistics: Estimate the least square regression lines and explain the
Reference No:- TGS01129776

Now Priced at $75 (50% Discount)

Recommended (94%)

Rated (4.6/5)