Problem on multiple regression


Assignment:

My data set contains five variables for 96 nations of the world.

Onlinepop   Online population
PCs         Number of personal computers
Phones      Number of landline phones
Educ        Percent of GNP spent on education
GNPPC       Gross national product per capita
 
Here is the correlation matrix for the data set:

 

            Onlinepop   PCs        Phones     Educ       GNPPC
Onlinepop   1
PCs         0.990643    1
Phones      0.319927    0.275276   1
Educ        0.049997    0.049423   0.369801   1
GNPPC       0.509078    0.477851   0.874735   0.318304   1

The assignment was to regress Onlinepop on PCs, Phones, and Educ, and then to plot the residuals against the predicted values of the dependent variable Onlinepop in order to check for heteroskedasticity. This resulted in the following scatterplot.

[Figure: scatterplot of residuals against predicted Onlinepop (1708_Chart.jpg)]
Next, based on the results of the model, I was to consider three scenarios: first, triple Educ (education expenditure); second, double PCs; third, double Phones. Here are the results of the model. R-squared was 0.98.
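For a linear model, each scenario changes the prediction by the coefficient times the change in that variable, holding the other predictors fixed. The symbols b_0 through b_3 are assumed notation for the fitted intercept and slopes; their numeric values are not given here.

```latex
\widehat{\text{Onlinepop}} = b_0 + b_1\,\text{PCs} + b_2\,\text{Phones} + b_3\,\text{Educ}

% Scenario 1: triple Educ (Educ -> 3 Educ)
\Delta\widehat{\text{Onlinepop}} = b_3\,(3\,\text{Educ} - \text{Educ}) = 2\,b_3\,\text{Educ}

% Scenario 2: double PCs (PCs -> 2 PCs)
\Delta\widehat{\text{Onlinepop}} = b_1\,(2\,\text{PCs} - \text{PCs}) = b_1\,\text{PCs}

% Scenario 3: double Phones (Phones -> 2 Phones)
\Delta\widehat{\text{Onlinepop}} = b_2\,(2\,\text{Phones} - \text{Phones}) = b_2\,\text{Phones}
```

With a negative b_3, the first scenario lowers the predicted online population, which is why the sign of the Educ coefficient matters for the interpretation.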

 

Question: I don't understand why the variable Educ has a negative coefficient. Intuitively, I would expect a positive sign. Can you explain why? Could it be multicollinearity, or is it the heteroskedasticity at work?
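One way to explore the question with only the published correlation matrix: the standardized regression coefficients satisfy R_xx * beta = r_xy, where R_xx holds the correlations among the predictors and r_xy their correlations with Onlinepop, and the variance inflation factor (VIF) of each predictor is the matching diagonal element of the inverse of R_xx. The sketch below assumes the correlations above are Pearson correlations computed on the same 96 nations used in the regression; the helper `solve` and the variable names are illustrative, not part of the assignment.

```python
def solve(a, b):
    """Solve the linear system a x = b by Gauss-Jordan elimination
    with partial pivoting (small dense systems only)."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [x - f * y for x, y in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]

names = ["PCs", "Phones", "Educ"]

# Correlations among the predictors, copied from the matrix above.
r_xx = [[1.0,      0.275276, 0.049423],
        [0.275276, 1.0,      0.369801],
        [0.049423, 0.369801, 1.0]]

# Correlation of each predictor with Onlinepop.
r_xy = [0.990643, 0.319927, 0.049997]

# Standardized coefficients and the implied R-squared.
beta = solve(r_xx, r_xy)
r_squared = sum(b * r for b, r in zip(beta, r_xy))

# VIF_j = j-th diagonal element of the inverse of r_xx,
# obtained by solving against the j-th unit vector.
vif = [solve(r_xx, [1.0 if i == j else 0.0 for i in range(3)])[j]
       for j in range(3)]

for name, b, v in zip(names, beta, vif):
    print(f"{name}: standardized beta = {b:.4f}, VIF = {v:.3f}")
print(f"R-squared = {r_squared:.4f}")
```

Under these assumptions the Educ coefficient comes out slightly negative, R-squared is about 0.98 as reported, and all three VIFs stay close to 1. That points away from severe multicollinearity among these three predictors. A more likely reading is that Educ is almost uncorrelated with Onlinepop on its own (0.05) but positively correlated with Phones (0.37), so once Phones is held fixed the small remaining association flips sign. Heteroskedasticity affects the standard errors rather than the sign of an OLS coefficient, so it is an unlikely cause.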
