Suppose that the model is estimated with ui included rather


1. Consider the following linear regression model:

yi = β1 + β2xi2 + β3xi3 + εi = xi′ β + εi

(a) Explain how the ordinary least squares estimator for β is determined and derive an expression for b.

(b) Which assumptions are needed to make b an unbiased estimator for β?

(c) Explain how a confidence interval for β2 can be constructed. Which additional assump- tions are needed?

(d) Explain how one can test the hypothesis that β3 = 1.

(e) Explain how one can test the hypothesis that β2 + β3 = 0. (f) Explain how one can test the hypothesis that β2 = β3 = 0.

(g) Which assumptions are needed to make b a consistent estimator for β?

(h) Suppose that xi2 = 2 + 3xi3. What will happen if you try to estimate the above model?

(i) Suppose that the model is estimated with xi2 = 2xi2 - 2 included rather than xi2. How are the coefficients in this model related to those in the original model? And R2s?

(j) Suppose that xi2 = xi3 + ui, where ui and xi3 are uncorrelated. Suppose that the model is estimated with ui included rather than xi2. How are the coefficients in this model related to those in the original model? And the R2s?

2. Using a sample of 545 full-time workers, a researcher is interested in the question as to whether women are systematically underpaid compared with men. First, she estimates the average hourly wages in the sample for men and women, which are $5.91 and $5.09 respectively.

(a) Do these numbers give an answer to the question of interest? Why not?

The researcher also runs a simple regression of an individual's wage on a male dummy, equal to 1 for males and 0 for females. This gives the results reported in Table 1.

(b) How can you interpret the coefficient estimate of 0.82? How do you interpret the esti- mated intercept of 5.09?

(c) How do you interpret the R2 of 0.26?

(d) Explain the relationship between the coefficient estimates in the table and the average wage rates of males and females.

(e) A student is unhappy with this model as ‘a female dummy is omitted from the model'. Comment upon this criticism.

(f) Construct a 95% confidence interval for the average wage differential between males and females in the population.

Subsequently, the above ‘model' is extended to include differences in age and education by including the variables age (age in years) and educ (education level, from 1 to 5). Simultane- ously, the dependent variable is adjusted to be the natural logarithm of the hourly wage rate. The results are reported in Table 2.

(g) How do you interpret the coefficients of 0.13 for the male dummy and 0.09 for age?

(h) A student is unhappy with this model as ‘the effect of education is rather restrictive'. Can you explain this criticism? How could the model be extended or changed to meet the above criticism? How can you test whether the extension has been useful?

The researcher re-estimates the above model including age2 as an additional regressor. The t- value on this new variable becomes -1.14, while R2a increases to 0.683.

(i) Could you give a reason why the inclusion of age2 might be appropriate?

(j) Would you retain this new variable given the R2a measure? Would you retain age2 given its t-value? Explain this apparent conflict in conclusions.

Table 1: Hourly wages explained from gender: OLS results

Variable

Estimate

Standard error

t-ratio

constant

5.09

0.58

8.78

male

0.82

0.15

5.47

N = 545 s = 2.17 R2a = 0.26

Table 2: Log hourly wages explained from gender, age and education level: OLS results

Variable

Estimate

Standard error

t-ratio

constant

-1.09

0.38

2.88

male

0.13

0.03

4.47

age

0.09

0.02

4.38

educ

0.18

0.05

3.66

N = 545 s = 0.24 R2a = 0.682

3. The most important assumption for small sample OLS regression is the exogeneity of explana- tory variables. Consider the following regression equation based on a random sample,

yi = β0 + β1x1i + β2x2i + εi

In order for OLS to yield unbiased estimates, we need to assume that both x1 and x2 are exogenous to the error term ε, i.e.,

E(εi|x1i, x2i) = E(εi) = 0 (1)

Sometimes our main interest is in the marginal effect of x1 on y, with x2 only serving as a control variable. However, if x2 is not exogenous to the error term, the estimate of β2 will be biased. Moreover, the estimates of other coefficients, especially β1, will also be contaminated, which invalidates the entire research design.

In practice, the control variable x2 is very likely to be endogenous, i.e., assumption (1) does not hold. But if we are willing to impose the following assumption instead,

E(εi|x1i, x2i) = E(εi|x2i) (2)
we can still guarantee that the estimate of β1 is unbiased. This assumption is named "condi- tional mean independence". It implies that, once we control for x2, we can safely regard x1 as exogenous. For simplicity, let us assume further that

E(εi|x2i) = γ0 + γ1x2i
(a) Why is the conditional mean independence assumption (2) sufficient for the unbiasedness of the OLS estimate of β1?

(b) Does the conditional mean independence assumption (2) also guarantee unbiased esti- mates of β0 and β2?

4. Consider the following simple regression model

yi = β0 + β1xi + εi, i = 1, . . . , n

{yi, xi} are randomly sampled from the population. Assume that E(xiεi) = 0, Var(xi) = σx2 > 0, Var(εi) σ2ε > 0.

In practice, a student mistakenly treated x as the dependent variable, and y as the independent variable and obtained the OLS estimate b of the coefficient on y. When she realized the mistake, she used β^1 = b-1 as an estimate of β1.

(a) Show that β^1 is not a consistent estimator of β1.

(b) Show the direction of bias in β^1.

5. According to the great economist John Maynard Keynes, consumption per capita in the economy is a linear function of per capita income, i.e.,

Ct = β0 + β1Yt + εt (3)

t = 1, . . . , T denotes period of measure. ε assembles all the non-income factors that have an impact on consumption. β1 is called marginal propensity of consumption, 0 < β1 < 1. This function implies a causal relationship between consumption and income: by how much will consumption increase if income increases by one unit? Meanwhile, we know from the national income identity that income is comprised of consumption and investment (assume the absence of government), i.e.,

Yt = Ct + It

where It denotes per capita investment. Assume investment is exogenous, i.e.,

E(Itεt) = 0

Give a critical comment on the research design that tries to estimate the marginal propensity of consumption by running an OLS regression with respect to (3). In particular, will this sort of research over-estimate or under-estimate the true marginal propensity of consumption?

Request for Solution File

Ask an Expert for Answer!!
Econometrics: Suppose that the model is estimated with ui included rather
Reference No:- TGS01720255

Expected delivery within 24 Hours