How many individuals are there in the data set


Homework

I. Experiment. Suppose that the state of Ohio wishes to estimate how additional school funding affects academic outcomes (e.g., test scores, graduation rate). To do this, the state plans to conduct an experiment in which a subset of districts is selected to receive an additional $2,000 per student. There are 611 school districts in Ohio.

i. List the steps that the experiment should follow in order to ensure that the correct causal estimates are found. Be specific.

ii. Explain how using a randomized experiment eliminates the potential for omitted variable bias. Use your own words.

iii. Is it possible to test if the treatment and control districts are balanced on unobservable characteristics? Explain.
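
As one illustration of the random-assignment step in Question I, the sketch below draws a treatment group from the 611 districts. The number of treated districts (100) and the seed are assumptions made for the example; the problem does not specify them, and the district IDs 1 to 611 are placeholders.

```python
import random

N_DISTRICTS = 611   # from the problem statement
N_TREATED = 100     # hypothetical: the problem does not say how many districts get funding
SEED = 2024         # hypothetical seed so the draw can be reproduced and audited


def assign_treatment(n_districts, n_treated, seed):
    """Map each district id (1..n_districts) to 1 (treated) or 0 (control)."""
    rng = random.Random(seed)
    # Sampling without replacement gives every district the same chance of treatment.
    treated = set(rng.sample(range(1, n_districts + 1), n_treated))
    return {d: int(d in treated) for d in range(1, n_districts + 1)}


assignment = assign_treatment(N_DISTRICTS, N_TREATED, SEED)
n_treated = sum(assignment.values())
n_control = N_DISTRICTS - n_treated
print(f"treated: {n_treated}, control: {n_control}")
```

After a draw like this, balance can be checked only on observed characteristics (for example, baseline test scores), by comparing their means across the two groups.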

II. Group Fixed Effects. This question examines whether office workers are better paid than other employees. You should use the data set "industry_benefits.dta" to examine this question.

i. What is the average income for those who do and do not work in an office? Based on this, what is the wage gap between office workers and other employees?

ii. Estimate the effect of working in an office on annual earnings and write the resulting regression equation. Is working in an office a statistically significant determinant of income?

iii. Now add education and experience as additional variables and write the resulting regression equation. How did these control variables affect the coefficient magnitude and statistical significance of being an office worker on income?

iv. Explain why these changes occurred in part iii) by examining the correlation between the new variables (education and experience) and being an office worker and earnings. Use the expression for omitted variable bias.

v. Add industry fixed effects to the regression in part iii). Write the resulting regression equation and explain how this affected the coefficient on being an office worker.

vi. Explain the comparisons that are being used in the fixed effects regression in part v) and how they differ from the comparisons used in part iii).

vii. Rank the industries from the lowest to the highest paying while holding education, experience, and being male fixed.
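
For part iv), the omitted variable bias expression can be written as follows in its standard textbook form for a single omitted variable. The symbols (w for the omitted variable, the Greek coefficients) are notation assumed here, not taken from the data set.

```latex
% Long (true) regression, with w_i the omitted variable (e.g., education):
\text{earnings}_i = \beta_0 + \beta_1\,\text{office}_i + \beta_2\,w_i + u_i

% Auxiliary regression of the omitted variable on the included regressor:
w_i = \delta_0 + \delta_1\,\text{office}_i + v_i

% Expected coefficient from the short regression that leaves out w_i:
E[\tilde{\beta}_1] = \beta_1 + \beta_2\,\delta_1
```

The bias term is the product of the omitted variable's effect on earnings and its association with office work, so its sign is the product of the signs of those two terms.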

III. Group and Time Fixed Effects. This question examines the relationship between being married and wages. You should use the data set "wage_panel.dta" to examine this relationship.

i. How many individuals are there in the data set? For how many years is each person observed?

ii. Regress the log of wages on being married and write the results. What is the estimated effect of being married?

iii. Add individual fixed effects to the regression in order to make comparisons within a person over time (you may use the "areg" command). What is the estimated effect of being married?

iv. You suspect that time is an important omitted variable. Discuss how "year" is likely to be correlated with wages and being married and how this will bias your estimate in part iii).

v. Add "year" to the regression and write the results. Interpret the coefficient on "year". How did adding year affect the coefficient on married? Is this consistent with your prediction in part iv)?

vi. Now add year fixed effects. How did this affect the coefficient on being married?

vii. Rank the years from lowest to highest in terms of wages while holding marital status fixed.
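
To illustrate what individual fixed effects do in part iii), the sketch below applies the within (demeaning) transformation by hand. The six rows are made-up toy values, not observations from "wage_panel.dta", and the tuple layout is an assumption for the example. In Stata the same comparison is what "areg" with the person identifier absorbed would produce.

```python
from collections import defaultdict

# Hypothetical toy panel: (person, year, married, log_wage). NOT from wage_panel.dta.
rows = [
    ("A", 1, 0, 1.00), ("A", 2, 1, 1.20),
    ("B", 1, 0, 2.00), ("B", 2, 1, 2.10),
    ("C", 1, 1, 3.00), ("C", 2, 1, 3.10),
]


def person_means(rows, col):
    """Average of column `col` within each person."""
    sums, counts = defaultdict(float), defaultdict(int)
    for r in rows:
        sums[r[0]] += r[col]
        counts[r[0]] += 1
    return {p: sums[p] / counts[p] for p in sums}


mean_married = person_means(rows, 2)
mean_wage = person_means(rows, 3)

# Subtract each person's own mean, so only within-person changes remain.
x = [r[2] - mean_married[r[0]] for r in rows]
y = [r[3] - mean_wage[r[0]] for r in rows]

# OLS slope on the demeaned data equals the fixed-effects estimate.
within_slope = sum(a * b for a, b in zip(x, y)) / sum(a * a for a in x)
print(f"within-person estimate of married: {within_slope:.3f}")
```

Person C never changes marital status, so that person's demeaned "married" values are all zero and contribute nothing to the estimate. Only people whose status changes over time identify the coefficient.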

IV. Instrumental Variables. You wish to identify the causal effect of increased agricultural productivity (output per acre) on weekly income. You have data on the average weekly income (in dollars) for five provinces in India, the average weekly output of crops (in kilograms per acre), and average monthly rainfall in centimeters.

province   income   output   rain
prov1      10       6        5
prov2      14       10       25
prov3      8        2        10
prov4      6        4        5
prov5      12       8        10

i. Regress income on output per acre and report the results. Interpret the coefficient on output.

ii. Identify a potentially omitted variable that may bias the coefficient on output (and identify the expected sign of the bias generated by omitting this variable).

iii. Under what assumptions is rainfall a valid instrument for output? Be specific.

iv. Find the estimated effect of output per acre on income using rain as an instrumental variable and report the results. Interpret the coefficient on output.

v. Compare the OLS estimate from part i) and the IV estimate from part iv) and discuss whether each can be interpreted as a causal effect.
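
A minimal sketch of the point estimates for parts i) and iv), using only the five observations given in the table. It computes slopes from sums of cross-products and does not produce standard errors, which a full answer would also report. The helper names are choices made for this example.

```python
# Data from the table in Question IV.
income = [10, 14, 8, 6, 12]   # average weekly income (dollars)
output = [6, 10, 2, 4, 8]     # average weekly output (kg per acre)
rain = [5, 25, 10, 5, 10]     # average monthly rainfall (cm)


def mean(v):
    return sum(v) / len(v)


def cross_dev(a, b):
    """Sum of cross-products of deviations from the means of a and b."""
    ma, mb = mean(a), mean(b)
    return sum((x - ma) * (y - mb) for x, y in zip(a, b))


# OLS: slope of income on output.
ols_slope = cross_dev(output, income) / cross_dev(output, output)
ols_intercept = mean(income) - ols_slope * mean(output)

# IV with one instrument: cov(rain, income) / cov(rain, output).
iv_slope = cross_dev(rain, income) / cross_dev(rain, output)
iv_intercept = mean(income) - iv_slope * mean(output)

print(f"OLS: income = {ols_intercept:.3f} + {ols_slope:.3f} * output")
print(f"IV:  income = {iv_intercept:.3f} + {iv_slope:.3f} * output")
```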

V. Instrumental Variables. Using the same data as above you are going to examine an alternative way of generating the instrumental variable estimates.

i. Estimate the effect of rain on output and report the results.

ii. Estimate the effect of rain on income and report the results.

iii. Show how you can use the estimates from these two regressions to generate the IV estimate you found in Question IV, part iv). Explain the intuition behind this approach.
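
The two regressions in Question V can be sketched as below, again with the five observations from the table. The ratio in the last step is the usual reduced-form over first-stage calculation for a single instrument; the function and variable names are assumptions for this example, and standard errors are not computed.

```python
# Data from the table in Question IV.
income = [10, 14, 8, 6, 12]
output = [6, 10, 2, 4, 8]
rain = [5, 25, 10, 5, 10]


def slope(x, y):
    """OLS slope from regressing y on x (with an intercept)."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx


first_stage = slope(rain, output)    # part i): effect of rain on output
reduced_form = slope(rain, income)   # part ii): effect of rain on income

# Part iii): income change per cm of rain, divided by output change per cm of rain.
iv_from_ratio = reduced_form / first_stage

print(f"first stage:  {first_stage:.4f}")
print(f"reduced form: {reduced_form:.4f}")
print(f"ratio (IV):   {iv_from_ratio:.4f}")
```

The ratio works because, if rain affects income only through output, the reduced-form effect is the first-stage effect multiplied by the effect of output on income.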

VI. Regression Discontinuity. A university offers scholarships to applicants based on their high school grade point averages (GPA). In 2016, any student who earned a 3.65 GPA or higher was offered a scholarship of $10,000 (scholar). We estimate whether this program leads more students to attend the university, using data for 2,500 applicants.

$$\widehat{attend}_i = \underset{(0.09)}{0.38} + \underset{(0.05)}{0.20}\,(GPA_i - 3.65) + \underset{(0.03)}{0.07}\,scholar_i$$

(standard errors in parentheses)

i. Interpret the coefficient on "GPA-3.65" in a sentence. Does this make sense? Explain.
ii. Interpret the coefficient on "scholar" in a sentence.
iii. Explain what it means for the coefficient in part ii) to be called a local average treatment effect.
iv. Is the effect of the scholarship on attending the university statistically significant at the 95% level?
v. What is the probability of a student attending the university if they have a GPA of 3.64? 3.66?
vi. Draw a graph of this regression equation and label the slope and the discontinuity.
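
The arithmetic behind parts iv) and v) can be sketched by evaluating the fitted equation above on each side of the cutoff. The function name is a choice made for this example, and the 1.96 critical value assumes the usual large-sample normal approximation.

```python
CUTOFF = 3.65  # scholarship threshold from the problem statement


def predicted_attend(gpa):
    """Predicted attendance probability from the fitted equation."""
    scholar = 1 if gpa >= CUTOFF else 0  # scholarship offered at or above the cutoff
    return 0.38 + 0.20 * (gpa - CUTOFF) + 0.07 * scholar


p_below = predicted_attend(3.64)
p_above = predicted_attend(3.66)

# t-statistic for the scholarship coefficient: estimate / standard error.
t_scholar = 0.07 / 0.03
significant_95 = abs(t_scholar) > 1.96  # assumes the normal approximation

print(f"P(attend | GPA = 3.64) = {p_below:.3f}")
print(f"P(attend | GPA = 3.66) = {p_above:.3f}")
print(f"t = {t_scholar:.2f}, significant at 95%: {significant_95}")
```

The gap between the two predictions is almost entirely the 0.07 jump at the cutoff, since the slope term changes the prediction by only 0.004 over a 0.02 change in GPA.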

Format your homework according to the following formatting requirements:

(1) The answer should be typed, double spaced, using Times New Roman font (size 12), with one-inch margins on all sides.

(2) Include a cover page containing the title of the homework, the student's name, the course title, and the date. The cover page is not included in the required page length.

(3) Include a reference page. Citations and references should follow APA format. The reference page is not included in the required page length.
