The beer data gives information for 50 us states and, Dissertation

The beer data gives information for 50 us states and

Part 1 -

Q1. Star Field is home to the Lions professional baseball team. The team's new marketing director, Janna Kay, has been trying to develop a better understanding of the key drivers of attendance at the stadium to increase ticket revenues, optimize concession inventories and staffing, and schedule the timing of promotional giveaways. Using some historical data on a set of information, we build up the following model:

Attendance=b0+bl nightgame+b2 temp_f+b3 sunday+b4 saturday+b5 friday+b6 promo+b7 openingday+b8 school+u,

where Attendance is the total attendance of the game, temp_f is the high temperature of the game day, nightgame is a dummy variable indicating whether game is played during the night, Friday, Saturday and Sunday are the dummies for the day of the week, promo, opening_day and school are dummies indicating whether there are some promotional activities, whether it is the opening day, and whether local public school

1. Estimate the model.

2. According to the estimation results, which day of the week usually has the highest attendance? Why?

3. How much is the attendance expected to change if the high temperature on the game day becomes 10 degrees higher?

4. What is the estimated difference of the average attendance between night games and daytime games?

5. Predict the attendance on a regular (not opening day) Monday afternoon with 100 degree high temperatures when the school system is not in session, also assuming that no promotion is available.

Suppose that we believe the attendance is not always increasing as the high temperature increases and we add the square term of high temperature, tempf-2, in the model

attendance=b0+bl nightgame+b2 temp_f+b3 sunday+b4 saturday+b5 friday+b6 promo+b7 openingday+b8 school+b9 temp_f^2+u

6. Does the above result support our conjecture or not?

7. Using the estimated coefficient of temp_f and temp_f -2, briefly explain how does the average attendance change as the high temperature increases.

8. According to the results, do you want to keep temp_f^2 in the model or not? Why?

Q2. Suppose you want to estimate the seasonal effect on the revenue. There is a constant term included in the regression as usual. How many dummies are needed to perform such analysis?

Q3. Determinants of price per ounce of cola. Cathy Schafer, a student of mine, estimated the following regression from cross-section data of 77 observations.

P_i = B₀ + B₁D_1i + B₂D_2i + B₃D_3i + u_i

where P_i = price per ounce of cola

D_1i = 001 if discount store, = 010 if chain store, = 100 if convenience store

D_2i = 10 if branded good, = 01 if unbranded good

D_3i = 0001 if 67.6 ounce (2 liter) bottle, = 0010 if 28-33 ounce bottles, = 0100 if 16 ounce bottle, and 1000 = if 12 ounce cans

The results were as follows:

P^{^}_i = 0.143 - 0.00000D_1i + 0.0090D_2i + 0.00001D_3i

t = (-0.3837) (8.3927) (5.8125) R² = 0.6033

where the figures in parentheses are the estimated t values.

(a) Comment on the way the dummies have been introduced in the model.

(b) How would you interpret the results, assuming the dummy setup is acceptable?

Q4. Load package gcookbook and type data (diamonds) to load the data set. The definition of table and depth can be found in the following picture

1. A diamond's quality can be measured by cut, ordered by Ideal, Premium, Very Good, Good, and Fair. Create dummy D₁ to represent Ideal and Premium, and D₂ to represent Very Good and Good.

2. Regress price on carat, depth, table, D₁ and D₂, all interactions terms between dummies and quantitative variables (carat, depth and table). Interpret your result.

3. Create a random sample of size 1000 from the diamonds data. Draw the scatterplot of carat vs log(price), color coded by cut.

Q5. Load the me.csv file. The population regression function is

Y = 2 + X + u.

W₁ = X + ∈₁ and W₂ = X + ∈₂ are two measurements for X with errors.

1. Regress Y on X, plot the scatter plot of (Y, X) and the regression line. Is β₁ close to 1?

2. Regress Y on W₁, plot the scatter plot of (Y, W₁) and the regression line.

3. Regress Y on W₂, plot the scatter plot of (Y, W₂) and the regression line.

4. The bias of β^{^}_i is given by β^{^}₁ - 1. Which case yields the largest bias? Why?

5. Now use 2SLS to solve the measurement error problem. Regress Y on W₁ and use W₂ as the IV for W₁. Compare your result with (1)(2)(3).

Part 2 -

Q1. Suppose you wish to estimate the effect of class attendance on student performance on econometrics. We denote by Y_i the student i's final exam score, att_i the attendance rate and GPA_i the GPA of the previous semester.

Y_i = β₀ + β₁att_i + β₂GPA_i + u_i

1. Let dist be the distance from the students' living places to the classroom. Do you think dist is uncorrelated with u?

2. Assuming that dist and u are uncorrelated, what other assumption must dist satisfy to be a valid IV for att? Also argue that dist satisfies this assumption.

Q2. The beer data gives information for 50 US states and Washington, DC for the year 1985-2000 on the following variables:

Variable	Definition
beer_sales	per capita beer sales in the state
income	in dollars
beer_tax	state tax rate on beer
fips_state	state id

1. Fit an pooled OLS regression of beer sales on income and tax

2. Fit a fixed effect model

3. Repeat 1. and 2., using logs of the three variables

4. What is the expected effect of beer tax on beer sales? Do the results support your expectation?

5. Would you expect income to have positive or negative effect on beer consumption? If it is negative, what does that mean?

Q3. Suppose we want to estimate the Cobb-Douglas production function for different production plants using panel data:

Y_it = K_it^β_1L_it^β_2exp(η_i)exp(u_it),

where (K_it, L_it) are capital and labor inputs. Different plants will be indexed by i(= 1, 2, ... N) and time will be indexed by t(= 1,2, ... T). u_it is the disturbance term with zero mean and independent of K and L. Because different plants may use different technology or expose to different technological shocks, we introduce the plant-specific fixed effect η_i. It is easy to see that the higher the η_i, the higher the output level given the same input level (K, L).

1. Describe how to remove the fixed effect η_i.

2. Load data_production.csv. Estimate (β₁, β₂) via dummy variable regression.

3. Estimate (β₁, β₂) via fixed effect regression (plm). Do you obtain the same regression coefficients as in 2?

4. Suppose that there is no fixed effect (so pooled OLS would work) Y_it = K_it^β_1L_it^β_2 explicit) but we don't have a good measurement for capital. Instead, we have the book value of capital, W1it; and the market value of capital, W2_it. Describe how to use 2SLS to estimate the production function.

5. Regress log(Y_it) on log(W1_it), log(L_it). Consider 4 cases: First, simple regression; Second, 2SLS with log(W2_it) as IV; Third, 2SLS with fixed effect; Fourth, fixed effect only (no IV). The true coefficient is (0.7, 0.3). Which model gives you more precise answer? Also compare results from these 4 models.

Attachment:- Assignment Files.zip

View Complete Question

Request for Solution File

Ask an Expert for Answer!!

Dissertation: The beer data gives information for 50 us states and

Reference No:- TGS02253774

Expected delivery within 24 Hours

Have a Question? (oR Write a Review)

Write atleast 100 words!!

Request for Solution File

Ask an Expert for Answer!!

Dissertation: The beer data gives information for 50 us states and

Reference No:- TGS02253774

Have a Question? (oR Write a Review)

Recent Questions Asked Dissertation

Q : The learning outcome for this unit involves the process of

Q : Special motors corporations stock price s is 59 the strike

Q : Assume the black-scholes framework for a futures exchange

Q : Define integrated marketing communications how is it

Q : The beer data gives information for 50 us states and

Q : How might the companys culture of not buying into hype and

Q : What major organizations are leading the way toward

Q : Procter and gamble versus bankers trust caveat emptor

Q : Summarize the steps necessary to set up a wireless network

Discuss signs and symptoms of hpv related cancer

Describe structured multimodal pain management program

Discuss client with severe atherosclerotic disease

Reflect on the definition and goal of ebp

Examine the process of putting a new policy into place

Essential information for early childhood professionals

Discuss about the value of examining your personal biases

Request for Solution File

Ask an Expert for Answer!!

Dissertation: The beer data gives information for 50 us states and

Reference No:- TGS02253774

Recent Questions Asked Dissertation

Q : The learning outcome for this unit involves the process of

Q : Special motors corporations stock price s is 59 the strike

Q : Assume the black-scholes framework for a futures exchange

Q : Define integrated marketing communications how is it

Q : The beer data gives information for 50 us states and

Q : How might the companys culture of not buying into hype and

Q : What major organizations are leading the way toward

Q : Procter and gamble versus bankers trust caveat emptor

Q : Summarize the steps necessary to set up a wireless network

Asked Questions