--%>

correlation analysis and the regression statistics

1).  When you take out a mortgage, there are many different kinds of costs.  Usually the two largest are the interest rate (annual percentage that determines the size of your monthly payment) and the loan fee (a one-time percentage charged to you at the time the loan is made).  Based on the data in the MORTAGE tab of the Excel file provided, what type of relationship exists between the interest rate and loan fee?  How strong of a relationship exists between these two variables?  If one wanted to predict the loan fee given a certain interest rate, would you recommend using a model derived from this data?  Only use correlation analysis and the regression statistics to briefly justify your reasoning.  Use α = .05.  

2).  The data in the MATH tab of the Excel file provided represents a sample of mathematics achievement test (MAT) scores and calculus grades for independently selected college freshmen.  From this evidence, would you say that the achievement test scores and calculus grades are independent? 

Use α = .01.  

3).  Part of a study to determine factors influencing family medical expenses involves finding a regression relationship between the number of people in a family and the monthly medical expense.  The data for the pilot study is located in the MEDICAL tab in the Excel file provided. 

            a).  Develop a regression model at the .05 level of significance.

b).  What can be said regarding the slope and correlation coefficient?  Conduct both the (t)-test and F-test for the slope, and (t)-test for the correlation coefficient.

c).  Use your results in (a) to determine the monthly medical expenses for a family of (4)?  Is this meaningful?

d).  Use your results in (a) to determine the monthly medical expenses for single person household?  Is this meaningful?

 

4).  Consider the earnings per share and the closing stock price of selected biotechnical firms with large market capitalization located in the STOCK tab in the Excel file provided.  Given the importance of many analysts place on earnings per share, you might expect to find a strong correlation between earnings per share and stock price.  Of course, it may be premature to judge since the market price may depend more on the expectation of (random) future earnings than on the actual achieved earnings.  (20 pts)

 

            a).  Draw a scatterplot of the stock price against earnings per share.

            b).  Determine the coefficient of determination and interpret its meaning.

            c).  Using α = .05, develop a regression model.

d).  Conduct a residual analysis and determine the validity of the model.  Include the Durbin-Watson test.

e).  You are head of a biotech firm planning to go public soon.  Your earnings per share are $.05.  Based on your model in (c), what stock price would you anticipate.

 

5).  A sample of 30 computer hardware companies were observed from Stock Investor Pro and is located in the INVESTOR tab in the Excel file provided.  The data includes price per share, book value per share, and the return on equity per share for each. 

a).  Develop an estimated regression model that can be used to predict the price per share given the book value per share and the return on equity per share.  Use the .05 level of significance.

b).  Test the significance of the overall regression model.

c).  Use the (t)-test and partial F-test to determine the significance of each independent variable.

d).  Do the independent variables make a significant contribution to the regression model?  Which one(s) should be included?

            e).  Compute the coefficients of partial determination and interpret the results.

            f).  Add an interaction term to the model.  Does it make a significant contribution to the model?

6).  Your firm is worried about being sued for gender discrimination.  There is a growing perception that males are being paid more than females in your department.  Using the data in the SALARY tab in the Excel file provided, please complete the following using α = .05: 

            a).  Do the men appear to earn more on average than women based on the information provided?

            b).  Derive a regression model, and provide a model for men and a model for women.

c).  Do the independent variables make a significant contribution to the regression model?  Which one(s) should be included?

            d).  Compute the coefficients of partial determination and interpret the results.

            e).  Add an interaction term to the model.  Does it make a significant contribution to the model?

            f).  How does the salary differ for men and women if each one has 13-years experience?

            g).  Does your results imply discrimination against women?

   Related Questions in Basic Statistics

  • Q : Creating Grouped Frequency Distribution

    Creating Grouped Frequency Distribution: A) At first we have to determine the biggest and smallest values. B) Then we have to Calculate the Range = Maximum - Minimum C) Choose the number of classes wished for. This is generally between 5 to 20. D) Find out the class width by dividing the range b

  • Q : Compute two sample standard deviations

    Consider the following data for two independent random samples taken from two normal populations. Sample 1 14 26 20 16 14 18 Sample 2 18 16 8 12 16 14 a) Com

  • Q : Probability how can i calculate

    how can i calculate cumulative probabilities of survival

  • Q : Principles of data analysis For the

    For the data analysis project, you will address some questions that interest you with the statistical methodology we are learning in class. You choose the questions; you decide how to collect data; you do the analyses. The questions can address almost any topic,

  • Q : Safety and Liveness in Model Checking

    Safety and Liveness in Model Checking Approach; •? Safety: Nothing bad happens •? Liveness: Something good happens •? Model checking is especially good at verifying safety and liveness properties    –?Concurrency i

  • Q : Data Description 1. If the mean number

    1. If the mean number of hours of television watched by teenagers per week is 12 with a standard deviation of 2 hours, what proportion of teenagers watch 16 to 18 hours of TV a week? (Assume a normal distribution.) A. 2.1% B. 4.5% C. 0.3% D. 4.2% 2. The probability of an offender having a s

  • Q : Building Models Building Models • What

    Building Models • What do we need to know to build a model?– For model checking we need to specify behavior • Consider a simple vending machine – A custome rinserts coins, selects a beverage and receives a can of soda &bul

  • Q : What is your conclusion The following

    The following data were collected on the number of emergency ambulance calls for an urban county and a rural county in Florida. Is County type independent of the day of the week in receiving the emergency ambulance calls? Use α = 0.005. What is your conclusion? Day of the Week<

  • Q : Statistics basic question This week you

    This week you will analyze if women drink more sodas than men.  For the purposes of this Question, assume that in the past there has been no difference.  However, you have seen lots of women drinking sodas the past few months.  You will perform a hypothesis test to determine if women now drink more

  • Q : Problems on ANOVA We are going to

    We are going to simulate an experiment where we are trying to see whether any of the four automated systems (labeled A, B, C, and D) that we use to produce our root beer result in a different specific gravity than any of the other systems. For this example, we would l