
Correlation analysis and regression statistics

1).  When you take out a mortgage, there are many different kinds of costs.  Usually the two largest are the interest rate (the annual percentage that determines the size of your monthly payment) and the loan fee (a one-time percentage charged to you at the time the loan is made).  Based on the data in the MORTAGE tab of the Excel file provided, what type of relationship exists between the interest rate and the loan fee?  How strong is the relationship between these two variables?  If you wanted to predict the loan fee given a certain interest rate, would you recommend using a model derived from this data?  Use only correlation analysis and the regression statistics to briefly justify your reasoning.  Use α = .05.

2).  The data in the MATH tab of the Excel file provided represents a sample of mathematics achievement test (MAT) scores and calculus grades for independently selected college freshmen.  From this evidence, would you say that the achievement test scores and calculus grades are independent?  Use α = .01.
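
Questions 1 and 2 both turn on whether a sample correlation differs significantly from zero. A minimal sketch of that test statistic follows, assuming the usual t = r·sqrt((n − 2)/(1 − r²)) form with n − 2 degrees of freedom. The function name and the numbers are illustrative placeholders, not the contents of the MORTAGE or MATH tabs, and the resulting t still has to be compared against a two-tailed critical value from a t table at the stated α.

```python
import math

def correlation_t_statistic(x, y):
    """Return (r, t, df) for the sample correlation between x and y.

    Assumes |r| < 1; a perfect linear relationship makes t undefined.
    """
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    r = sxy / math.sqrt(sxx * syy)
    t = r * math.sqrt((n - 2) / (1 - r ** 2))
    return r, t, n - 2

# Made-up placeholder values, NOT the data from the Excel file.
test_scores = [39, 43, 21, 64, 57, 47, 28, 75, 34, 52]
calc_grades = [65, 78, 52, 82, 92, 89, 73, 98, 56, 75]

r, t, df = correlation_t_statistic(test_scores, calc_grades)
print(f"r = {r:.3f}, t = {t:.3f}, df = {df}")
```

If |t| exceeds the critical value, the hypothesis of zero correlation is rejected at that level.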

3).  Part of a study to determine factors influencing family medical expenses involves finding a regression relationship between the number of people in a family and the monthly medical expense.  The data for the pilot study is located in the MEDICAL tab in the Excel file provided. 

            a).  Develop a regression model at the .05 level of significance.

            b).  What can be said regarding the slope and correlation coefficient?  Conduct both the t-test and the F-test for the slope, and the t-test for the correlation coefficient.

            c).  Use your results in (a) to estimate the monthly medical expenses for a family of 4.  Is this meaningful?

            d).  Use your results in (a) to estimate the monthly medical expenses for a single-person household.  Is this meaningful?
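
The steps in question 3 can be sketched as a least-squares fit followed by the slope's t and F statistics and a point prediction. This is only an illustration under stated assumptions: the helper name and the numbers are hypothetical placeholders rather than the MEDICAL tab data, and reading "is this meaningful" as "does the x value lie inside the observed range" is one common interpretation, not something the assignment states.

```python
import math

def fit_simple_regression(x, y):
    """Least-squares fit of y = b0 + b1*x with slope t and F statistics."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    b1 = sxy / sxx                      # slope
    b0 = mean_y - b1 * mean_x           # intercept
    sst = sum((b - mean_y) ** 2 for b in y)
    sse = sum((b - (b0 + b1 * a)) ** 2 for a, b in zip(x, y))
    mse = sse / (n - 2)                 # error variance estimate
    t_slope = b1 / math.sqrt(mse / sxx)
    f_stat = (sst - sse) / mse          # equals t_slope**2 with one predictor
    return {"b0": b0, "b1": b1, "t": t_slope, "F": f_stat, "df": n - 2}

# Made-up placeholder values, NOT the data from the Excel file.
family_size = [2, 3, 3, 4, 5, 6]
expense = [40, 55, 50, 70, 85, 100]

fit = fit_simple_regression(family_size, expense)
for size in (4, 1):
    estimate = fit["b0"] + fit["b1"] * size
    in_range = min(family_size) <= size <= max(family_size)
    print(f"size {size}: estimate {estimate:.2f}, inside observed range: {in_range}")
```

With a single predictor the F statistic is the square of the slope's t statistic, so the two tests for the slope reach the same conclusion.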

 

4).  Consider the earnings per share and the closing stock price of selected biotechnical firms with large market capitalization located in the STOCK tab in the Excel file provided.  Given the importance many analysts place on earnings per share, you might expect to find a strong correlation between earnings per share and stock price.  Of course, it may be premature to judge since the market price may depend more on the expectation of (random) future earnings than on the actual achieved earnings.  (20 pts)

 

            a).  Draw a scatterplot of the stock price against earnings per share.

            b).  Determine the coefficient of determination and interpret its meaning.

            c).  Using α = .05, develop a regression model.

            d).  Conduct a residual analysis and determine the validity of the model.  Include the Durbin-Watson test.

            e).  You are head of a biotech firm planning to go public soon.  Your earnings per share are $.05.  Based on your model in (c), what stock price would you anticipate?
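
For the Durbin-Watson test in part (d), the statistic is the sum of squared differences between successive residuals divided by the sum of squared residuals. A minimal sketch, assuming the residuals are already listed in their observation order (the function name and values are placeholders, not residuals from the STOCK tab):

```python
def durbin_watson(residuals):
    """Durbin-Watson statistic for residuals given in observation order."""
    numerator = sum(
        (residuals[i] - residuals[i - 1]) ** 2
        for i in range(1, len(residuals))
    )
    denominator = sum(e ** 2 for e in residuals)
    return numerator / denominator

# Made-up placeholder residuals, NOT computed from the Excel file.
residuals = [1.2, -0.4, 0.7, -1.1, 0.3, -0.6, 0.9, -1.0]
print(f"D = {durbin_watson(residuals):.3f}")
```

The statistic ranges from 0 to 4, with values near 2 suggesting no first-order autocorrelation. The formal decision compares it with the tabulated lower and upper bounds for the sample size and number of predictors.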

 

5).  A sample of 30 computer hardware companies was drawn from Stock Investor Pro and is located in the INVESTOR tab in the Excel file provided.  The data includes the price per share, book value per share, and return on equity per share for each company.

            a).  Develop an estimated regression model that can be used to predict the price per share given the book value per share and the return on equity per share.  Use the .05 level of significance.

            b).  Test the significance of the overall regression model.

            c).  Use the t-test and partial F-test to determine the significance of each independent variable.

            d).  Do the independent variables make a significant contribution to the regression model?  Which one(s) should be included?

            e).  Compute the coefficients of partial determination and interpret the results.

            f).  Add an interaction term to the model.  Does it make a significant contribution to the model?
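
The partial F-test in (c) and the coefficients of partial determination in (e) can both be computed from the sums of squares that the regression output reports. The sketch below assumes the common textbook definitions: the contribution of a variable is SSR(full model) minus SSR(model without that variable), partial F divides that contribution by the full model's MSE, and the coefficient of partial determination divides it by SST − SSR(full) + contribution. The function names and sums of squares are hypothetical placeholders. Only n = 30 and the two independent variables come from the question.

```python
def partial_f(ssr_full, ssr_without, sse_full, n, k):
    """Partial F statistic for one variable in a model with k predictors."""
    contribution = ssr_full - ssr_without
    mse_full = sse_full / (n - k - 1)
    return contribution / mse_full

def partial_determination(sst, ssr_full, ssr_without):
    """Share of otherwise unexplained variation explained by the variable."""
    contribution = ssr_full - ssr_without
    return contribution / (sst - ssr_full + contribution)

# Made-up placeholder sums of squares, NOT taken from the Excel file.
sst = 5000.0                 # total sum of squares
ssr_full = 3600.0            # SSR with book value and return on equity
ssr_without_book = 2100.0    # SSR with return on equity only
n, k = 30, 2

f_book = partial_f(ssr_full, ssr_without_book, sst - ssr_full, n, k)
r2_book = partial_determination(sst, ssr_full, ssr_without_book)
print(f"partial F = {f_book:.2f}, partial r^2 = {r2_book:.3f}")
```

The partial F statistic for a single variable has 1 and n − k − 1 degrees of freedom and equals the square of that variable's t statistic in the full model.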

6).  Your firm is worried about being sued for gender discrimination.  There is a growing perception that males are being paid more than females in your department.  Using the data in the SALARY tab in the Excel file provided, please complete the following using α = .05: 

            a).  Do the men appear to earn more on average than women based on the information provided?

            b).  Derive a regression model, and provide a model for men and a model for women.

            c).  Do the independent variables make a significant contribution to the regression model?  Which one(s) should be included?

            d).  Compute the coefficients of partial determination and interpret the results.

            e).  Add an interaction term to the model.  Does it make a significant contribution to the model?

            f).  How does the salary differ for men and women if each one has 13 years of experience?

            g).  Do your results imply discrimination against women?
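
One way to get separate models for men and women out of a single regression, as parts (b), (e), and (f) ask, is a gender dummy variable plus an experience-by-gender interaction. The sketch below assumes a model of the form salary = b0 + b1·years + b2·male + b3·years·male, with male coded 1 and female coded 0. That coding, the variable names, and the coefficients are illustrative assumptions, not values fitted to the SALARY tab.

```python
def predicted_salary(coef, years, is_male):
    """Evaluate salary = b0 + b1*years + b2*male + b3*years*male."""
    male = 1 if is_male else 0
    return (coef["b0"] + coef["b1"] * years
            + coef["b2"] * male + coef["b3"] * years * male)

def gender_gap(coef, years):
    """Predicted male salary minus female salary at equal experience."""
    return coef["b2"] + coef["b3"] * years

# Made-up placeholder coefficients, NOT fitted to the Excel file.
coef = {"b0": 30.0, "b1": 2.0, "b2": 5.0, "b3": 0.5}

years = 13
print("women:", predicted_salary(coef, years, is_male=False))
print("men:  ", predicted_salary(coef, years, is_male=True))
print("gap:  ", gender_gap(coef, years))
```

Under this coding the model for women is b0 + b1·years and the model for men is (b0 + b2) + (b1 + b3)·years, so the gap at any experience level is b2 + b3·years. Without the interaction term (b3 = 0) the gap is the constant b2.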
