Describe the fit of the linear regression line to the data


Assignment

Part 1: Data Analysis

Prepare the data provided in the attached file, "Raw Data and Linear Regression." Remove any potential errors or outliers, duplicate records, and data that are not necessary to address the problem or scenario.

A. Explain why you removed each column or row from the raw data file or why you imputed data in the empty fields as you prepared the data for analysis. Include a clean data set with your submission.
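
The cleaning steps in task A could be sketched as follows. This is only an illustration: the records, the field layout, and the choice of the 1.5 * IQR rule for outliers are assumptions, not taken from the attached file, and dropping rows with missing values is just one option alongside imputation.

```python
from statistics import quantiles

# Hypothetical records: (event_id, event_type, officers_at_scene).
# The real column names and values in the raw data file may differ.
raw_rows = [
    (1, "Theft", 2),
    (2, "Assault", 3),
    (2, "Assault", 3),    # exact duplicate of the previous record
    (3, "Theft", None),   # missing value
    (4, "Vandalism", 2),
    (5, "Theft", 4),
    (6, "Assault", 3),
    (7, "Theft", 40),     # implausibly large value
    (8, "Vandalism", 1),
]

# 1. Drop exact duplicates while keeping the original order.
deduped = list(dict.fromkeys(raw_rows))

# 2. Drop rows with a missing officer count (imputing is the alternative).
complete = [row for row in deduped if row[2] is not None]

# 3. Flag outliers with the 1.5 * IQR rule (an assumed convention).
values = [row[2] for row in complete]
q1, _, q3 = quantiles(values, n=4)
iqr = q3 - q1
low, high = q1 - 1.5 * iqr, q3 + 1.5 * iqr

clean = [row for row in complete if low <= row[2] <= high]
removed = [row for row in complete if not (low <= row[2] <= high)]

print("kept:", len(clean), "removed as outliers:", removed)
```

Keeping the `removed` list makes it easier to explain, row by row, why each record was excluded.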

B. Create data sheets using the cleaned data. Provide the following tables with accurate counts, and vertical or horizontal bar graphs to represent the requested aggregated data. Be sure all tables are appropriately labeled.

o Table: date and number of events

o Bar graph: date and number of events

o Table: number of incident occurrences by event type

o Bar graph: number of incident occurrences by event type

o Table: sectors and total number of events

o Bar graph: sectors and total number of events
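
The three tables and bar graphs listed above all come from the same operation: counting records by one field. A minimal sketch, assuming hypothetical cleaned records of the form (date, event type, sector) and using plain-text bars in place of a spreadsheet chart:

```python
from collections import Counter

# Hypothetical cleaned records: (date, event_type, sector).
events = [
    ("2020-01-01", "Theft", "A"),
    ("2020-01-01", "Assault", "B"),
    ("2020-01-02", "Theft", "A"),
    ("2020-01-02", "Theft", "C"),
    ("2020-01-02", "Vandalism", "B"),
    ("2020-01-03", "Assault", "A"),
]

def count_by(records, index):
    """Count records by the field at position `index`."""
    return Counter(record[index] for record in records)

def text_bar_graph(counts, title):
    """Render counts as a labeled horizontal bar graph in plain text."""
    lines = [title]
    for label, n in sorted(counts.items()):
        lines.append(f"{label:<12} {'#' * n} {n}")
    return "\n".join(lines)

by_date = count_by(events, 0)
by_type = count_by(events, 1)
by_sector = count_by(events, 2)

print(text_bar_graph(by_date, "Events per date"))
print(text_bar_graph(by_type, "Incidents per event type"))
print(text_bar_graph(by_sector, "Events per sector"))
```

Each table should sum to the same total number of records; a mismatch usually means some rows have a blank in the field being counted.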

C. Describe the fit of the linear regression line to the data, using the linear regression model that is provided in the attachment. Provide graphical representations or tables as evidence to support your description.
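
One common way to describe the fit in task C is the coefficient of determination, R squared, which is the share of the variation in y that the line explains. The sketch below computes an ordinary least-squares line and R squared by hand; the x and y values are invented for illustration and do not come from the attachment.

```python
from statistics import mean

def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for y = intercept + slope * x."""
    x_bar, y_bar = mean(xs), mean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, y_bar - slope * x_bar

def r_squared(xs, ys, slope, intercept):
    """1 - (residual sum of squares / total sum of squares)."""
    y_bar = mean(ys)
    ss_res = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
    ss_tot = sum((y - y_bar) ** 2 for y in ys)
    return 1 - ss_res / ss_tot

# Hypothetical data, chosen to lie close to a straight line.
xs = [1, 2, 3, 4, 5]
ys = [2.1, 3.9, 6.2, 7.8, 10.1]

slope, intercept = fit_line(xs, ys)
r2 = r_squared(xs, ys, slope, intercept)
print(f"slope={slope:.3f} intercept={intercept:.3f} R^2={r2:.4f}")
```

A value of R squared near 1 indicates the points lie close to the line; a value near 0 indicates the line explains little of the variation.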

D. Describe the impact of the outliers on the data, using the linear regression model that is provided in the attachment. Provide graphical representations or tables as evidence to support your description.

E. Provide a residual plot and explain how to improve the linear regression model based on your interpretation of the plot.

Part 2: Simulation and Recommendation

Run a simulation (Monte Carlo) based on a normally distributed random variable of the same mean and standard deviation as the variable "Number of officers at the scene" in the clean data set.
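
A minimal sketch of the simulation step, assuming a short, invented list of officer counts in place of the real column. It takes the mean and sample standard deviation of the data and draws normally distributed values with those parameters.

```python
import random
from statistics import mean, stdev

# Hypothetical cleaned values for "Number of officers at the scene".
officers = [2, 3, 3, 4, 2, 5, 3, 4, 3, 2]

mu = mean(officers)
sigma = stdev(officers)  # sample standard deviation

rng = random.Random(42)  # fixed seed so the run is reproducible
n_trials = 10_000
simulated = [rng.gauss(mu, sigma) for _ in range(n_trials)]

print(f"data:      mean={mu:.3f} sd={sigma:.3f}")
print(f"simulated: mean={mean(simulated):.3f} sd={stdev(simulated):.3f}")
```

Because the draws come from a normal distribution, they can be fractional or even negative, unlike real officer counts; that limitation is worth noting when reporting the results.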

F. Determine if the police department currently qualifies for the funding. Provide your simulation results as evidence to support your findings.

G. Calculate the probability that the department will or will not qualify for the funding in the future. Provide evidence to support your findings.
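
The probability in task G can be estimated as the fraction of simulated draws that meet the funding criterion. The criterion itself is not stated here, so the cutoff and its direction below are purely hypothetical, as are the mean and standard deviation. The sketch also checks the simulated estimate against the exact normal tail probability.

```python
import random
from statistics import NormalDist

# Hypothetical parameters; use the mean and standard deviation
# of the cleaned "Number of officers at the scene" column instead.
mu, sigma = 3.1, 1.0

# Hypothetical criterion: qualify when the value is at least 4.0.
# The real cutoff and its direction come from the scenario.
threshold = 4.0

rng = random.Random(7)  # fixed seed so the run is reproducible
n_trials = 100_000
hits = sum(rng.gauss(mu, sigma) >= threshold for _ in range(n_trials))

p_simulated = hits / n_trials
p_exact = 1 - NormalDist(mu, sigma).cdf(threshold)

print(f"P(qualify), simulated: {p_simulated:.4f}")
print(f"P(qualify), exact:     {p_exact:.4f}")
print(f"P(not qualify), simulated: {1 - p_simulated:.4f}")
```

Agreement between the simulated and exact values is a useful sanity check that the simulation ran enough trials.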

H. Describe the precautions or behaviors that should be exercised when working with and communicating about the sensitive data in this scenario.

I. Acknowledge sources, using in-text citations and references, for content that is quoted, paraphrased, or summarized.
