Describe how you tested your approach to ensure


Project Assignment

The course project uses the data set: Communities and Crime Unnormalized Data Set.

This data set has 147 attributes and 2215 instances or observations. This data set is relatively recent, March 2, 2011. The data set will require pre-processing before beginning data mining. For example, missing data is designated by a "?".

In addition to the basic data set you will need to update the data using the neighborhoodscout website for specific areas including:

Amherst, New York
Ardmore, Oklahoma
Atlanta, Georgia
Baltimore, Maryland
Buffalo, New York
Cleveland, Ohio
Detroit, Michigan
Ferguson, Missouri
Frisco, Texas
Glendale, California
Irvine, California
Memphis, Tennessee
Naperville, Illinois
Oakland, California
Sunnyvale, California

You will use this data set to complete your course project which comprises an analysis of the data using a statistical and a graphical analysis. Your team should create a proposal to describe exactly what you will do. You are required, at a minimum, to assess whether or not an area will have a greater than or less than/equal to the national average rate of violent and non-violet crimes. Violent crimes include those which involve force or the threat of force.

Although you can use as many appropriate variables as you want, you should take into account at least: population; ethnicity; age; income; and, education. Note that you may have to pre-process the data set to obtain these variables or some set of equivalent normalized variables.

One thing that you need to specifically look for is how imbalances affect crime rates. That is, it is well known (look for articles and information online) that income inequality increases crime rates. It has also been demonstrated that other types of imbalances result in increased crime, e.g. when the demographics of an area are not reflected in groups such as law enforcement. This has been proposed as having had a major impact on the situation in Ferguson, MO. The additional cities/towns you are required to find updated data on include some with such imbalances, some with higher crime rates, and some with lower crime rates. This is so that you can compare and contrast different statistics, etc.

Proposal

Create a minimum 10 page proposal with sections including:

• What is the problem you are trying to solve or question you are trying to answer?

• What work do you plan to do in the project?

• Which algorithms/techniques/models do you plan to use/develop? Be as specific as you can?

• How will you evaluate what you've done?

• What do you expect to submit/accomplish by the end of the project?

Status Report

Create a minimum 10 page status report with sections including:

• What the problem is that you are trying to solve or question you are trying to answer.

• All relevant background information including any relevant literature you have/will use.

• The overall process you will follow for the entire project.

• A description of any relevant, interesting exploratory data analyses.

• A description of the methods/techniques/tools/algorithms you have/will use to complete the project.

• A description of the challenges you have had working on the project so far.

• A discussion of the parts of the project that have been completed.

• A discussion of the parts of the project that remain to be completed.

• A discussion of how you will finish the final project report and presentation.


Project Final Report (60% of the total project score) due December 11:

Create an approximate 10 page final report with at least the following sections:

• Introduction, motivation and general description of the situation, problem or challenge.

• Following the proposal and status report, what is the situation, problem or challenge you are addressing?

• Related work.

• Provide a thorough background for the project; e.g. you can use information from other related work you have found online - don't forget to properly cite others work.

• Data

• Give a complete description of the data you use during the project, including any you reject.

• Include your Code Book as an Appendix to your final report.

• Technical Approach

• Give a detailed description of the process for your entire project including the analyses you completed.

• Give a detailed description of the analytics you have used including any algorithms, methods, tools or techniques. You do not have to describe well known approaches themselves, e.g. linear regression. You do have to describe how you applied the approach you used.

• Test and evaluation

• Describe how you tested your approach to ensure that it is valid.

• Discuss the validity of your approach.

• Describe how you evaluated your results and/or conclusions including any specific metrics, output data, completed analyses, etc.

• Discuss how well your approach worked to address the situation or challenge, solve the problem or answer the research question.

• Evaluate and report whether or not someone unfamiliar with your work could accurately replicate it.

• Written work

• Written work will be graded using the rubric provided.

Format your assignment according to the following formatting requirements:

1. The answer should be typed, double spaced, using Times New Roman font (size 12), with one-inch margins on all sides.

2. The response also include a cover page containing the title of the assignment, the student's name, the course title, and the date. The cover page is not included in the required page length.

3. Also Include a reference page. The Citations and references should follow APA format. The reference page is not included in the required page length.

Request for Solution File

Ask an Expert for Answer!!
Basic Statistics: Describe how you tested your approach to ensure
Reference No:- TGS02951819

Expected delivery within 24 Hours