Discuss and reflect on the quality of the data sets


Data Mining Assignment: Explore and Prepare Data

You work for a hypothetical university as an entry level data analyst and your supervisor has task you to learn more about the data mining process associated with problem definitions, data exploration and data preparation by completing the steps below:

1. In the discussion this week, a task to install Rapid Miner was requested so to get started, your supervisor has asked you to prepare feedback based on at least two Rapid Miner data samples.

2. Important Reminder: In support of this feedback and assignment, everyone should go through all introductory and data preparation video tutorials. Additional learning videos could be found youtube using keyword searches like "Rapid Miner Tutorials." For example, check out the resource found below:

• RapidMiner, Inc. (2018). Various Rapid Miner Support Video Resources. YouTube.

• Rapid Miner. (2018). Operator reference manual. Rapid Minder.

1. The feedback needs to be a minimum of five body pages of written content not including illustrations and supported with at least three academic sources of research. Furthermore, the feedback needs to be professionally formatted using APA including an APA cover page, abstract, body pages, and reference page. The feedback needs to address the following:

• Problem Definitions: When looking at the data sets, think about, develop and discuss some potential problem definitions for these data sets. In other words, what are some potential ideas of working with and handling these data sets.

• Data exploration: In further exploration of the data sets, discuss and reflect on the quality of these data sets and use some of the basic statistical output and charts provided with Rapid Miner. When exploring the data sets, also remember to think about any potential data problems you see.

• Data Preparation: After exploring the data, discuss, reflect, and apply any ideas to cleanse or make the data better for data analysis and modeling efforts.

1. Remember to be very illustrative embedding any charts used or other screen captures to verify any work completed to explore and prepare the data sets.

2. For the conclusions of this feedback, no modeling has yet been accomplished; however, use the basic statistical and chart options to draw initial conclusions about these data sets assuming a case where there were no options to go further creating models. In other words, what types of decisions could be made about these data sets after data exploration and data preparations are conducted.

3. Complete and submit this assignment for grading on or before the due date. Remember, it is not a good idea to complete or attempt completing work late. See the course syllabus and the associated late policy.

Format your assignment according to the following formatting requirements:

1. The answer should be typed, double spaced, using Times New Roman font (size 12), with one-inch margins on all sides.

2. The response also includes a cover page containing the title of the assignment, the student's name, the course title, and the date. The cover page is not included in the required page length.

3. Also include a reference page. The Citations and references should follow APA format. The reference page is not included in the required page length.

Attachment:- References.rar

Solution Preview :

Prepared by a verified Expert
Database Management System: Discuss and reflect on the quality of the data sets
Reference No:- TGS02977828

Now Priced at $30 (50% Discount)

Recommended (96%)

Rated (4.8/5)