How can noise be reduced in a dataset?


Discussion Post:

What's simple random sampling? Is it possible to sample data instances using a distribution different from the uniform distribution? If so, give an example of a probability distribution of the data instances that is different from uniform (i.e., equal probability).

You must make at least two substantive responses to your classmates' posts. Respond to these posts in any of the following ways:

o Build on something your classmate said.
o Explain why and how you see things differently.
o Ask a probing or clarifying question.
o Share an insight from having read your classmates' postings.
o Offer and support an opinion.
o Validate an idea with your own experience.
o Expand on your classmates' postings.
o Ask for evidence that supports the post.

Homework:

What's an attribute? What's a data instance?

i. What's noise? How can noise be reduced in a dataset?

ii. Define outlier. Describe 2 different approaches to detect outliers in a dataset.

iii. Describe 3 different techniques to deal with missing values in a dataset. Explain when each of these techniques would be most appropriate.

iv. Given a sample dataset with missing values, apply an appropriate technique to deal with them.

v. Give 2 examples in which aggregation is useful.

vi. Given a sample dataset, apply aggregation of data values.

vii. What's sampling?

viii. What's simple random sampling? Is it possible to sample data instances using a distribution different from the uniform distribution? If so, give an example of a probability distribution of the data instances that is different from uniform (i.e., equal probability).

ix. What's stratified sampling?
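
The three sampling ideas named in items vii through ix can be contrasted in a short sketch. This is only an illustration under stated assumptions, not a model answer: the toy dataset, the `label` field, the weights, and the helper names (`simple_random_sample`, `weighted_sample`, `stratified_sample`) are all hypothetical and chosen for the example. It uses only Python's standard `random` module.

```python
import random
from collections import defaultdict

def simple_random_sample(instances, k, rng):
    """Draw k instances without replacement; each instance is equally likely."""
    return rng.sample(instances, k)

def weighted_sample(instances, weights, k, rng):
    """Draw k instances with replacement, with probability proportional
    to the given weights (a non-uniform distribution over instances)."""
    return rng.choices(instances, weights=weights, k=k)

def stratified_sample(instances, key, fraction, rng):
    """Draw the same fraction from each stratum (group) defined by key."""
    strata = defaultdict(list)
    for inst in instances:
        strata[key(inst)].append(inst)
    sample = []
    for group in strata.values():
        n = max(1, round(len(group) * fraction))  # keep at least one per stratum
        sample.extend(rng.sample(group, n))
    return sample

# Hypothetical toy dataset: 10 "rare" and 90 "common" instances.
rng = random.Random(0)
data = [{"id": i, "label": "rare" if i < 10 else "common"} for i in range(100)]

srs = simple_random_sample(data, 20, rng)

# Assumed weights: each rare instance is 5 times as likely to be drawn.
weights = [5.0 if d["label"] == "rare" else 1.0 for d in data]
ws = weighted_sample(data, weights, 20, rng)

strat = stratified_sample(data, lambda d: d["label"], 0.2, rng)
```

In this sketch the stratified sample always holds 2 rare and 18 common instances, whereas the number of rare instances in the simple random sample varies from draw to draw.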

Format your homework according to the following formatting requirements:

(1) The answer should be typed and double-spaced in Times New Roman font (size 12), with one-inch margins on all sides.

(2) The response should also include a cover page containing the title of the homework, the student's name, the course title, and the date. The cover page is not included in the required page length.

(3) Also include a reference page. Citations and references should follow APA format. The reference page is not included in the required page length.
