Solved: CS 5644 assignment: implement a decision tree and naïve Bayes, Computer Engineering

CS 5644 assignment: implement a decision tree and naïve Bayes

Assignment -

1. Machine learning has now permeated multiple disciplines, even politics. The current landscape in the US is rife with data scientists and other quantitative experts making predictions about ongoing and upcoming elections. Consider the Congressional Voting Records dataset from the UCI Machine Learning Repository.

The dataset contains two files: one with a ".names" suffix and one with a ".data" suffix. The actual data is in the ".data" file, and the ".names" file holds the metadata (i.e., it describes what the different columns mean). Note that each row of the ".data" file contains one instance and includes both the features and the class label (please take care to note the order). The machine learning problem here is to take the votes of US congressmen/congresswomen as input and predict whether each is a Republican or a Democrat. In particular, our goal is to solve this problem using both decision trees and a naïve Bayes classifier.
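As a minimal sketch of reading such a file, the snippet below splits each row into a class label and a list of feature values. It assumes a comma-separated layout with the class label in the first column and votes coded as 'y', 'n', or '?'; check the ".names" file to confirm the actual order. The sample lines and the helper name `read_instances` are made up for illustration and are not rows from the real dataset.

```python
import csv
import io

# Made-up sample lines in the ASSUMED layout: class label first,
# then one vote per column ('y', 'n', or '?' for missing).
SAMPLE = """republican,n,y,n,y
democrat,?,y,y,n
democrat,y,?,y,n
"""

def read_instances(handle):
    """Return (labels, rows) from an open text file in the assumed layout."""
    labels, rows = [], []
    for record in csv.reader(handle):
        if not record:  # skip blank lines
            continue
        labels.append(record[0])   # class label (assumed first column)
        rows.append(record[1:])    # remaining columns are the features
    return labels, rows

labels, rows = read_instances(io.StringIO(SAMPLE))
print(labels)
print(rows[0])
```

To use it on the real data, pass the opened ".data" file in place of the `io.StringIO` object.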

First, spend some time understanding the structure of the dataset: how the instances are organized, how the features and class are organized, and so on. You need to "massage" this data into the form that scikit-learn requires before you can apply either a decision tree or a naïve Bayes classifier, so plan how you will do this massaging. You can do it in Python, in Excel, or any way you choose. Note that this step is a natural part of the machine learning and knowledge discovery process. Data is rarely given in a form to which machine learning can be directly applied, so considerable effort goes into cleaning, manipulating, and massaging it. Do not apply scikit-learn before ensuring that the data is in the required form.

Just like the PlayTennis dataset, the features are binary-valued, but note that some features have missing values for some rows (instances). You need to decide how you will handle them. There are three possibilities here: i) discard instances that have missing feature values, ii) treat "missing" as if it is a value (and thus a binary feature becomes a ternary, or three-valued, feature), iii) impute missing values (i.e., for each feature, replace missing values with the most common value for that feature), so that they are no longer missing or unknown. If you read the ".names" file, it explains why some values are missing and what they mean.
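The three options above can be sketched in plain Python as follows. This is only an illustration under stated assumptions: missing votes are marked with '?', votes are coded 'y'/'n', and the function names, the 0/1/2 coding, and the toy rows are hypothetical choices, not part of the assignment.

```python
from collections import Counter

MISSING = "?"  # ASSUMED marker for a missing vote

def drop_missing(rows, labels):
    """(i) Keep only instances with no missing feature value."""
    kept = [(r, l) for r, l in zip(rows, labels) if MISSING not in r]
    return [r for r, _ in kept], [l for _, l in kept]

def missing_as_value(rows):
    """(ii) Encode n/y/? as 0/1/2 so each feature becomes three-valued."""
    code = {"n": 0, "y": 1, MISSING: 2}
    return [[code[v] for v in r] for r in rows]

def impute_mode(rows):
    """(iii) Replace each missing value with the feature's most common
    observed value. Assumes every feature has at least one observed value."""
    modes = []
    for col in zip(*rows):  # iterate over features (columns)
        counts = Counter(v for v in col if v != MISSING)
        modes.append(counts.most_common(1)[0][0])
    return [[modes[j] if v == MISSING else v for j, v in enumerate(r)]
            for r in rows]

# Toy data, made up for illustration only.
rows = [["y", "n", "?"], ["y", "?", "n"], ["n", "n", "n"], ["?", "y", "n"]]
labels = ["democrat", "republican", "republican", "democrat"]

kept_rows, kept_labels = drop_missing(rows, labels)
ternary = missing_as_value(rows)
imputed = impute_mode(rows)
```

One design point worth weighing: when imputation is combined with cross-validation, computing the most common value on the training folds only avoids leaking information from the held-out fold.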

Implement a decision tree and a naïve Bayes classifier, each with the above three ways of dealing with missing values, so that you are experimenting with 6 scenarios.
Perform 5-fold cross-validation and report precision, recall, and F1-scores for each of the 6 scenarios.
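A minimal sketch of the evaluation loop for one scenario, using scikit-learn's `cross_validate` with a stratified 5-fold split. The data here is synthetic stand-in data, not the voting records, and the integer coding (0 = no, 1 = yes, 2 = missing; label 1 as the positive class) is an assumption made for illustration. `CategoricalNB` is one possible naïve Bayes variant for integer-coded categorical features; other choices are equally valid.

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold, cross_validate
from sklearn.naive_bayes import CategoricalNB
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in data (NOT the voting records): 200 instances,
# 16 features coded 0 = no, 1 = yes, 2 = missing; label 0 or 1.
rng = np.random.default_rng(0)
y = rng.integers(0, 2, size=200)
X = rng.integers(0, 3, size=(200, 16))
X[:, 0] = y  # make one feature informative so scores are not pure chance

models = {
    "decision tree": DecisionTreeClassifier(random_state=0),
    # min_categories=3 reserves room for all three codes in every fold
    "naive Bayes": CategoricalNB(min_categories=3),
}
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
metrics = ("precision", "recall", "f1")  # computed for positive label 1

results = {}
for name, model in models.items():
    scores = cross_validate(model, X, y, cv=cv, scoring=metrics)
    # Average each metric over the 5 folds
    results[name] = {m: scores[f"test_{m}"].mean() for m in metrics}
    print(name, results[name])
```

Repeating this loop with the feature matrix produced by each missing-value strategy gives the six scenarios to report.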

2. For what type of dataset would you choose decision trees as a classifier over naïve Bayes? And vice versa?
