If we used the above k-nn algorithm to score the training, Computer Engineering

If we used the above k-nn algorithm to score the training

Problem

Predicting Housing Median Prices. The file BostonHousing.csv contains information on over 500 census tracts in Boston, where for each tract multiple variables are recorded. The last column (CAT.MEDV) was derived from MEDV, such that it obtains the value 1 if MEDV > 30 and 0 otherwise. Consider the goal of predicting the median value (MEDV) of a tract, given the information in the first 12 columns. Partition the data into training (60%) and validation (40%) sets.

a. Perform a k-NN prediction with all 12 predictors (ignore the CAT.MEDV column), trying values of k from 1 to 5. Make sure to normalize the data and choose function knn() from package class rather than package FNN. To make sure R is using the class package (when both packages are loaded), use class: knn(). What is the best k? What does it mean?

b. Predict the MEDV for a tract with the following information, using the best k:

c. If we used the above k-NN algorithm to score the training data, what would be the error of the training set?

d. Why is the validation data error overly optimistic compared to the error rate when applying this k-NN predictor to new data?

e. If the purpose is to predict MEDV for several thousands of new tracts, what would be the disadvantage of using k-NN prediction? List the operations that the algorithm goes through in order to produce each prediction.

View Complete Question

Request for Solution File

Ask an Expert for Answer!!

Computer Engineering: If we used the above k-nn algorithm to score the training

Reference No:- TGS02721508

Expected delivery within 24 Hours

Have a Question? (oR Write a Review)

Write atleast 100 words!!

Request for Solution File

Ask an Expert for Answer!!

Computer Engineering: If we used the above k-nn algorithm to score the training

Reference No:- TGS02721508

Have a Question? (oR Write a Review)

Recent Questions Asked Computer Engineering

Q : You use best practices of survey design to create a patient

Q : Find a security case study regarding a hack or data breach

Q : Articulate the significance of identifying first second and

Q : Search the ushmm site or yad vashem the usc shoah

Q : If we used the above k-nn algorithm to score the training

Q : Note that per unit manufacturing overhead costs include

Q : Write down a run that causes the controller to crash throw

Q : What are some of the key events in the evolution of global

Q : What were the causes and consequences of the french

Explain the conflict between america and iran today

What justifies an organization to use centralized staffing

What manifestations nurse expect to find during breast exam

Block diagram presents a closed-loop control system

Informatics role exploration paper

What affect a persons nutritional health

Eating disorders greatly impact nutritional health and body

Request for Solution File

Ask an Expert for Answer!!

Computer Engineering: If we used the above k-nn algorithm to score the training

Reference No:- TGS02721508

Recent Questions Asked Computer Engineering

Q : You use best practices of survey design to create a patient

Q : Find a security case study regarding a hack or data breach

Q : Articulate the significance of identifying first second and

Q : Search the ushmm site or yad vashem the usc shoah

Q : If we used the above k-nn algorithm to score the training

Q : Note that per unit manufacturing overhead costs include

Q : Write down a run that causes the controller to crash throw

Q : What are some of the key events in the evolution of global

Q : What were the causes and consequences of the french

Asked Questions