Compute the ideal number of clusters and assign each data point


Assignment

Included with this assignment is an Excel spreadsheet that contains data with two dimension values.

The purpose of this assignment is to demonstrate steps performed in a K-Means Cluster analysis.

Review the "k-MEANS CLUSTERING ALGORITHM" section in Chapter 4 of the Sharda et al. textbook for additional background.

Use Excel to perform the following data analysis.

1. Plot the data on a scatter plot.
2. Determine the ideal number of clusters.
3. Choose random center points (centroids) for each cluster. (Note: Each student will select a different random set of centroids.)
4. Using a standard distance formula, measure the distance from each data point to each center point.
5. Assign each data point to an initial cluster region based on closeness.
6. For each cluster, calculate new center points.
7. Repeat steps 4 through 6 as a second pass.
8. Repeat steps 4 through 6 as a third pass.
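
As a cross-check on the spreadsheet work, steps 3 through 8 can be sketched in Python. This is only an illustration under stated assumptions, not the required method (the assignment asks for standard Excel functions). The sample points, the choice of two clusters, the seed, and all function names are made up for this example and are not taken from the attached data. The distance used is the Euclidean distance, one common reading of "a standard distance formula".

```python
import math
import random


def distance(point, centroid):
    """Step 4: Euclidean distance between a data point and a center point."""
    return math.hypot(point[0] - centroid[0], point[1] - centroid[1])


def assign(points, centroids):
    """Step 5: for each point, the index of the closest centroid."""
    return [
        min(range(len(centroids)), key=lambda j: distance(p, centroids[j]))
        for p in points
    ]


def update(points, labels, centroids):
    """Step 6: each new centroid is the mean of the points assigned to it."""
    new_centroids = []
    for j, old in enumerate(centroids):
        members = [p for p, lab in zip(points, labels) if lab == j]
        if members:
            mean_x = sum(x for x, _ in members) / len(members)
            mean_y = sum(y for _, y in members) / len(members)
            new_centroids.append((mean_x, mean_y))
        else:
            # Assumption: a cluster with no points keeps its old centroid.
            new_centroids.append(old)
    return new_centroids


def k_means_passes(points, centroids, passes=3):
    """Steps 4-8: run a fixed number of passes and record every centroid set."""
    history = [list(centroids)]
    for _ in range(passes):
        labels = assign(points, centroids)
        centroids = update(points, labels, centroids)
        history.append(centroids)
    return history, assign(points, centroids)


# Hypothetical two-dimensional data, NOT the values from the spreadsheet.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]

# Step 3: pick random starting centroids (here, two of the data points).
random.seed(0)  # fixed seed only so the illustration is repeatable
initial = random.sample(points, 2)

history, labels = k_means_passes(points, initial, passes=3)
for pass_number, centroids in enumerate(history):
    print(pass_number, centroids)
print("final cluster of each point:", labels)
```

The `history` list holds the initial center points followed by the center points after each pass, which mirrors the values the assignment asks you to report on the spreadsheet.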

Provide final answers on your Excel spreadsheet indicating your initial center points, second pass center points, and third pass center points. You must submit your spreadsheet and these answers to receive full credit.

You will use Excel to help with calculations, but only standard functions should be used (i.e., don't use a plug-in to perform the analysis for you). You need to show your work doing this analysis the long way. If you were to repeat steps 4 through 6 again, what would likely happen to the cluster centroids?

Attachment: Cluster-Data.rar
