Solved: Define cluster density remains constant, Data Structure & Algorithms

Define cluster density remains constant

Assignment:

1. In CLIQUE, the threshold used to find cluster density remains constant, even as the number of dimensions increases. This is a potential problem since density drops as dimensionality increases; i.e., to find clusters in higher dimensions the threshold has to be set at a level that may well result in the merging of low-dimensional clusters. Comment on whether you feel this is truly a problem and, if so, how you might modify CLIQUE to address this problem.

2. Name at least one situation in which you would not want to use clustering based on SNN similarity or density.

3. Give an example of a set of clusters in which merging based on the closeness of clusters leads to a more natural set of clusters than merging based on the strength of connection (interconnectedness) of clusters.

4. We take a sample of adults and measure their heights. If we record the gender of each person, we can calculate the average height and the variance of the height, separately, for men and women. Suppose, however, that this information was not recorded. Would it be possible to still obtain this information? Explain.

5. Explain the difference between likelihood and probability.

6. Traditional K-means has a number of limitations, such as sensitivity to outliers and difficulty in handling clusters of different sizes and densities, or with non-globular shapes. Comment on the ability of fuzzy c-means to handle these situations.

7. Clusters of documents can be summarized by finding the top terms (words) for the documents in the cluster, e.g., by taking the most frequent k terms, where k is a constant, say 10, or by taking all terms that occur more frequently than a specified threshold. Suppose that K-means is used to find clusters of both documents and words for a document data set.

(a) How might a set of term clusters defined by the top terms in a document cluster differ from the word clusters found by clustering the terms with K-means?

(b) How could term clustering be used to define clusters of documents?

8. Suppose we find K clusters using Ward's method, bisecting K-means, and ordinary K-means. Which of these solutions represents a local or global minimum? Explain.

9. You are given a data set with 100 records and are asked to cluster the data. You use K-means to cluster the data, but for all values of K, 1 ≤ K ≤ 100, the K-means algorithm returns only one non-empty cluster. You then apply an incremental version of K-means, but obtain exactly the same result. How is this possible? How would single link or DBSCAN handle such data?

View Complete Question

Solution Preview :

Prepared by a verified Expert

Data Structure & Algorithms: Define cluster density remains constant

Reference No:- TGS03192717

Now Priced at $30 (50% Discount)

Recommended (95%)

Rated (4.7/5)

Have a Question? (oR Write a Review)

Write atleast 100 words!!

Solution Preview :

Prepared by a verified Expert

Data Structure & Algorithms: Define cluster density remains constant

Reference No:- TGS03192717

Have a Question? (oR Write a Review)

Recent Questions Asked Data Structure & Algorithms

Q : Explain drivers of value-driven enterprise risk management

Q : Describe any challenges you anticipate facing or have faced

Q : Compare the original agree and agree ii items

Q : How you would use the advocacy actions to shape your issue

Q : Define cluster density remains constant

Q : Describe the characteristics that make a disease appropriate

Q : How might religious leaders assist healthcare providers

Q : Define the database

Q : How each entity supports air traffic management

Why who actively promotes breastfeeding

Review the ana code of ethics for nurses

Discussion on photography analysis and cinema analysis

Evaluating sample annotated webliography introductions

Review the how to win friends and influence people

Describe how the adolescent brain weighs risk and reward

Review article marzano 13 best practices by sarah

Solution Preview :

Prepared by a verified Expert

Data Structure & Algorithms: Define cluster density remains constant

Reference No:- TGS03192717

Recent Questions Asked Data Structure & Algorithms

Q : Explain drivers of value-driven enterprise risk management

Q : Describe any challenges you anticipate facing or have faced

Q : Compare the original agree and agree ii items

Q : How you would use the advocacy actions to shape your issue

Q : Define cluster density remains constant

Q : Describe the characteristics that make a disease appropriate

Q : How might religious leaders assist healthcare providers

Q : Define the database

Q : How each entity supports air traffic management

Asked Questions