# COSC6337 Web Mining and Information Retrieval: Homework Two

Due Date: 10/25/2015 11:59pm

1. Based on the following confusion matrix Cluster...

A. Compute the entropy and purity of each cluster and of the overall clustering. The purity of a cluster is the number of pages belonging to its most frequent class divided by the total number of pages in the cluster.

B. Compute the precision of the class “Sports” with respect to each cluster and to the overall clustering. Precision measures the proportion of correct pages among all the pages returned.

C. Compute the recall of the class “Entertainment” with respect to each cluster and to the overall clustering. Recall measures the proportion of correct pages returned among all the correct pages available on the Web.
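
The measures in parts A to C can be sketched in Python as follows. This is only an illustration under stated assumptions: the counts in `confusion` are hypothetical and are not the assignment's matrix, entropy is taken with a base-2 logarithm, and the overall entropy and purity are taken as averages weighted by cluster size. Each cluster is treated as the set of "returned" pages when computing precision and recall for a class. How precision and recall should be aggregated over the whole clustering is not defined in the text above, so that step is left out.

```python
import math

# Hypothetical counts, NOT the assignment's matrix: each cluster maps
# class name -> number of pages of that class placed in the cluster.
confusion = {
    "Cluster 1": {"Sports": 6, "Entertainment": 2},
    "Cluster 2": {"Sports": 2, "Entertainment": 6},
    "Cluster 3": {"Sports": 4, "Entertainment": 0},
}

def entropy(counts):
    """Entropy of one cluster, base 2 (assumed); empty classes contribute 0."""
    n = sum(counts.values())
    return sum(-(c / n) * math.log2(c / n) for c in counts.values() if c > 0)

def purity(counts):
    """Share of the cluster taken by its most frequent class."""
    return max(counts.values()) / sum(counts.values())

def overall(matrix, metric):
    """Average of a per-cluster metric, weighted by cluster size (assumed)."""
    total = sum(sum(c.values()) for c in matrix.values())
    return sum(sum(c.values()) / total * metric(c) for c in matrix.values())

def precision(matrix, cls, cluster):
    """Pages of class cls in the cluster, divided by the cluster size."""
    counts = matrix[cluster]
    return counts[cls] / sum(counts.values())

def recall(matrix, cls, cluster):
    """Pages of class cls in the cluster, divided by the class total."""
    class_total = sum(c[cls] for c in matrix.values())
    return matrix[cluster][cls] / class_total

for name, counts in confusion.items():
    print(
        name,
        "entropy=%.3f" % entropy(counts),
        "purity=%.3f" % purity(counts),
        "precision(Sports)=%.3f" % precision(confusion, "Sports", name),
        "recall(Entertainment)=%.3f" % recall(confusion, "Entertainment", name),
    )

print("overall entropy=%.3f" % overall(confusion, entropy))
print("overall purity=%.3f" % overall(confusion, purity))
```

Assuming every page is assigned to exactly one cluster, the per-cluster recall values for a class sum to 1, which gives a quick sanity check on hand calculations.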