Assignment 9
Contents
9. Assignment 9#
Due: 2022-11-11
9.1. #
Eligible skills: (links to checklists)
9.3. Instructions#
Use the same dataset you used for assignment 7, unless there was a problem, or pick one of the recommended ones for that assignment if you did not complete assignment 7.
Describe what question you’d be asking in applying clustering to this dataset.
Apply Kmeans using the known, correct number of clusters, \(K\).
Evaluate how well clustering worked on the data:
using a true clustering metric and
using visualization and
using a clustering metric that uses the ground truth labels
Include a discussion of your results that addresses the following:
describes what the clustering means
what the metrics show
Does this clustering work better or worse than expected based on the classification performance (if you didn’t complete assignment 7, also apply a classifier)
Repeat your analysis using a 2 different numbers (1 higher, one lower) of clusters:
can you interpret the new clusters?
how to they relate to the original clusters? are they completely different, did one split?
is there a reasonable explanation for more clusters than there are classes in this dataset?