Lab 7: Clustering and Dimensionality Reduction

The slides I showed this week can be found here.

We discussed various dimensionality reduction techniques, which are used to project high-dimensional data into a low-dimensional space while preserving the clusters from the high-dimensional space. These included:
- Principal Component Analysis (PCA)
- Multidimensional Scaling (MDS)
- Sparse Random Projection
- Locally Linear Embedding
- t-Distributed Stochastic Neighbor Embedding (t-SNE)
- Uniform Manifold Approximation and Projection (UMAP)
We also applied each of these methods to the MNIST dataset of hand-drawn digits, projecting the 784-dimensional MNIST vectors into both 2 and 3 dimensions and visualizing the results. The code we used to create these visualizations can be found here.
We discussed common pitfalls that can lead to misreadings of t-SNE plots