Logo

CB2030

Here we store material for the Data Analysis part of the CB2030, Systems Biology course.

This project is maintained by statisticalbiotechnology

Preparations for lecture on Clustering

As a preparation for the Clustering convocation, make yourself acquainted with the material below, and subsequently complete any tasks assigned to you in canvas.

  1. Watch the Video Lecture on Clustering, and its slides
  2. Read the section in VaderPlas on k-Means Clustering from beginning of section to the examples.
  3. Read up VaderPlas on Gaussian Mixture Models from beginning of section to the example.
  4. Investigate the jupyter notebook on Clustering of Breast Cancer Data. There are associated study questions here.
  5. Read previous years questions and answers on the material.

Additional Material:

  1. Comparison of k-means and GMMs
  2. StatQuest on k-means
  3. Wikipedia on Voronoi diagrams, k-means