Repository for the course "Statistical Learning for Big Data" at Chalmers University of Technology.
Project 1 - Classifier Strength
Comparing three different classifications methods: KNN, Logistic Regression and LDA. Using k-fold CV and missclassification rate to determine strength.
Investigating the impact of linkage (ward and centroid) on hierarchical clustering and covariance type (VEE and EEV) on GMM clustering.
Exploring group lasso with simulated data. Escpecially looking at how the method handles false/scrambled grouping.
Comparing NMF clustering and k-means on a subset of the MNIST fashion dataset.
Code and answers to the exam.