Materials are written by Adrien Couturier. The sessions are taught by Ivan Sayapin, Jiong Wei Lua and Adrien Couturier.
This will be a great opportunity for all students interested in getting involved with ML@LSE to meet the committee and hear about the slew of events we have lined up for the upcoming Academic Year.
As we only have one hour, we will try to focus on:
- A gentle introduction to machine learning, with interactive visualisations and interesting use cases
- Sharing the events we have lined up for the term ahead, such as our pioneering Industry Mentorship Programme with Datatonic, as well as what our bootcamp expects to cover
- Getting to know what people are interested in so that we can improve our activities
- If there is time and/or demand, a short interactive activity on building a highly accurate image classifier for the MNIST digits dataset in under 500 lines of code (a minimal sketch follows this list)
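To give a flavour of how little code this takes, here is a minimal sketch of such a classifier. It uses scikit-learn's small bundled digits dataset as a stand-in for full MNIST and a plain logistic regression as the model; the actual activity may use different data loading and a different model.

```python
# Minimal digit classifier: scikit-learn's bundled 8x8 digits
# dataset stands in for full MNIST here.
from sklearn.datasets import load_digits
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

digits = load_digits()
X_train, X_test, y_train, y_test = train_test_split(
    digits.data, digits.target, test_size=0.2, random_state=0)

clf = LogisticRegression(max_iter=1000)  # a simple baseline classifier
clf.fit(X_train, y_train)
print(f"Test accuracy: {clf.score(X_test, y_test):.3f}")
```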
The machine learning wave is coming, so join us and ride it!
This first workshop aims to cover the general principles of machine learning theories and techniques.
We also aim to help with installing Jupyter Notebook, the development environment we will be using for all subsequent bootcamps.
If time permits, we may also introduce some of the packages that we will be using frequently for subsequent bootcamps, such as pandas, numpy and sklearn.
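For a first taste of what these packages do, the sketch below builds a toy dataset with numpy, wraps it in a pandas DataFrame, and fits a line with sklearn; the numbers are made up for illustration.

```python
import numpy as np
import pandas as pd
from sklearn.linear_model import LinearRegression

# numpy: fast numerical arrays
x = np.linspace(0, 10, 50)
y = 3 * x + np.random.normal(scale=2, size=50)  # noisy linear data

# pandas: labelled, tabular data with handy summaries
df = pd.DataFrame({"x": x, "y": y})
print(df.describe())

# sklearn: a consistent fit/predict interface across models
model = LinearRegression().fit(df[["x"]], df["y"])
print(model.coef_, model.intercept_)  # should be close to 3 and 0
```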
Objectives: Understand the basic concepts and notions underpinning machine learning theories and techniques.
Requirements: Basic definitions of random variables, expectation and variance. Basic knowledge of linear algebra may be useful.
Keywords: Dataset, Number of Observations, Dimensionality, Machine learning techniques, High dimensional statistics, Statistical pattern, Supervised Learning, Unsupervised learning, Learning Function, Inputs and Outputs, Training Data, Test Data, Irreducible Error, Regression, Classification, Loss, Risk, Empirical Risk, MSE, MER, Testing Errors, Overfitting, Generalization, Training vs. Testing Errors, Bias-Variance Trade-off.
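Several of these keywords fit into one small experiment. The sketch below (our own illustrative setup, not the bootcamp's actual example) fits polynomials of increasing degree to noisy data and compares training MSE with testing MSE; the high-degree fit overfits, driving training error down while testing error rises.

```python
import numpy as np
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import PolynomialFeatures

rng = np.random.default_rng(0)
X = rng.uniform(-3, 3, size=(100, 1))
y = np.sin(X).ravel() + rng.normal(scale=0.3, size=100)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

for degree in (1, 3, 15):
    model = make_pipeline(PolynomialFeatures(degree), LinearRegression())
    model.fit(X_tr, y_tr)
    train_mse = mean_squared_error(y_tr, model.predict(X_tr))
    test_mse = mean_squared_error(y_te, model.predict(X_te))
    print(f"degree {degree:2d}: train MSE {train_mse:.3f}, test MSE {test_mse:.3f}")
```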
What is Anaconda? Anaconda is a Python and R distribution. It aims to provide everything you need (Python-wise) for data science "out of the box".
It includes:
- The core Python language
- 100+ Python "packages" (libraries) such as scikit-learn, numpy and pandas
- Spyder (an IDE/editor, like PyCharm) and Jupyter
Guide to Installing Anaconda Distribution
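Once Anaconda is installed, a quick way to confirm the core packages came with it is to run the following in Spyder or a Jupyter notebook (your version numbers will differ):

```python
import sys
import numpy
import pandas
import sklearn

print(sys.version)                          # the Python shipped with Anaconda
print("numpy:", numpy.__version__)
print("pandas:", pandas.__version__)
print("scikit-learn:", sklearn.__version__)
```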
This bootcamp aims to introduce members to a category of commonly used machine learning models known as tree-based models.
We will assume that all attending members have installed the Anaconda Distribution on their computer. If you haven't, you may follow the instructions here to download it.
Objectives: Understand what tree-based methods are and how they are built. Understand how Cross-Validation can be applied to decision trees. Understand how ensemble methods can improve the power of our techniques.
Requirements: Introductory Bootcamp (you can read the slides if you didn’t attend). Familiarity with the notions of independence and correlation may be useful.
Keywords: Decision tree, Nodes, Recursive binary splitting, Pruning, Cost complexity, Bootstrapping, Bagging, Random forest, Boosting.
We will build a Random Forest model to predict survivorship in the Titanic dataset.
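As a preview, here is a minimal sketch of that exercise. It assumes a local file named "titanic.csv" with the usual Kaggle columns (the file name and feature choices are ours); the bootcamp version will go into more depth.

```python
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split

df = pd.read_csv("titanic.csv")                   # assumed local copy of the data
df["Sex"] = (df["Sex"] == "female").astype(int)   # encode sex as 0/1
df["Age"] = df["Age"].fillna(df["Age"].median())  # fill missing ages

features = ["Pclass", "Sex", "Age", "Fare"]
X_tr, X_te, y_tr, y_te = train_test_split(
    df[features], df["Survived"], random_state=0)

forest = RandomForestClassifier(n_estimators=200, random_state=0)
forest.fit(X_tr, y_tr)
print(f"Test accuracy: {forest.score(X_te, y_te):.3f}")
```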
Objectives: Understand linear classification methods. Understand their generalization to non-linear classification. Get a sense of the Kernel idea and Support Vector Machines.
Requirements: Introductory Bootcamp (you can read the slides if you didn’t attend). Although not necessary, familiarity with notions of linear algebra may greatly help: hyperplanes, dot and inner products. Some familiarity with constrained optimization may help.
Keywords: Hyperplane, Margin, Maximal Margin Classifier, Soft Margin Classifier, Non-linear Boundaries, Inner Product, Kernels, Support Vector Machines.
We will build a Support Vector Machine to classify bank customers' risk of defaulting on their credit card.
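To preview the mechanics, the sketch below trains an SVM with an RBF kernel on synthetic data standing in for the credit-default dataset (the data and parameters are illustrative only):

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for the credit-default data
X, y = make_classification(n_samples=1000, n_features=8, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# SVMs are sensitive to feature scale, so standardise first;
# the RBF kernel yields a non-linear decision boundary.
svm = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0))
svm.fit(X_tr, y_tr)
print(f"Test accuracy: {svm.score(X_te, y_te):.3f}")
```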
Objectives: Understand two important unsupervised learning methods: Principal Component Analysis and Clustering. Understand the difference between k-means clustering and Hierarchical Clustering.
Requirements: Introductory Bootcamp (you can read the slides if you didn't attend). Although not necessary, familiarity with notions of linear algebra may help: inner product and orthogonal projections. Strong understanding of the summation operator (∑_{i=1}^{n}, ∑_{j∈C}) may help.
Keywords: PCA, Loading vector, Principal Components, Proportion of variance explained, Clustering, k-means Clustering, Within-cluster Variation, Hierarchical Clustering, Minimal Intercluster Dissimilarity, Dendrogram.
We will use PCA and Clustering Techniques to segment an unlabelled customer dataset into meaningful sub-groups.
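The pipeline looks roughly like the sketch below, which runs PCA and k-means on synthetic "customers" (the real exercise will use an actual unlabelled customer dataset, and the number of clusters is a choice, not a given):

```python
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

# Synthetic stand-in for an unlabelled customer dataset
X, _ = make_blobs(n_samples=500, n_features=10, centers=4, random_state=0)
X = StandardScaler().fit_transform(X)

pca = PCA(n_components=2)                 # project onto 2 principal components
X_2d = pca.fit_transform(X)
print("Proportion of variance explained:", pca.explained_variance_ratio_)

kmeans = KMeans(n_clusters=4, n_init=10, random_state=0)
labels = kmeans.fit_predict(X_2d)         # assign each "customer" to a segment
print(labels[:20])
```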
Objectives: Understand the structure of feedforward neural networks. Understand how feedforward neural networks are trained using backpropagation.
Requirements: Introductory Bootcamp (you can read the slides if you didn’t attend). Although not necessary, familiarity with partial derivatives, the chain rule and the gradient of a multivariate function may help.
Keywords: Perceptron, Weights, Biases, Activation function, Sigmoid activation function, Neural Network, Gradient descent, Backpropagation.
We will use the Keras package to implement a simple neural network on the canonical MNIST dataset.
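A minimal version of that network might look like the sketch below (layer sizes, activations and epoch count are illustrative; the bootcamp version may differ):

```python
from tensorflow import keras

# MNIST ships with Keras; scale pixel values to [0, 1]
(x_tr, y_tr), (x_te, y_te) = keras.datasets.mnist.load_data()
x_tr, x_te = x_tr / 255.0, x_te / 255.0

model = keras.Sequential([
    keras.layers.Flatten(input_shape=(28, 28)),     # 28x28 image -> 784 inputs
    keras.layers.Dense(128, activation="sigmoid"),  # one hidden layer
    keras.layers.Dense(10, activation="softmax"),   # one output per digit
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
model.fit(x_tr, y_tr, epochs=5, validation_data=(x_te, y_te))
```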
Objectives: Understand how to extract features from text, how machine learning can be applied to text data, and how to scrape text data from the web.
Requirements: Introductory Bootcamp (you can read the slides if you didn’t attend), and basic Python programming skills
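For a taste of the feature-extraction step, the sketch below turns a toy corpus (standing in for scraped text) into a TF-IDF document-term matrix with scikit-learn:

```python
from sklearn.feature_extraction.text import TfidfVectorizer

corpus = [
    "machine learning at LSE",
    "learning to scrape text from the web",
    "text data needs numeric features",
]
vec = TfidfVectorizer()
X = vec.fit_transform(corpus)          # sparse document-term matrix
print(vec.get_feature_names_out())     # the learned vocabulary
print(X.toarray().round(2))            # TF-IDF weights per document
```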
Objectives: Understand the intuition behind the LIME technique, and be able to leverage the LIME package in Python for your own uses.
Requirements: Introductory Bootcamp (you can read the slides if you didn’t attend), and basic Python programming skills
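As a preview of the LIME package in action, the sketch below explains a single prediction of a random forest on the Iris dataset (the model and data are illustrative; install the package with pip install lime):

```python
from lime.lime_tabular import LimeTabularExplainer
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

iris = load_iris()
clf = RandomForestClassifier(random_state=0).fit(iris.data, iris.target)

explainer = LimeTabularExplainer(
    iris.data,
    feature_names=iris.feature_names,
    class_names=list(iris.target_names),
    mode="classification",
)
exp = explainer.explain_instance(iris.data[0], clf.predict_proba,
                                 num_features=4)
print(exp.as_list())   # per-feature contributions to this one prediction
```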