Skip to content

Workshop (6 hours): Clustering (Hdbscan, LCA, Hopach), dimension reduction (UMAP, GLRM), and anomaly detection (isolation forests).

License

Notifications You must be signed in to change notification settings

dlab-berkeley/Unsupervised-Learning-in-R

Repository files navigation

Unsupervised Learning in R

Unsupervised machine learning is a class of algorithms that identifies patterns in unlabeled data, i.e. without considering an outcome or target. This workshop will describe and demonstrate powerful unsupervised learning algorithms used for clustering (hdbscan, latent class analysis, hopach), dimensionality reduction (umap, generalized low-rank models), and anomaly detection (isolation forests). Participants will learn how to structure unsupervised learning analyses and will gain familiarity with example code that can be adapted to their own projects.

Author: Chris Kennedy

Prerequisites

This is an intermediate machine learning workshop. Participants should have significant prior experience with R and RStudio, including manipulation of data frames, installation of packages, and plotting.

Prerequisite workshops

Recommended workshops

Technology requirements

Participants should have access to a computer with the following software:

Initial steps for participants

To prepare for the workshop, please download the materials and work through the package installation in 0-install.Rmd. Please report any errors to the GitHub issue queue.

There is also an RStudio Cloud workspace that can be used.

Reporting errors or giving feedback

Please create a GitHub issue to report any errors or give feedback on this workshop.

Resources

Books

Releases

No releases published

Packages

No packages published

Languages