Instructor: Quan Nguyen, Ph.D., Teaching Fellow, Master of Data Science
Department of Statistics, University of British Columbia
- GitHub Repository
- Course Jupyter Book
- Zoom link
- Dataset can be downloaded here: https://drive.google.com/file/d/1QzQ6P0ewgx_hgHC-qJ3nCBRAr1139w6N/view?usp=sharing
About the workshop:
Data in learning analytics research (e.g. SIS, clickstream, log-files) are often rich in temporal features that could allow us to explore the dynamic changes in learning behavior at different time granularities (e.g. seconds, days, weeks, semesters). This workshop will introduce participants to several common temporal/sequential analysis methods and techniques using R. During the workshop, we will discuss how temporal analysis can be applied to answer RQs in learning analytics and which learning constructs are relevant to temporal analysis. Next, we will go through techniques to explore and visualize temporal/sequential data. Participants will learn and apply two types of temporal models: a) explanatory models using statistical techniques, such as Sequence Analysis with application to identify common patterns of learning activities from log data and b) association rule mining (apriori) with application to detect courses that are frequently taken together and create association rules for course recommendations.
Target Audience:
This workshop is designed for anyone interested in temporal/sequential analysis. No experience in temporal and sequential analysis is required. To get the best learning experience, participants should familiarize themselves with basic statistics and machine learning concepts (e.g. regression, variance, autocorrelation, classification, cross-validation, overfitting).
Day | Lecture | Lab | Resources |
---|---|---|---|
1 | Exploratory sequence analysis with TraMineR | "labs/release/Lab1.Rmd" | Chap 1-8 |
2 | Sequence clustering using TraMineR | "labs/release/Lab2.Rmd" | Chap 9 |
3 | Association rules and frequent sequence mining | TBD | Introduction to arules, SPADE algorithm |
To make the workshop as engaged as possible, I would encourage you to:
- Turn on your camera during the workshop
- Raise your hand if you have questions throughout the workshop
- Feel free to share your thoughts, reactions, questions in the zoom chat. Use emojis! 😀
- Please participate in breakout rooms activities
The workshop will use R and draw on packages such as TraMineR
, cluster
, arules
, arulesViz
, arulesSequences
, and tidyverse
.
You can install all the dependencies you need for the workshop in Rstudio using:
install.packages(c("TraMineR","cluster","arules","arulesViz","arulesSequences","tidyverse"))
I am a Teaching Fellow at the UBC Master of Data Science where I develop and teach data science courses. Prior to UBC, I was a Postdoctoral Research Fellow in Learning Analytics at the School of Information, University of Michigan. My research focuses on analysing digital traces in educational settings to detect temporal discrepancies between course design and student engagement, and the subsequent effects on academic performance. My second research strand is to model peer interactions and peer influence on academic choices and performance. My work employed a wide range of statistical and computational techniques, such as multilevel modeling, network analysis, time-series/temporal analysis, and machine learning. My work has received multiple best paper awards at established conferences (e.g. LAK18 and HCI International 17).
-
Forecasting Principles and Practice (FPP), Hyndman & Athanasopoulos (2021)
-
Time Series Analysis and its Application: With R Examples (TSAA), Shumway & Stoffer (2017)
-
Knight, S., Friend Wise, A., & Chen, B. (2017). Time for Change: Why Learning Analytics Needs Temporal Analysis. Journal of Learning Analytics, 4(3), 7–17. https://doi.org/10.18608/jla.2017.43.2
-
Chen, B., Knight, S., & Wise, A. F. (2018). Critical Issues in Designing and Implementing Temporal Analytics. Journal of Learning Analytics, 5(1), 1–9. https://doi.org/10.18608/jla.2018.53.1
© 2022 Quan Nguyen
Software licensed under the MIT License, non-software content licensed under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) License.