This repository contains Jupyter notebooks of various topics, focusing on data analysis.
A descriptive analysis on the distribution of the use of light and heavy verbs in children, data obtained from the CHILDES Corpus.
A webpage scrapping exercise, using UCSD's Linguistics Department course offering webpage as a data set to look into the distribution of course offerings over the past four years.
An experiment based on a simulated dataset, focusing on experimental design and statistical methods; in addition to a section on how misleading results can be obtained.
(Final Project for LIGN 251: Statistical Methods in Linguistics)