Welcome to the repository of the NLP course for Digital Methods in the Humanities
This course offers an introduction to Natural Language Processing (NLP) and its application in digital humanities. The course is part of the Master's program "Digital Methods for Humanities and Cultural Studies (DMGK)" in Mainz.
Course Website: https://ieg-dhr.github.io/NLP-Course4Humanities_2024/
The course covers the following topics:
- Introduction to NLP, Jupyter Notebooks, and Python
- Using SpaCy, SKLEARN and NLTK for NLP tasks
- German Newspaper Portal and its API
- Transformer models for semantic search and text similarity (Word Embeddings)
- Large Language Models (LLMs) for Semantic Text Extraction (Article Extraction) and Post-OCR Correction
- Named Entity Recognition (NER) and Text Classification
index.html
: Main page of the coursestyles.css
: CSS stylesheet for the course websitedatasets/
: Folder for course materials and resources