A toy project to put some Data Science concepts into practice by analysing songs from Vagalume (vagalume.com.br).
I don't know yet how I'll achieve each step of the data pipeline, so I'll add new tasks here as I figure them out:
- structure the directories
- configure linter
- create automated tests
- create docs about the software architecture
- write instructions about how to run the project properly
- create connection with database
- create database structure
- get all songs from a single artist
- save those songs in a sqlite database
- remove stopwords?
- create wordcloud?
- get some insights and document them
- ?
- ?
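
As a rough idea of how the "save those songs in a sqlite database" and "remove stopwords?" tasks could fit together, here is a minimal sketch. It assumes Python and the standard-library `sqlite3` module; the `songs` table, its columns, the tiny `STOPWORDS` set and all function names are hypothetical placeholders, not decisions already made for this project. Fetching the songs from Vagalume is left out on purpose, so the example uses made-up lyrics.

```python
import sqlite3
from collections import Counter

# Hypothetical, deliberately tiny stopword list; a real one would be much longer.
STOPWORDS = {"a", "o", "e", "de", "que", "the", "and"}


def create_schema(conn):
    """Create the (assumed) songs table if it does not exist yet."""
    conn.execute(
        """
        CREATE TABLE IF NOT EXISTS songs (
            id INTEGER PRIMARY KEY,
            artist TEXT NOT NULL,
            title TEXT NOT NULL,
            lyrics TEXT NOT NULL,
            UNIQUE (artist, title)
        )
        """
    )


def save_songs(conn, artist, songs):
    """Store (title, lyrics) pairs for one artist, skipping duplicates."""
    conn.executemany(
        "INSERT OR IGNORE INTO songs (artist, title, lyrics) VALUES (?, ?, ?)",
        [(artist, title, lyrics) for title, lyrics in songs],
    )
    conn.commit()


def word_counts(conn, artist):
    """Count words across an artist's lyrics, ignoring stopwords."""
    counts = Counter()
    rows = conn.execute("SELECT lyrics FROM songs WHERE artist = ?", (artist,))
    for (lyrics,) in rows:
        # Lowercase each word and trim surrounding punctuation.
        words = (w.strip(".,!?;:\"'()").lower() for w in lyrics.split())
        counts.update(w for w in words if w and w not in STOPWORDS)
    return counts


# Usage with made-up data and an in-memory database.
conn = sqlite3.connect(":memory:")
create_schema(conn)
save_songs(
    conn,
    "Example Artist",
    [("Song One", "The sun and the moon"), ("Song Two", "Sun, sun, sun!")],
)
print(word_counts(conn, "Example Artist").most_common(2))
```

The resulting counts could later feed a wordcloud, if that task turns out to be worth doing.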