Linguistics student at UFMG | Python & AI enthusiast | Building tools for corpus linguistics, NLP, and machine learning | Death Metal vocalist.
- Universidade Federal de Minas Gerais (UFMG)
- Brazil
- jhonatanhlopes
Highlights
- Pro
Popular repositories
- CorpusAid (Public)
Automated text preprocessing pipeline for large corpora. Features customizable filters for diacritics, stop words, punctuation, and regex.
Python
- CorpusAid-PDF (Public)
A robust and efficient PDF text extractor with intelligent column detection and layout preservation, supporting multiple output formats (TXT, HTML, Markdown, DOCX). Ideal for accurately extracting …
Python