Bugfix and maintenance release:
- Refine heuristic to filter out tall-ish whitespace elements that can throw off text chunking by considering realistic font sizes (thanks @travisbeale !)
- Lots of code cleanups and refactors (thanks @ZaqueuCavalcante, @Milchreis, @GustavAT !)
- Update PDFBox to 2.0.24