Skip to content

Distributed system based on Apache Kafka to run OCR on images and extract images and texts from PDF files.

License

Notifications You must be signed in to change notification settings

diging/giles-eco-giles-web

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Giles Ecosystem

The Giles Ecosystem is a distributed system to run OCR on images and extract images and texts from PDF files. This repository contains the user-facing component of this system called "Giles". The system requires the following software:

The core components of the Giles Ecosystem are located in the following repositories:

The above applications have dependencies to libraries located in the following repositories:

Additionally, Giles depends on:

There are some additional components of the Giles Ecosystem that can be added if required:

There is a Docker Compose file for testing and evaluation purposes that sets up the Giles Ecosystem in Docker. You can find that file here: https://github.com/diging/giles-eco-docker

You can detailed installation information and the documentation of Giles' API here.