Docr 🚀

1. Overview 🌟

🛠️ Component design with module-based functionality, allowing for on-demand feature acquisition, 🚀 easy to expand, and flexible to use, just like playing with building blocks!

Docr is a modular component-based toolkit for document analysis and processing. It's designed with flexibility and extensibility in mind, making it easy to expand and use various document processing functionalities as needed.

2. Features 🛠️

📄 Layout Analysis
🔢 Formula Detection and Recognition
📝 Optical Character Recognition (OCR)
📊 Table Structure Recognition
📚 Reading Order Analysis
🖼️ Image Processing Utilities

3. Installation and Usage 📦

3.1 Prerequisites

Python 3.10 or higher
Poetry (for dependency management)

3.2 Setup

Clone the repository:

git clone https://github.com/yjmm10/docr.git
cd docr
git clone https://huggingface.co/liferecords/Telos.git docr/models

Install dependencies:
```
poetry install -v
```

3.3 Usage

Here's a quick example of how to use Docr for OCR:

from docr import OCR
import cv2

# Initialize the OCR model
ocr_model = OCR()

# Read an image
image = cv2.imread("path/to/your/image.png")

# Perform OCR
result = ocr_model(image)

print(result)

Docr comes with a Streamlit-based web UI for easy demonstration of its capabilities:

Run the demo:
```
streamlit run webui/demo.py
```
Open your browser and navigate to the provided URL (usually http://localhost:8501)
Upload an image and select the model you want to use for processing

Docr also provides a FastAPI-based API service for integration into other applications:

Start the API server:

uvicorn api.docr_api:app --host 0.0.0.0 --port 8000

The API documentation will be available at http://localhost:8000/docs

4. Development 🔬

For detailed information on development, please refer to the development guide. This guide will help you set up your IDE for working with Docr, including SRC Layout configuration.

5. Contributing 🤝

We welcome contributions! Please see our Contributing Guidelines for more details.

6. License 📄

Docr is released under the MIT License. See the LICENSE file for more details.

7. Contact 📧

For any questions or feedback, please contact the project maintainer: liferecords [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 71 Commits
.github/workflows		.github/workflows
api		api
docr		docr
docs		docs
tests		tests
webui		webui
.dockerignore		.dockerignore
.editorconfig		.editorconfig
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
README_zh.md		README_zh.md
Telos_test.py		Telos_test.py
demo.py		demo.py
mkdocs.yml		mkdocs.yml
poetry.lock		poetry.lock
pyproject.toml		pyproject.toml
tox.ini		tox.ini

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Docr 🚀

1. Overview 🌟

2. Features 🛠️

3. Installation and Usage 📦

3.1 Prerequisites

3.2 Setup

3.3 Usage

4. Development 🔬

5. Contributing 🤝

6. License 📄

7. Contact 📧

About

Releases 2

Packages

Languages

License

yjmm10/Docr

Folders and files

Latest commit

History

Repository files navigation

Docr 🚀

1. Overview 🌟

2. Features 🛠️

3. Installation and Usage 📦

3.1 Prerequisites

3.2 Setup

3.3 Usage

4. Development 🔬

5. Contributing 🤝

6. License 📄

7. Contact 📧

About

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Languages

Packages