Entity Extraction

An interactive web application that extracts structured entities (like names, addresses, dates, etc.) from unstructured documents.

Features

Document Support: Load and analyze various document types, e.g. Property Appraisal Reports, Credit Reports, Title Deeds, Home Loan Agreements
Flexible Entity Types: Extract multiple types of entities. For example: Names, Addresses, Dates, Personal identifiers, Document identifiers
Interactive UI:
- Real-time entity highlighting
- Dark/Light theme support
- Responsive design
- Sticky navigation for easy document review

Getting Started

Clone this repository
Serve the files using any static web server
Open index.html in a modern browser

Usage

Select a document type from the dropdown or paste your text
Specify the entities you want to extract (one per line)
Click "Analyze" to process the document
Review the extracted entities in the results panel

Technology Stack

Frontend: Vanilla JavaScript with ESM modules
Styling: Bootstrap 5.3 with Bootstrap Icons
Data Processing: Python (via Pyodide Web Worker)
Visualization: D3.js
Markdown Processing: Marked
Syntax Highlighting: highlight.js

Authentication

Requires login through LLM Foundry
Authentication handled via llmfoundry.straive.com

Development

Built with modern web standards:

ES Modules for JavaScript
Async/await for asynchronous operations
Web Workers for computation
Responsive Bootstrap components

Credits

Designed by Gramener

License

This project is licensed under the MIT License. See the LICENSE file for details.