An interactive web application that extracts structured entities (like names, addresses, dates, etc.) from unstructured documents.
- Document Support: Load and analyze various document types, e.g. Property Appraisal Reports, Credit Reports, Title Deeds, Home Loan Agreements
- Flexible Entity Types: Extract multiple types of entities. For example: Names, Addresses, Dates, Personal identifiers, Document identifiers
- Interactive UI:
- Real-time entity highlighting
- Dark/Light theme support
- Responsive design
- Sticky navigation for easy document review
- Clone this repository
- Serve the files using any static web server
- Open
index.html
in a modern browser
- Select a document type from the dropdown or paste your text
- Specify the entities you want to extract (one per line)
- Click "Analyze" to process the document
- Review the extracted entities in the results panel
- Frontend: Vanilla JavaScript with ESM modules
- Styling: Bootstrap 5.3 with Bootstrap Icons
- Data Processing: Python (via Pyodide Web Worker)
- Visualization: D3.js
- Markdown Processing: Marked
- Syntax Highlighting: highlight.js
- Requires login through LLM Foundry
- Authentication handled via
llmfoundry.straive.com
Built with modern web standards:
- ES Modules for JavaScript
- Async/await for asynchronous operations
- Web Workers for computation
- Responsive Bootstrap components
Designed by Gramener
This project is licensed under the MIT License. See the LICENSE file for details.