title | authors | |
---|---|---|
Open-Source Search Engine with Apache Lucene / Solr |
|
Provides integrated research tools for easier searching, monitoring, analytics, discovery & text mining (of heterogenous & large document sets & news) with free software on your own server.
Easy full text search across multiple data sources and many different file formats. Just enter a search query (which can include powerful search operators) and navigate through the results.
Based on a thesaurus the multilingual semantic search engine will find synonyms, hyponyms and aliases, too. Using heuristics for grammar rules like stemming it can find other word forms, too.
Easy navigation through many results with interactive filters (faceted search) which aggregate an overview over and interactive filters for (meta) data like authors, organizations, persons, places, dates, products, tags or document types.
Explore your data or search results with an overview of aggregated search results by different facets with named entities (i.e. file paths, tags, persons, locations, organisations or products), while browsing with comfortable navigation through search results or document sets.
View previews (i.e. PDF, extracted Text, Table rows or Images).
Analyze or review document sets by preview, extracted text or wordlists for textmining.
Tag your documents with keywords, categories, names or text notes that are not included in the original content to find them better later (document management & knowledge management) or in other research or search contexts or to be able to filter annotated or tagged documents by interactive filters (faceted search).
Or evaluate, value or assess or filter documents (i.e. for validation or collaborative filtering).
Visualizing data such as:
- document dates as trend charts
- text analysis as word clouds
- connections and networks in visual graph view
- view results with geodata as interactive maps.
Stay informed via watchlists for:
- news alerts from media monitoring
- activity streams of new or changed documents on file shares
You can subscribe to searches and filters as RSS-Newsfeed and get notifications when there are changed or new documents, news or search results for your keywords, search context or filter.
Open Semantic Search can help you index and search your data whether you are working with:
- structured data like databases, tables or spreadsheets
- unstructured data like text documents
- E-Mails
- even scanned legacy documents
- text files
- Microsoft Office, OpenOffice, and LibreOffice docuemnts including Excel and Calc
- CSV
- Images (photos, pictures, JPG, TIFF)
- Videos
And that isn't all, see a full list of supported file formats.
You can find all your data in one place. Search many different data sources like files and folders, file server, file shares, databases, websites, Content Management Systems, RSS-Feeds and more.
The Connectors and Importers of the Extract Transform Load (ETL) framework for Data Integration connect and combine multiple data sources and, as an integrated document analysis and data enrichment framework, it enhances the data with the analysis results of diverse analytics tools.
Optical character recognition (OCR) or automatic text recognition for images and text content stored in graphical format like scanned legacy documents, screenshots or photographed documents in the form of image files or embedded in PDF files.
Open-Source enterprise search and information retrieval technology based on interoperable open standards
Open Semantic Search can be used with every desktop (Linux, Windows or Mac) and web browser. With its responsive design and open standards like HTML5 it is possible to search with tablets, smartphones and other mobile devices as well.
Structure your research, investigation, navigation, document sets, collections, metadata forms or notes in a Semantic Wiki, Drupal or another content management system (CMS) or with an innovative annotation framework with taxonomies and custom fields for tagging documents, annotations, linking relationships, mapping and structured notes. You can integrate powerful and flexible metadata management or annotation tools using interoperable open standards like the Resource Description Framework (RDF) and the Simple Knowledge Organization System (SKOS).
Using file monitoring, new or changed files are indexed within seconds without requiring frequent recrawls (which is not possible often if there are many files). Colleagues are able to find new data immediately without (often forgotten) uploads to a data or document management system (DMS) or filling out a data registration form for each new or changed document or dataset in a data management system, data registry or digital asset management (DAM) system.