- Implemented a Dynamic-Programming-based Word-Break Tokenizer for English and Janpanese.
- Implement a LSM disk-based inverted index with tiering merging policy and positional information which support insertion, keyword search, boolean AND search, boolean OR search, Phrase Search and further enhance the performance with data compression using Gamma encoding.
- Use Term Frequency - Inverse Document Frequency(TF-IDF) and page rank algorithm enable the search function for all UCI ICS webpages.
-
Notifications
You must be signed in to change notification settings - Fork 0
A search engine based on LSM tree architecture.
License
FuyaoLi2017/LSM-Tree-Based-Search-Engine
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
A search engine based on LSM tree architecture.
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published