Skip to content

Literature

Max Marrone edited this page Nov 2, 2023 · 8 revisions

Full text search

https://swtch.com/~rsc/regexp/regexp4.html

https://github.com/cmu-sei/BigGrep

https://www.postgresql.org/docs/current/pgtrgm.html#PGTRGM-INDEX

https://github.com/bvaughn/js-search

https://www.npmjs.com/package/trigram-search (but this looks suspicious)

https://www.cockroachlabs.com/docs/stable/trigram-indexes

https://www.cockroachlabs.com/blog/use-cases-trigram-indexes/ briefly discusses shortcomings with precision

https://alexklibisz.com/2022/02/18/optimizing-postgres-trigram-search GiN and GiST indices?

https://pypi.org/project/setix/

https://en.wikipedia.org/wiki/MinHash

https://en.wikipedia.org/wiki/K-way_merge_algorithm

https://github.blog/2023-02-06-the-technology-behind-githubs-new-code-search/

https://github.blog/2021-12-15-a-brief-history-of-code-search-at-github/

https://www.cockroachlabs.com/docs/stable/trigram-indexes

https://en.wikipedia.org/wiki/Compressed_suffix_array

https://github.com/bvaughn/js-search

https://en.wikipedia.org/wiki/Compressed_suffix_array

https://news.ycombinator.com/item?id=28237556 and https://about.sourcegraph.com/blog/zoekt-memory-optimizations-for-sourcegraph-cloud

https://www.sciencedirect.com/science/article/abs/pii/S0020025513003472

https://www.researchgate.net/profile/Derrick-Coetzee/publication/221615072_Tinylex_Static_n-gram_index_pruning_with_perfect_recall/links/0912f50637b4b56d5c000000/Tinylex-Static-n-gram-index-pruning-with-perfect-recall.pdf

Memory management

Optimal Resizable Arrays, Tarjan and Zwick

Resizable Arrays in Optimal Time and Space, Brodnik et. al.

Clone this wiki locally