Better normalization cache #738

masylum · 2024-06-17T10:20:17Z

The key seems to be too specific, especially because it includes the prop,
which makes it redundant to cache tokens that are found in different props.
The goal of that cache seems to be to trade memory for time, but right now
it stores equal computations under different keys, which is inefficient.
The only thing the prop is needed for is the `stemmerSkipProperties`.
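
To illustrate the point, here is a minimal sketch rather than the actual Orama code. `createNormalizer`, `toyStemmer`, and `keyIncludesProp` are hypothetical names made up for this example, and the normalization step is reduced to lowercasing plus a toy stemmer. It compares a key that includes the prop with a key that only records whether stemming applies to that prop.

```typescript
// Hypothetical sketch: none of these names are taken from the Orama source.
type Stemmer = (token: string) => string;

// Toy stemmer for illustration only: strips one trailing "s".
const toyStemmer: Stemmer = (token) =>
  token.endsWith("s") ? token.slice(0, -1) : token;

interface NormalizerOptions {
  language: string;
  stemmer: Stemmer;
  stemmerSkipProperties: Set<string>;
  // true: key contains the prop; false: key only records whether stemming applies.
  keyIncludesProp: boolean;
}

function createNormalizer(options: NormalizerOptions) {
  const cache = new Map<string, string>();
  let computations = 0;

  function normalize(prop: string, token: string): string {
    // The prop only matters for deciding whether to stem.
    const shouldStem = !options.stemmerSkipProperties.has(prop);
    const key = options.keyIncludesProp
      ? `${options.language}:${prop}:${token}`
      : `${options.language}:${shouldStem ? "stem" : "raw"}:${token}`;

    const cached = cache.get(key);
    if (cached !== undefined) return cached;

    // Cache miss: do the (simplified) normalization work and store it.
    computations++;
    const lowered = token.toLowerCase();
    const normalized = shouldStem ? options.stemmer(lowered) : lowered;
    cache.set(key, normalized);
    return normalized;
  }

  return {
    normalize,
    stats: () => ({ entries: cache.size, computations }),
  };
}

const base = {
  language: "english",
  stemmer: toyStemmer,
  stemmerSkipProperties: new Set(["sku"]),
};
const perProp = createNormalizer({ ...base, keyIncludesProp: true });
const shared = createNormalizer({ ...base, keyIncludesProp: false });

// Normalize the same token in three props, one of which skips stemming.
for (const normalizer of [perProp, shared]) {
  normalizer.normalize("title", "cats");
  normalizer.normalize("description", "cats");
  normalizer.normalize("sku", "cats");
}

console.log(perProp.stats()); // one entry per prop
console.log(shared.stats()); // one stemmed entry plus one unstemmed entry
```

With the prop in the key, the same token is computed and stored once per prop. With the reduced key, props that share the same stemming behavior reuse a single entry, while a prop listed in `stemmerSkipProperties` still gets its own unstemmed result.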

vercel · 2024-06-17T10:20:20Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated (UTC)
orama-docs	🛑 Canceled (Inspect)			Jun 17, 2024 10:23am

micheleriva · 2024-06-28T10:17:31Z

Hi there! Thank you so much for your PR. Were you able to run the tests locally?

masylum · 2024-07-24T20:36:21Z

Hey, not really! I also found another positive side effect of simplifying the normalization cache. Right now, the lookups made here are always cache misses, which makes the highlight plugin pretty slow: https://github.com/askorama/orama/blob/7512ca936ffa1543a0e267ae9e9b3d4187be0bdd/packages/plugin-match-highlight/src/index.ts#L74

masylum · 2024-07-24T20:40:05Z

I can't run tests locally. I get this when running `npm install`:

npm ERR! Unsupported URL Type "workspace:": workspace:*

masylum · 2024-07-24T21:02:01Z

So it looks like the regression was introduced here: https://github.com/askorama/orama/pull/350/files

I'm currently working around it by using my own tokenizer, but it's not ideal.

micheleriva · 2024-07-28T19:32:04Z

> I can't run tests locally, I get this when doing npm install

You should use pnpm! `pnpm install` works just fine :)

masylum force-pushed the normalization-cache branch from 6303b59 to 4ded31a on June 17, 2024 10:21

vercel bot had a problem deploying to Preview on June 17, 2024 10:23 (Failure)
