Which mode/method to use that is agnostic to word orders? #51

skwskwskwskw · 2022-11-16T09:21:38Z

Hi,

Would like to understand which matching algo/model is agnostic to word orders? I realised for instance Levenshtein Distance might be affected by word orders.

Thanks

MaartenGr · 2022-11-17T07:19:04Z

You can use TF-IDF for that since it typically only considers n-grams on a token level. Due to its bag-of-words like approach, it does not take the order of n-grams into account and therefore also not the order of words.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Which mode/method to use that is agnostic to word orders? #51

Which mode/method to use that is agnostic to word orders? #51

skwskwskwskw commented Nov 16, 2022

MaartenGr commented Nov 17, 2022

Which mode/method to use that is agnostic to word orders? #51

Which mode/method to use that is agnostic to word orders? #51

Comments

skwskwskwskw commented Nov 16, 2022

MaartenGr commented Nov 17, 2022