Add support for lazy word generation #92

tgross35 · 2023-11-25T02:28:51Z

No description provided.

codecov · 2023-11-25T02:31:16Z

Codecov Report

Attention: 14 lines in your changes are missing coverage. Please review.

Comparison is base (0e1f7c0) 77.19% compared to head (fc3fa0e) 77.04%.

Files	Patch %	Lines
zspell/src/dict/rule.rs	57.14%	12 Missing ⚠️
zspell/src/dict/tests_rule.rs	94.44%	2 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main      #92      +/-   ##
==========================================
- Coverage   77.19%   77.04%   -0.16%     
==========================================
  Files          27       27              
  Lines        3464     3502      +38     
==========================================
+ Hits         2674     2698      +24     
- Misses        790      804      +14

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

tgross35 · 2023-11-25T03:05:55Z

I need to take a look at how hunspell does this - we need a way to take the rules and apply them to only entries in the hashmap that were from the dictionary.

Maybe we should iterate hashmap for entries that have Dictionary as source. Then we could change Source::Dict to also hold the Vec<Arc<AfxRule>> of the rules attached to that entry. Then we find these entries and test if our given word matches.

tgross35 · 2023-11-25T03:28:53Z

Hm. Maybe it would be more efficient if instead we stored a vector or hashset of all AffixRules. Then we iterate those rules and apply them, and see if the word exists in the

We might even be able to optimize this if we store it in a vector, dedup it. Then if we sort it by something useful, we could use a binary search to locate applicable rules.

As part of this, it could make sense to reverse AfxRulePattern and AfxRule so patterns point to rules rather than the other way around. Or a bidirectional reference where AfxRulePattern holds Arc<AfxRule> and AfxRule holds a Vec<Weak<AfxRulePattern>>. Or just store the needed data inline, AfxRule doesn't store much aside from the patterns.

Add affix stripping for rules

e2d3211

tgross35 force-pushed the lazy-wordgen branch from 53ea69f to 63b51c8 Compare November 25, 2023 02:30

tgross35 force-pushed the lazy-wordgen branch from 63b51c8 to b981d1b Compare November 25, 2023 02:31

tgross35 force-pushed the lazy-wordgen branch from b981d1b to 51aabc3 Compare November 25, 2023 03:33

Add 'strip_patterns' function

fc3fa0e

tgross35 force-pushed the lazy-wordgen branch from 51aabc3 to fc3fa0e Compare November 25, 2023 03:33

tgross35 mentioned this pull request Jun 13, 2024

Multiple suffix stripping does not work #116

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add support for lazy word generation #92

Add support for lazy word generation #92

tgross35 commented Nov 25, 2023

codecov bot commented Nov 25, 2023 •

edited

Loading

tgross35 commented Nov 25, 2023

tgross35 commented Nov 25, 2023

Add support for lazy word generation #92

Are you sure you want to change the base?

Add support for lazy word generation #92

Conversation

tgross35 commented Nov 25, 2023

codecov bot commented Nov 25, 2023 • edited Loading

Codecov Report

tgross35 commented Nov 25, 2023

tgross35 commented Nov 25, 2023

codecov bot commented Nov 25, 2023 •

edited

Loading