Merriam Webster Backend, Antonyms, and Result Caching #48

Open

ahayman wants to merge 42 commits into Ron89:testing from ahayman:master

ahayman commented Oct 2, 2020

Added the Merriam-Webster Backend
Added support for antonym queries (currently only supported by Merriam-Webster Backend)
Added Caching Logic

Note: I recommend squashing the commits. I was playing around a lot with different devices, and so there's a ton of "work in progress" commits.

Aaron Hayman added 30 commits

September 16, 2020 11:08


          Added DictionaryAPI (Merriam-Webster) Lookup

90060f5


          Added back query_result_trunc

7e173f7


          Fixed https error in DictionaryAPI

f12da71


          Tentative output format change

e44e54d


          Tentative output format change

680c834


          Removed Test Files

f66545c


          Fixed Dictionary API Parser errors on missing values

23ba143


          Fixed word guessing for unknown words in Dictionary API

c34f47c


          Fixed - Dictionary API parser returned wrong code for unknown word

6794ee3


          Fixed - Dictionary API parser did not return full correction list

b379849


          Fixed - Dictionary API parser returned wrong code for unknown word

849ec76


          Updated Readme with Back end info

3bd32ad


          Added by_def format to dictionary api backend

cf8d392


          dictionary api - removed extra bracked (list) for by_def format

2756b14


          dictionary api - default return for to break up synonyms by def

7fbeb30


          Updated dictionary api parser to handle multiple result sets

3657aef


          Updated dictionary api parser to handle multiple result sets

d95f8f6


          Updated Core code to handle antonyms using a separate list

5e3c2fd


          Updated Core code to handle antonyms using a separate list

fdd1599


          Updated Core code to handle antonyms using a separate list

fda15bf


          Attempt to fix antonyms not showing

064f0e7


          Attempt to fix antonyms not showing


          Attempt to fix antonyms not showing

1f03c87


          Attempt to fix antonyms not showing

d7d3433


          Attempt to fix antonyms not showing

247b7e5


          Attempt to fix antonyms not showing

ccb538a


          Attempt to fix antonyms not showing

7668f06


          Attempt to fix antonyms not showing

ba8df57


          Attempt to fix missing query_type variable

c9c45df


          Attempt to fix missing query_type variable

23f54b5

Aaron Hayman added 11 commits

September 24, 2020 10:34


          Attempt to fix missing query_type variable

68daff5


          Attempt to fix missing query_type variable

207ce40


          Updated Readme to reflect antonyms

42cdc99


          Edits

eae6023


          Merge branch 'master' of https://github.com/ahayman/thesaurus_query.vim

bb75af9


          Caching, Rename, cleanup

8582be4

 - Caching logic is finished
 - Renamed dictionary_api backend to merriam_webster for clarity
 - Cleaned up docs and code


          Fixed exceptions in case of missing or incorrect api key for merriam-webster

7cc2f0a


          Fixes for newer version of Python/Vim

d25162d


          Revert "Fixed exceptions in case of missing or incorrect api key for merriam-webster"

This reverts commit 7cc2f0a.


          Fixed iOS need for explicit SSL (not required for later version of Python)

6a31f25


          forgot to add SSL import

c47f860

Ron89 requested changes


Owner

Ron89 left a comment

Hey, sorry for the extremely long delay. A lot has been changing in my life over the past year, so I neglected the repos I should have been maintaining on GitHub. And look what I missed out on here...

Thank you for this large and helpful PR. It is very nicely written and meaningfully broadens the framework. I would very much like to merge it into my repo. I did notice some logic that may affect other features, so I made some suggestions, and I believe they need to be addressed before I can pull these changes in. Could you help patch them up? Or, if you think differently about any of these suggestions, please comment as well and we can discuss further.

autoload/thesaurus_query/thesaurus_query.py

+                      synonym_list=[]
+                      # Check cache first
+                      if not self.cached_used and cache_results > -1:

Owner

Ron89 Jul 8, 2021

I have a question: if the cache is used unconditionally, how does the user ask the thesaurus engine to query a different backend (the next backend in line, for example)? Wouldn't the logic get stuck and always return from this cache?

Owner

Ron89 Jul 8, 2021

I am thinking of using a flag, firstQuery, that gets set to true in session_init(). The cache is only used when firstQuery is true, and right before we run the query we set self.firstQuery to false. That way, the next time a query is triggered for any reason, the cache won't be touched. What do you think?
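A minimal sketch of the flag idea, not the plugin's actual code. The class `QuerySession`, its `cache` dict, the `backend_calls` counter, and `_query_backends` are hypothetical stand-ins for illustration; only `session_init()` and the flag itself (written here as `first_query`) come from the comment above.

```python
class QuerySession:
    """Hypothetical stand-in for the plugin's query handler."""

    def __init__(self):
        self.cache = {}          # word -> cached result list
        self.first_query = False
        self.backend_calls = 0   # counts simulated backend lookups

    def session_init(self):
        # A new session starts: allow exactly one cache lookup.
        self.first_query = True

    def _query_backends(self, word):
        # Placeholder for the real backend loop (assumption).
        self.backend_calls += 1
        return [word + "_synonym"]

    def query(self, word):
        use_cache = self.first_query
        # Clear the flag before looking anything up, so a repeated
        # query in the same session bypasses the cache.
        self.first_query = False
        if use_cache and word in self.cache:
            return self.cache[word]
        result = self._query_backends(word)
        self.cache[word] = result
        return result
```

With this shape, a repeated query in the same session (for example, to move on to the next backend) always reaches the backends, while the first query of a new session can still be answered from the cache.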

autoload/thesaurus_query/thesaurus_query.py

		for query_backend_curr in to_use_list: # query each of the backend list till found
		specified_language = get_variable("tq_language", ['en'])

Owner

Ron89 Jul 8, 2021

Hmm, originally I read specified_language here because I wanted to accommodate the case where the user changes the preferred language between queries.
If the specified language is read only when the plugin is initialized, later customizations made by the user won't take effect until the program restarts.

autoload/thesaurus_query/thesaurus_query.py

-                          [state, synonym_list]=self.query_backends[query_backend_curr].query(word)
+                          query_result = query_backend.query(word)
+                          if (len(query_result) >= 3):
+                              [state, synonym_list, antonym_list] = query_result

Owner

Ron89 Jul 8, 2021

I understand that in most cases, as long as the word exists, it should have both synonyms and antonyms. But could some word-list-based engine contain a word's antonyms but not its synonyms, or vice versa? Do you think it would be more prudent to add the following logic:

if state==0:
    if query_type==0 and not synonym_list:
        state=1
    if query_type==1 and not antonym_list:
        state=1
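The check could be pulled into a small helper, roughly as sketched below. The function name `effective_state` and the two constants are hypothetical; the sketch assumes that state 0 means "found", state 1 means "no result", and query_type 0/1 mean synonym/antonym queries, as the snippet above implies.

```python
SYNONYM_QUERY = 0  # query_type values as used in this review (assumed meaning)
ANTONYM_QUERY = 1


def effective_state(state, query_type, synonym_list, antonym_list):
    """Downgrade a successful state to 1 when the requested list is empty."""
    if state != 0:
        # Failures and "not found" are passed through unchanged.
        return state
    wanted = synonym_list if query_type == SYNONYM_QUERY else antonym_list
    return 0 if wanted else 1
```

For instance, `effective_state(0, ANTONYM_QUERY, ["fine"], [])` yields 1, so the caller would move on to the next backend instead of showing an empty antonym list.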

autoload/thesaurus_query/thesaurus_query.py

+                          else:
+                              [state, synonym_list] = query_result
+                              antonym_list = []
+                              if query_type == 1:

Owner

Ron89 Jul 8, 2021

This condition should be contained within a larger clause:

if state!=-1:

Otherwise it will fail to mark a non-functional backend as a bad backend, because it will fake a normal but empty state.
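A rough sketch of the guarded unpacking follows. The helper name `unpack_result` is hypothetical, and because the body of the `if query_type == 1:` branch is not visible in the diff excerpt, the sketch assumes it reports "no result" (state 1) for an antonym query against a backend that only returns synonyms.

```python
def unpack_result(query_result, query_type):
    """Normalize a backend result to (state, synonym_list, antonym_list)."""
    if len(query_result) >= 3:
        state, synonym_list, antonym_list = query_result[:3]
    else:
        state, synonym_list = query_result
        antonym_list = []
        # Only rewrite the state when the backend itself worked;
        # a state of -1 must survive so the backend is marked as bad.
        if state != -1 and query_type == 1:
            state = 1  # assumption: antonyms unavailable -> "no result"
    return state, synonym_list, antonym_list
```

The point of the guard is that `unpack_result([-1, []], 1)` keeps the -1 state rather than turning a broken backend into an ordinary empty result.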

autoload/thesaurus_query/thesaurus_query.py

                               continue
                           if state == 0:
+                              # Update caches
+                              update_cache(self.antonym_cache, word, antonym_list, query_backend.identifier)

Owner

Ron89 Jul 8, 2021

Since our state determination is based on the type of related word we are searching for, shouldn't our cache saving also be based on the current query? Otherwise, if the current search is for synonyms from a backend that does not support antonyms, it will generate an empty cache entry for the antonyms here.
Let's do

if state==0:
    if query_type==0:
        update_cache(self.synonym_cache, word, synonym_list, query_backend.identifier)
    else:
        update_cache(self.antonym_cache, word, antonym_list, query_backend.identifier)
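To illustrate the effect, here is a small self-contained sketch. The `update_cache` below is a hypothetical stand-in with the same argument order as the call in the PR (its real implementation is not shown here), `save_result` is an invented wrapper, and query_type 0 is assumed to mean a synonym query, as in the earlier comment.

```python
def update_cache(cache, word, word_list, backend_id):
    # Hypothetical stand-in: store the list together with its source backend.
    cache[word] = (backend_id, word_list)


def save_result(synonym_cache, antonym_cache, state, query_type,
                word, synonym_list, antonym_list, backend_id):
    """Cache only the list that matches the current query type."""
    if state != 0:
        return
    if query_type == 0:
        update_cache(synonym_cache, word, synonym_list, backend_id)
    else:
        update_cache(antonym_cache, word, antonym_list, backend_id)
```

A synonym query against a backend without antonym support then leaves the antonym cache untouched, so a later antonym query is not answered with a cached empty list.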

autoload/thesaurus_query/thesaurus_query.py

                       else:
-                          self.last_valid_result=synonym_list
+                          self.last_valid_synonyms=synonym_list

Owner

Ron89 Jul 8, 2021

Let's fill in only one of the lists with a valid result at a time.

if query_type==0:
    self.last_valid_synonyms=synonym_list
else:
    self.last_valid_antonyms=antonym_list


          Removed SSL in urlopen, which was causing issues

c39f4aa
