We have some issues with the free-text entries of lookup_df$synonym. For example, some drug names have a trailing space (e.g. "blanca ", "monos ", "nieve "). I have noticed that most of these are Spanish words.
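A quick way to find and strip the padded entries, as a minimal sketch: the `lookup_df` below is a toy stand-in I made up for illustration (only the `synonym` column name comes from this issue), and `trimws()` is base R.

```r
# Toy stand-in for lookup_df; the real table has more rows and columns.
lookup_df <- data.frame(
  synonym = c("blanca ", "monos ", "nieve ", "m-cat"),
  stringsAsFactors = FALSE
)

# Flag synonyms that carry leading or trailing whitespace.
has_padding <- lookup_df$synonym != trimws(lookup_df$synonym)
padded <- lookup_df$synonym[has_padding]

# Strip the padding in place.
lookup_df$synonym <- trimws(lookup_df$synonym)
```

Running the flagging step on the real table first would show whether the problem is limited to the Spanish entries.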
There are symbols in some of the drug names, like "c & m", "m-cat", or "el perico ("parrot")". I think parse() removes these symbols, which means that these drug names will never be matched if we call parse() first (which is bad, because this is our recommended workflow).
The string "mixed with" shows up 25 times. Can this pattern, "drug a (mixed with drug b)", be expressed another way?
The word "and" is a stop word, but " and " shows up 40 times. We can't match these drug synonyms either.
There are 20 synonyms that include one or more ".", for example "l.a. ice" or "m.j.". We remove all periods in parse(), and lookup("m j") returns no matches.
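One possible direction for the symbol and period cases, sketched under assumptions: build a comparison key from each synonym with the same kind of cleanup that is applied to user input, and match on the key instead of the raw string. `normalize_key()` is a hypothetical helper, not an existing function in the package, and I am only guessing that its cleanup resembles what parse() does.

```r
# Hypothetical helper (not part of the package): reduce a string to a
# comparison key by lower-casing, collapsing every run of non-alphanumeric
# characters into a single space, and trimming the ends.
normalize_key <- function(x) {
  x <- tolower(x)
  x <- gsub("[^[:alnum:]]+", " ", x)
  trimws(x)
}

synonyms <- c("l.a. ice", "m.j.", "c & m", "m-cat")
keys <- normalize_key(synonyms)

# A cleaned input such as "m j" now lines up with the stored "m.j.".
hit <- synonyms[match(normalize_key("m j"), keys)]
```

The original synonym stays untouched for display; only the key is used for matching. This does not address the " and " stop-word case or the "mixed with" entries.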
@RaymondBalise, @labouz, what do you recommend we do here?