You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Another issue:
If I call lookup(m-CAT) (for mephedrone), I get
original_word class category synonym
1 m-cat stimulant synthetic cathinone m-cat
However, when we put this word in a sentence:
> lookup(parse("try some m-CAT my friend"))
original_word class category synonym
1 friend narcotic (opioid) fentanyl friend
The parse() function removes m-CAT because of the hyphen. Additionally, even if parse() just removed the hyphen and left mCAT or mcat, lookup(mcat) returns NAs. In fact, there are 129 slang terms which contain - in their strings. Our current framework for parse() would fail to match these drugs. Code to find the 129 drugs mentioned:
Consider the following example:
The regular expression fails to remove these quotation marks before matching against stop words. Thus, "a" is still included in the output of
parse()
.The text was updated successfully, but these errors were encountered: