TypeError: Wrong number or type of arguments for overloaded function 'Rule_extract' #27

impredicative · 2023-11-26T01:26:23Z

With Python 3.12, hext.Rule('').extract('') gives the error:

  File "python3.12/site-packages/hext/__init__.py", line 139, in extract
    return _hext.Rule_extract(self, html, max_searches)
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: Wrong number or type of arguments for overloaded function 'Rule_extract'.
  Possible C/C++ prototypes are:
    Rule::extract(Html const &,std::uint64_t) const
    Rule::extract(Html const &) const

I am of course also getting this error with a more real-life example. At this time I cannot use hext for anything new.

The text was updated successfully, but these errors were encountered:

thomastrapp · 2023-11-26T10:33:25Z

Rule.extract does not accept a string, only hext.Html.

import hext
rule = hext.Rule("<a href:link/>")
# (1) Ok, the argument for extract is of type hext.Html
results = rule.extract(hext.Html("""<a href="b"></a>"""))
# (2) Error, the argument for extract is of type string:
results = rule.extract("""<a href="b"></a>""")

If this was possible in a previous version of Hext (≥1.0.0), please let me know, as this would be a breaking change in the API.

The error message is unfortunately very unhelpful, and I will fix that in a future release with #28.

Thank you for creating this issue.

brandonrobertz · 2023-11-26T17:14:34Z

If this was possible in a previous version of Hext (≥1.0.0), please let me know, as this would be a breaking change in the API.

This was not possible in 0.8 (just re-tested to be sure). AFAIK you always needed to pass a Html object.

impredicative · 2023-11-26T17:27:06Z

Yes, it had been a while since I used hext, and I misremembered. Indeed hext.Rule('').extract(hext.Html('')) is what works.

As an aside, I think there really needs to exist at least one comprehensive page (or tabs) per supported programming language in the documentation. It would contain various necessary examples to train the user to use hext effectively.

impredicative · 2023-11-26T17:28:51Z

As an example, please see the organization and tabs here (one tab per supported language).

thomastrapp · 2023-11-27T08:26:00Z

As an aside, I think there really needs to exist at least one comprehensive page (or tabs) per supported programming language in the documentation. It would contain various necessary examples to train the user to use hext effectively.

I agree and have added another issue for this: html-extract/html-extract.github.io#4.

thomastrapp self-assigned this Nov 26, 2023

thomastrapp added the python label Nov 26, 2023

impredicative closed this as completed Nov 26, 2023

thomastrapp mentioned this issue Nov 27, 2023

Website: Improve documentation for language bindings html-extract/html-extract.github.io#4

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

TypeError: Wrong number or type of arguments for overloaded function 'Rule_extract' #27

TypeError: Wrong number or type of arguments for overloaded function 'Rule_extract' #27

impredicative commented Nov 26, 2023

thomastrapp commented Nov 26, 2023

Uh oh!

brandonrobertz commented Nov 26, 2023

Uh oh!

impredicative commented Nov 26, 2023 •

edited

Loading

Uh oh!

impredicative commented Nov 26, 2023 •

edited

Loading

Uh oh!

thomastrapp commented Nov 27, 2023

Uh oh!

TypeError: Wrong number or type of arguments for overloaded function 'Rule_extract' #27

TypeError: Wrong number or type of arguments for overloaded function 'Rule_extract' #27

Comments

impredicative commented Nov 26, 2023

thomastrapp commented Nov 26, 2023

Uh oh!

brandonrobertz commented Nov 26, 2023

Uh oh!

impredicative commented Nov 26, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

impredicative commented Nov 26, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

thomastrapp commented Nov 27, 2023

Uh oh!

impredicative commented Nov 26, 2023 •

edited

Loading

impredicative commented Nov 26, 2023 •

edited

Loading