Feature request: print html tag of the node (not including its children) for lexbor engine #64

gshashank84 · 2022-08-18T05:24:24Z

To check whether two nodes are equal, we compare their HTML tags. But the output also includes the tags of the node's children. Please add a method that prints the HTML string of a single node only (i.e. not including its children).
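To illustrate the kind of output being requested, here is a minimal sketch that builds only a node's own start tag from its tag name and attributes. It assumes the node exposes `tag` (a string) and `attributes` (a dict whose values may be `None` for valueless attributes). `opening_tag` and `FakeNode` are hypothetical names used only for this example and are not part of the library.

```python
from dataclasses import dataclass, field
from html import escape
from typing import Dict, Optional


def opening_tag(node) -> str:
    """Serialize only the node's own start tag, ignoring its children.

    Assumes `node.tag` is a str and `node.attributes` is a dict
    mapping attribute names to str or None (valueless attribute).
    """
    parts = [node.tag]
    # Sort attributes so the result is stable when used for comparison.
    for name, value in sorted(node.attributes.items()):
        if value is None:
            parts.append(name)
        else:
            parts.append(f'{name}="{escape(value, quote=True)}"')
    return "<" + " ".join(parts) + ">"


@dataclass
class FakeNode:
    """Hypothetical stand-in for a parsed node, for demonstration only."""
    tag: str
    attributes: Dict[str, Optional[str]] = field(default_factory=dict)


example = FakeNode("div", {"id": "main", "hidden": None})
print(opening_tag(example))  # <div hidden id="main">
```

Sorting the attributes changes their original order, which is fine for an equality check but would not reproduce the source markup exactly.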

gshashank84 · 2022-08-18T13:21:16Z

Also, can we make the __eq__ method of LexborNode more performant? Because the equality operator internally compares the HTML of the nodes, comparing two nodes in a large DOM tree takes a very long time.

rushter · 2022-08-19T09:55:55Z

> Also can we make the __eq__ method of LexborNode performant? As we are internally comparing html of the nodes for the equality operator, it takes huge computation time for comparing two nodes of a big tree DOM.

Yeah, this is an old problem, but there is no easy way to solve it, since the lexbor/modest engines do not have internal IDs or anything similar that would let us simply compare nodes. We could compare their locations in memory, but we would still need to perform the full check in case of a miss. It's also possible to hit a race condition when using the memory address as the main way to check equality.
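The trade-off described above could be sketched roughly as follows: an address comparison acts as a fast path, and the expensive HTML comparison remains as the fallback on a miss. `NodeHandle`, `ptr`, and `html` are hypothetical names for illustration. This is not how the library implements `__eq__`, and it does not address the race-condition concern.

```python
from dataclasses import dataclass


@dataclass(eq=False)
class NodeHandle:
    """Hypothetical wrapper around an engine node, for illustration only."""
    ptr: int   # assumed: address of the underlying C node
    html: str  # assumed: serialized HTML of the node's subtree

    def __eq__(self, other):
        if not isinstance(other, NodeHandle):
            return NotImplemented
        # Fast path: same underlying node in memory.
        if self.ptr == other.ptr:
            return True
        # Miss: fall back to the full (slow) HTML comparison.
        return self.html == other.html


a = NodeHandle(ptr=0x1000, html="<p>x</p>")
b = NodeHandle(ptr=0x1000, html="<p>x</p>")
c = NodeHandle(ptr=0x2000, html="<p>x</p>")
print(a == b, a == c)  # True True
```

The fast path only helps when both handles point at the same node. Two distinct nodes with identical markup still pay for the full comparison, which is the "big check in case of a miss" mentioned above.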
