Convert Python objects to C objects for further speedups #1

sylvinus · 2016-02-07T15:36:45Z

It is already done for nesting_limit. Each of the options should be transformed into C objects during init so that _traverse_node() uses as few Python objects as possible.

The gumbocy.html file generated after make cythonize is useful for seeing which lines use Python objects.

What would be the most efficient C type for lookups, to replace the Python sets like attributes_whitelist?

The text was updated successfully, but these errors were encountered:

…ncluding the .cpp file

sylvinus · 2016-07-06T16:11:05Z

Most of the options have been converted to C variables.

There are probably some more optimizations left in the parsing of CSS class names (split in C instead of using Python's re?), but we should do more profiling first to see where the real bottlenecks are.

From my tests, >80% of the time is usually spent in gumbo.parse, not sure what we can do about it but look upstream for the largest speedups.

sylvinus · 2016-07-06T18:58:27Z

This one is also a good candidate for micro-optimization: 8e864c8#diff-51db9a1af8644d65b7f79981d2b0a7c2R62

sylvinus · 2016-07-14T20:10:07Z

A huge general speedup was gained thanks to #8, but it also re-introduced a lot of Python objects in the code.

There are a bunch of places where we go through Python strings for instance just to lowercase them. The attribute values are also stored as a Python dict, but a C++ map would probably be much faster (mostly because it would keep all its values as char*).

sylvinus added enhancement help wanted labels Feb 22, 2016

sylvinus added a commit that referenced this issue Jul 6, 2016

Speed optimizations (#1), first naive benchmarks (#3) and fix #6 by i…

9d640d6

…ncluding the .cpp file

sylvinus changed the title ~~Convert options to C objects for further speedups~~ Convert Python objects to C objects for further speedups Jul 14, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Convert Python objects to C objects for further speedups #1

Convert Python objects to C objects for further speedups #1

sylvinus commented Feb 7, 2016

sylvinus commented Jul 6, 2016 •

edited

Loading

sylvinus commented Jul 6, 2016

sylvinus commented Jul 14, 2016

Convert Python objects to C objects for further speedups #1

Convert Python objects to C objects for further speedups #1

Comments

sylvinus commented Feb 7, 2016

sylvinus commented Jul 6, 2016 • edited Loading

sylvinus commented Jul 6, 2016

sylvinus commented Jul 14, 2016

sylvinus commented Jul 6, 2016 •

edited

Loading