Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix url regex speed #16

Merged
merged 2 commits into from
May 19, 2020
Merged

Fix url regex speed #16

merged 2 commits into from
May 19, 2020

Conversation

GjjvdBurg
Copy link
Collaborator

This is a follow-up on issue #13, but regarding the URL regex. The URL regex can similarly end up in catastrophic backtracking, and the proposed fix by @kaskawu can speed up detection significantly (as for example in the file mention in #15).

(this should've been addressed in #14)

The url regex also leads to catastrophic backtracking for the
file in #15, which causes a massive slowdown.
@GjjvdBurg GjjvdBurg merged commit 2806a3a into master May 19, 2020
GjjvdBurg added a commit that referenced this pull request Jul 9, 2020
This was still using the old url regex, which was updated in
#16. With this change filter_urls uses the same url regex
as the one used for type detection.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant