Detect quadratic patterns #23

RunDevelopment · 2020-10-12T15:54:23Z

Some seemingly innocent patterns can have a run time of O(n^2). This can be a vulnerability, as pointed out here and further explained here.

"Even extremely simple regexes like /a+b/ show this O(n^2) behavior for inputs like 'a'*n." ('a'*n means n-many a characters.)
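To make the quoted claim concrete, here is a minimal sketch that counts the steps a naive backtracking search would take for /a+b/. It is a hand-written simulation, not the algorithm of any particular regex engine, and the name `countSteps` is made up for this illustration. It assumes the engine retries the pattern at every start position and gives back one `a` at a time while looking for `b`.

```typescript
// Hypothetical helper: simulates a naive backtracking search for /a+b/
// and counts how many character checks it performs.
function countSteps(input: string): { matched: boolean; steps: number } {
  let steps = 0;
  // An unanchored search retries the pattern at every start position.
  for (let start = 0; start < input.length; start++) {
    // a+ : greedily consume as many 'a' characters as possible.
    let end = start;
    while (end < input.length) {
      steps++;
      if (input[end] !== "a") break;
      end++;
    }
    // Backtrack: give back one 'a' at a time (keeping at least one)
    // and try to match 'b' at each position. A check at the end of
    // the input also counts as a step.
    for (let pos = end; pos > start; pos--) {
      steps++;
      if (input[pos] === "b") return { matched: true, steps };
    }
  }
  return { matched: false, steps };
}

// Doubling n roughly quadruples the step count.
for (const n of [100, 200, 400]) {
  console.log(n, countSteps("a".repeat(n)).steps);
}
```

Under these assumptions, an input of n `a` characters costs 2(n - i) steps at start position i, which adds up to n(n + 1) steps in total. Real engines may apply optimizations that change the constants or avoid the blow-up for this specific pattern.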

The purpose of this rule is to detect these patterns.

From what I've seen, the general rule seems to be: If there exists some set of paths AB*C in the regex R such that x = (L(A) ∩ L(B*)) \ ({ε} ∪ L(C)) is not the empty set, then R will take Ω(n^2) many steps to reject a word w ∈ x^n \ L(R).

Please note the Omega in the time complexity bound. This is not a typo: n^2 is only a lower bound, because the backtracking algorithm might actually take more than O(n) steps to reject a single suffix of the input string.
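As a worked instance of the rule above, take R = /a+b/. This assumes a+ is read as a followed by a*, which gives the path AB*C with A = a, B = a, C = b. That decomposition is one interpretation for illustration, not something the rule statement spells out.

```latex
\begin{align*}
R &= a^{+}b = a\,a^{*}\,b, \qquad A = a,\; B = a,\; C = b \\
L(A) \cap L(B^{*}) &= \{a\} \cap \{a^{k} \mid k \ge 0\} = \{a\} \\
x &= \{a\} \setminus \bigl(\{\varepsilon\} \cup \{b\}\bigr) = \{a\} \neq \emptyset \\
w &= a^{n} \in x^{n} \setminus L(R) \quad \text{(no word of } L(R) \text{ consists only of } a\text{)}
\end{align*}
```

So the rule flags /a+b/, and the rejecting input it predicts is exactly the 'a'*n from the quote.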


RunDevelopment · 2021-05-07T18:31:45Z

Already covered by ota-meshi/eslint-plugin-regexp#159.

RunDevelopment added the "rule" and "new rule" labels on Oct 12, 2020

RunDevelopment closed this as completed May 7, 2021
