perf: Optimize scalar fast path for `regexp_like` by kumarUjjawal · Pull Request #20354 · apache/datafusion

kumarUjjawal · 2026-02-14T06:00:11Z

Which issue does this PR close?

Part of [EPIC] Optimize performance for slow expressions datafusion-comet#2986

Rationale for this change

regexp_like was converting scalar inputs into single‑element arrays, adding avoidable overhead for constant folding and scalar‑only evaluations.

What changes are included in this PR?

Add a scalar fast path in RegexpLikeFunc::invoke_with_args that evaluates regexp_like directly for scalar inputs
Add benchmark

Type	Before	After	Speedup
regexp_like_scalar_utf8	12.092 µs	10.943 µs	1.10x

Are these changes tested?

Yes

Are there any user-facing changes?

NO

Jefffrey · 2026-02-14T06:35:58Z

datafusion/functions/src/regex/regexplike.rs

                ColumnarValue::Array(a) => Some(a.len()),
            });

-        let is_scalar = len.is_none();


We should take the chance here to do further refactors; for example we can use ColumnarValue::values_to_arrays here

Jefffrey · 2026-02-14T06:36:54Z

datafusion/functions/src/regex/regexplike.rs

+            .iter()
+            .all(|arg| matches!(arg, ColumnarValue::Scalar(_)));
+
+        if is_scalar {


I think we should think in terms of how people may use this function; while this is a fast path for pure scalar inputs, it could be likely that users would have the first argument as an array but the 2nd/3rd arguments as scalar inputs, so would be worth considering a fast path for that too?

perf: Optimize scalar fast path for rlike

b8edddb

github-actions bot added the functions Changes to functions implementation label Feb 14, 2026

Jefffrey reviewed Feb 14, 2026

View reviewed changes

fast path when values are an array and pattern is scalar

f264e6a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: Optimize scalar fast path for `regexp_like`#20354

perf: Optimize scalar fast path for `regexp_like`#20354
kumarUjjawal wants to merge 2 commits intoapache:mainfrom
kumarUjjawal:perf/rlike_scalar_path

kumarUjjawal commented Feb 14, 2026

Uh oh!

Jefffrey Feb 14, 2026

Uh oh!

Jefffrey Feb 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

kumarUjjawal commented Feb 14, 2026

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

Jefffrey Feb 14, 2026

Choose a reason for hiding this comment

Uh oh!

Jefffrey Feb 14, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants