CircularBuffer: Add benchmarks and improve performance #641

vyu · 2020-07-08T02:53:05Z

This PR adds a set of benchmarks for CircularBuffer and also improves its performance.

Here are some PkgBenchmark reports generated from these benchmarks, comparing the new performance against the baseline:

Comparison with baseline on Julia 1.4.
Comparison with baseline on Julia 1.0.
Timings on Julia 1.4, on an N2D instance on GCP.

The benchmark file can be called directly by PkgBenchmark using the script keyword. For example, PkgBenchmark.benchmarkpkg("DataStructures"; script="benchmark/bench_circular_buffer.jl").

I've added review comments to explain the rationale behind some of the changes.

There was some inconsistency in the code over whether to use accessor functions (capacity(cb)) or dot notation (cb.capacity) to read field values. For consistency within the file, I've modified them to use accessor functions. They generate identical code since the functions get inlined.

I haven't worked on append! and fill! yet because improving those is more complicated. I might get to them in the future if this PR goes through.

Thanks!

vyu · 2020-07-08T02:59:09Z

src/circular_buffer.jl

-    return ifelse(idx > n, idx - n, idx)
+    return idx > n ? idx - n : idx


Based on testing, using the ternary operator here rather than the branchless conditional helps the compiler optimize away an allocation when iterating through Iterators.Reverse(cb). This is used for example in foldr.
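
A minimal sketch of the two forms, assuming a hypothetical pair of helpers named `wrap_branchless` and `wrap_ternary` (neither name is from the PR). Both return the same index. The difference is that `ifelse` is an ordinary function call that evaluates both value arguments, while the ternary only evaluates the branch it takes. Which form the compiler optimizes better can vary, so the effect should be confirmed by benchmarking rather than assumed.

```julia
# Hypothetical helpers illustrating the two wrap-around forms from the diff.
# `idx` is assumed to lie in 1:2n, as it would after adding an offset below `n`.
wrap_branchless(idx::Int, n::Int) = ifelse(idx > n, idx - n, idx)
wrap_ternary(idx::Int, n::Int) = idx > n ? idx - n : idx

n = 5
# Collect both results for every index in the assumed range so they can be compared.
results = [(wrap_branchless(i, n), wrap_ternary(i, n)) for i in 1:2n]
```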

vyu · 2020-07-08T02:59:45Z

src/circular_buffer.jl

@@ -18,6 +18,8 @@ end

 CircularBuffer(capacity) = CircularBuffer{Any}(capacity)

+Base.IndexStyle(::Type{<:CircularBuffer}) = IndexLinear()


IndexLinear (as opposed to the default IndexCartesian for AbstractArrays) allows functions to use an optimized Base._mapreduce. This, for example, speeds up sum.
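
As a rough illustration of the declaration, here is a sketch on a toy wrapper type. `ToyVec` is hypothetical and stands in for CircularBuffer; only the `IndexStyle` line mirrors the diff.

```julia
# Toy wrapper type (hypothetical, not the package's CircularBuffer).
struct ToyVec{T} <: AbstractVector{T}
    data::Vector{T}
end

Base.size(v::ToyVec) = size(v.data)
Base.getindex(v::ToyVec, i::Int) = v.data[i]

# Without this line, IndexStyle(ToyVec) falls back to IndexCartesian().
Base.IndexStyle(::Type{<:ToyVec}) = IndexLinear()

v = ToyVec([1, 2, 3, 4])
style = IndexStyle(typeof(v))
total = sum(v)
```

With the trait in place, generic array code can index `v` with plain integers instead of going through Cartesian indices.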

vyu · 2020-07-08T03:01:03Z

src/circular_buffer.jl

-Base.@propagate_inbounds function _buffer_index_checked(cb::CircularBuffer, i::Int)
-    @boundscheck if i < 1 || i > cb.length
-        throw(BoundsError(cb, i))
-    end
-    _buffer_index(cb, i)
-end


This can be collapsed into _buffer_index, which can be simplified without loss of performance. Callers can use @inbounds to skip the bounds check.

vyu · 2020-07-08T03:04:54Z

src/circular_buffer.jl

-@inline Base.@propagate_inbounds function Base.getindex(cb::CircularBuffer, i::Int)
-    cb.buffer[_buffer_index_checked(cb, i)]
+Base.@propagate_inbounds function Base.getindex(cb::CircularBuffer, i::Int)
+    j = _buffer_index(cb, i)
+    @inbounds return cb.buffer[j]
 end


If i is in bounds for cb, then _buffer_index(cb, i) is guaranteed to be in bounds for cb.buffer. We can therefore split this into two lines and annotate only the second with @inbounds.

Also, @inline here is redundant because @propagate_inbounds implies @inline.
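
The pattern discussed in these comments can be sketched on a toy ring buffer. Everything here is an assumption for illustration: `ToyRing` and `_toy_index` are hypothetical names, and the field layout only imitates what is visible in the diffs. The logical index is checked once, and the access into the backing vector is marked `@inbounds` because it cannot go out of range when the logical index is valid.

```julia
# Toy ring buffer (hypothetical; field names imitate those visible in the diffs).
mutable struct ToyRing{T} <: AbstractVector{T}
    buffer::Vector{T}
    first::Int    # position of the logical first element in `buffer`
    length::Int   # number of elements currently stored
end

Base.size(r::ToyRing) = (r.length,)

# Map logical index `i` to a position in `buffer`. The check runs unless
# the caller opted out with `@inbounds`.
Base.@propagate_inbounds function _toy_index(r::ToyRing, i::Int)
    @boundscheck (1 <= i <= r.length) || throw(BoundsError(r, i))
    n = length(r.buffer)
    idx = r.first + i - 1
    return idx > n ? idx - n : idx
end

Base.@propagate_inbounds function Base.getindex(r::ToyRing, i::Int)
    j = _toy_index(r, i)          # checked unless the caller opted out
    @inbounds return r.buffer[j]  # j is in bounds whenever i is
end

r = ToyRing([10, 20, 30, 40], 3, 4)  # logical order: 30, 40, 10, 20
```

A caller that has already validated its indices can write `@inbounds r[i]` inside a function to request that the check be elided. Passing an invalid index under `@inbounds` is then the caller's responsibility.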

vyu · 2020-07-08T03:06:19Z

src/circular_buffer.jl

@@ -154,25 +154,21 @@ end

 Return the number of elements currently in the buffer.
 """
-Base.length(cb::CircularBuffer) = cb.length
-
-Base.eltype(::Type{CircularBuffer{T}}) where T = T


eltype is already defined for the AbstractArray{T} supertype, so this method is redundant.

vyu · 2020-07-08T03:07:19Z

src/circular_buffer.jl

@@ -154,25 +154,21 @@ end

 Return the number of elements currently in the buffer.
 """
-Base.length(cb::CircularBuffer) = cb.length


Base.length(::AbstractArray) is already a generic function that calls size. For consistency with Julia Base, we can define just size instead.
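
A small sketch of the fallbacks these two comments rely on, using a hypothetical `ToyFixed` type that defines only `size` and `getindex`. Both `length` and `eltype` then come from the generic AbstractArray methods.

```julia
# Toy vector type (hypothetical) that defines only `size` and `getindex`.
struct ToyFixed{T} <: AbstractVector{T}
    data::Vector{T}
end

Base.size(v::ToyFixed) = (length(v.data),)
Base.getindex(v::ToyFixed, i::Int) = v.data[i]

v = ToyFixed([1.5, 2.5, 3.5])
len = length(v)   # generic fallback, computed from size(v)
et = eltype(v)    # inherited from the AbstractVector{Float64} supertype
```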

vyu · 2020-07-08T03:08:13Z

src/circular_buffer.jl


 """
    size(cb::CircularBuffer)

 Return a tuple with the size of the buffer.
 """
-Base.size(cb::CircularBuffer) = (length(cb),)
-
-Base.convert(::Type{Array}, cb::CircularBuffer{T}) where {T} = T[x for x in cb]


This convert was slower than the generic convert.

vyu · 2020-07-08T03:09:59Z

src/circular_buffer.jl

-Base.@propagate_inbounds function Base.last(cb::CircularBuffer)
-    @boundscheck (cb.length == 0) && throw(BoundsError(cb, 1))
-    return cb.buffer[_buffer_index(cb, cb.length)]
-end


This was no faster than the generic last function. The situation is different for first because the generic first does not know about cb.first, so the specialized first here is a shortcut. There is no such shortcut for last because we don't track the last index directly.

Update: I was mistaken. I've reverted the complete removal and replaced the method with a shorter definition instead. Even though the benchmarks showed no benefit, a specialized method might still help on some architectures or in some code, because it allows the bounds check to be elided by @inbounds, whereas the generic last does not.

eulerkochy

Great work!! A few minor changes spotted at first glance!

eulerkochy · 2020-07-08T10:40:00Z

benchmark/bench_circular_buffer.jl

+    cap = capacity(cb)
+    cb.length = cap
+    total = 0
+    for _ in 1:cap


In this codebase, the convention for iterating over ranges is for i = 1:num. Also, I know it's trivial, but consider replacing _ with a named variable. It appears in a few other places too.

eulerkochy · 2020-07-08T10:40:33Z

benchmark/bench_circular_buffer.jl

+    cap = capacity(cb)
+    cb.length = cap
+    total = 0
+    for _ in 1:cap


vyu · 2020-07-08T11:47:26Z

Thanks! I've modified lines with range iteration to follow your suggestion.

vyu · 2020-07-10T02:38:04Z

I just noticed that setindex!(cb::CircularBuffer, item, i) returns cb. This has been the case since the beginning (when CircularBuffer was added in v0.4.3) and this PR retains that behavior. But per setindex! in Julia Base, it should really return item, not cb. I think this is a bug that ought to be fixed, but fixing it is potentially breaking if there is code out there that depends on this unconventional behavior. I can open a separate PR for it.

oxinabox

Sorry for losing track of this. This PR looks good.
Can it be rebased? Then we can merge it.

vyu added 3 commits July 7, 2020 09:54

CircularBuffer: Add benchmarks

287e381

CircularBuffer: Improve performance

78e9535

Correct typo

df3cc79

vyu commented Jul 8, 2020

vyu marked this pull request as ready for review July 8, 2020 03:21

Fix indent

454ae83

eulerkochy reviewed Jul 8, 2020

Fix to follow convention for range iteration

12e0cb6

vyu added 2 commits July 8, 2020 12:32

Fix typo in code

abbab75

Remove redundant isempty method

aec557e

Re-add Base.last(::CircularBuffer) method

05f6df6

Previous removal was in error because adding this definition allows the boundscheck to be elided by `@inbounds`.

vyu force-pushed the vyu/cb-performance branch from 23890ba to 05f6df6 Compare July 11, 2020 03:21

vyu mentioned this pull request Jun 16, 2022

Improve middle(::AbstractRange) performance JuliaStats/Statistics.jl#116

Merged

laborg mentioned this pull request Oct 5, 2023

[bug] Type assert of filter fails for empty CircularBuffer #810

Open

oxinabox approved these changes Oct 10, 2023
