Provide peekInt8 to reduce allocations #1373

shanson7 · 2019-05-13T20:03:33Z

Background:
In a project I use, sarama produces by far the most ongoing allocations (mainly, I think, because we have lots of small messages), which negatively impacts our GC stats. I recently went through and tried to reduce them where I could, and have been quite successful: sarama no longer shows up in the pprof top10 by alloc_objects. This is the simplest change I have made so far. The others follow the pattern from #1161 and are more invasive.

I benchmarked this change by consuming 5 million messages and then dumping out a heap profile.
Before the change:

Showing nodes accounting for 30777165, 99.24% of 31012468 total
Dropped 6 nodes (cum <= 155062)
Showing top 10 nodes out of 19
      flat  flat%   sum%        cum   cum%
  10362760 33.41% 33.41%   25777082 83.12%  github.com/Shopify/sarama.(*MessageBlock).decode
   5172719 16.68% 50.09%   25887933 83.48%  github.com/Shopify/sarama.(*MessageSet).decode
   5123816 16.52% 66.62%    5123816 16.52%  github.com/Shopify/sarama.(*partitionConsumer).parseMessages
   5087542 16.40% 83.02%    5087542 16.40%  github.com/Shopify/sarama.(*realDecoder).peek

Here you can see that peek is responsible for 16.4% of the objects allocated, a little over one per message (which makes sense, because of the record batching in Kafka).

After the change:

Showing nodes accounting for 25577632, 99.73% of 25647802 total

Peek doesn't show up at all and we allocate 25.6 million objects vs 31 million, which is the expected reduction.
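To illustrate why the change helps, here is a simplified sketch of the two approaches. It is not the actual sarama code: `sliceDecoder`, `decoder`, and `errInsufficientData` are hypothetical stand-ins, and only the method names `peek` and `peekInt8` come from this PR. The idea is that `peek` has to build a new decoder object to hand back through an interface, while `peekInt8` reads the byte straight out of the underlying slice.

```go
package main

import (
	"errors"
	"fmt"
	"testing"
)

// errInsufficientData is a stand-in error for this sketch.
var errInsufficientData = errors.New("insufficient data to decode")

// decoder is a hypothetical, stripped-down decoder interface.
type decoder interface {
	getInt8() (int8, error)
}

// sliceDecoder is a hypothetical decoder over a byte slice.
type sliceDecoder struct {
	raw []byte
	off int
}

func (d *sliceDecoder) remaining() int { return len(d.raw) - d.off }

// getInt8 reads one byte and advances the offset.
func (d *sliceDecoder) getInt8() (int8, error) {
	if d.remaining() < 1 {
		return -1, errInsufficientData
	}
	v := int8(d.raw[d.off])
	d.off++
	return v, nil
}

// peek returns a new decoder over a window of the data without advancing
// the offset. The new object is returned through an interface, so it will
// typically be heap-allocated.
func (d *sliceDecoder) peek(offset, length int) (decoder, error) {
	if offset < 0 || length < 0 || d.remaining() < offset+length {
		return nil, errInsufficientData
	}
	start := d.off + offset
	return &sliceDecoder{raw: d.raw[start : start+length]}, nil
}

// peekInt8 reads a single byte at the given offset without advancing the
// offset and without creating an intermediate decoder.
func (d *sliceDecoder) peekInt8(offset int) (int8, error) {
	const byteLen = 1
	if offset < 0 || d.remaining() < offset+byteLen {
		return -1, errInsufficientData
	}
	return int8(d.raw[d.off+offset]), nil
}

// sink keeps results alive so the calls are not optimized away.
var sink int8

func main() {
	d := &sliceDecoder{raw: []byte{0, 0, 0, 2, 7}}

	viaPeek := testing.AllocsPerRun(1000, func() {
		sub, err := d.peek(4, 1)
		if err != nil {
			panic(err)
		}
		v, err := sub.getInt8()
		if err != nil {
			panic(err)
		}
		sink = v
	})

	viaPeekInt8 := testing.AllocsPerRun(1000, func() {
		v, err := d.peekInt8(4)
		if err != nil {
			panic(err)
		}
		sink = v
	})

	fmt.Printf("peek + getInt8: %.0f allocs/op, peekInt8: %.0f allocs/op\n", viaPeek, viaPeekInt8)
}
```

The exact allocation counts depend on the compiler's escape analysis, so the numbers printed by this sketch are only indicative; the heap profiles above are the real measurement.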

real_decoder.go

varun06 · 2019-05-14T12:24:24Z

packet_decoder.go

@@ -27,6 +27,7 @@ type packetDecoder interface {
 	remaining() int
 	getSubset(length int) (packetDecoder, error)
 	peek(offset, length int) (packetDecoder, error) // similar to getSubset, but it doesn't advance the offset


If this change works, do we still need the peek method?

There are no existing uses of peek. Should I remove it?

Let's wait. @bai, do you have any input?

shanson7 · 2019-05-20T21:29:22Z

Any update?

bai · 2019-05-22T06:09:04Z

Thanks for your contribution!

Provide peekInt8 to reduce allocations

4ed1ab2

varun06 reviewed May 14, 2019

Remove useless tmp var

1d23237

shanson7 force-pushed the peekint8 branch from c13e9fc to 1d23237 Compare May 14, 2019 15:00

Remove use of magic number

dec71e9

bai merged commit 72033d7 into IBM:master May 22, 2019

shanson7 deleted the peekint8 branch May 23, 2019 18:01

shanson7 mentioned this pull request May 30, 2019

Pool internal objects allocated per message #1385

Merged

shanson7 mentioned this pull request Jul 5, 2019

Update Shopify/sarama from v1.19.0 to v1.23.0 grafana/metrictank#1383

Merged
