Go speed testing #165

rw · 2015-04-02T18:36:24Z

No description provided.

Identifies alloc-heavy codepaths.

…ffers into go-faster

Change the signature for 'string' getters and settings to use byte slices instead of strings.

Builder has a new CreateByteString function that writes a null-terimnated byte slice to the buffer. This results in zero allocations for writing strings.

Add the function `Reset` to the Builder, which facilitates reuse of the underlying byte slice.

ghost · 2015-04-08T18:33:44Z

So, do you want to change this so it generates accessors for both strings and byte vectors? Or do you think Go programmers will be happy with just the the byte vectors?

Also, if you can squash related commits, that's helpful in code review.

rw · 2015-04-19T21:10:08Z

The problem with generating multiple accessors is that we can't guarantee they won't collide with other schema-defined fields. For example, a field called 'Name' would get 'Name' and 'NameBytes' as accessors. If there is a 'NameBytes' field (say, a [uchar]), then we'd get a collision.

Personally, I'd rather let the user make the choice whether to allocate a string, at the cost of some syntax noise. That means returning byte slices in all cases.

Anyone else want to share an opinion?

ghost · 2015-04-20T18:48:39Z

I'm fine either way, to me it really depends on how people typically use Go.

As far as collisions go, we already have this problem with "Type" and "Length" suffixes (depending on language), which is currently handled in an ad-hoc way in the parser by giving an error for any fields defined that would clash (see CheckClash() in idl_parser.cpp. We could add "Bytes" to that, though maybe "ByteVector" is even better, since it is less likely to clash.

rw · 2015-05-09T23:14:55Z

Removed remaining allocs during building, as reporting during the gold data benchmarks.
Added collision check in CheckClash for _byte_vector and ByteVector for the BASE_TYPE_STRING type.

rw · 2015-05-09T23:34:58Z

Reduced number of backing buffer growth events, bringing BuildGold speed to within a factor of 2.5x ParseGold.

rw · 2015-05-10T19:45:02Z

I'm hearing that users want to be able to provide their own backing buffer, to have more control over memory usage, and potentially reduce allocs. Something like

NewBuilderFromSlice(...)

or

bldr:= &Builder{Bytes: myslice}
bldr.Reset() // ensures correct initial state of the byte slice

is what is being asked for.

This approach isn't as simple as it seems, though, because a Go Builder actually has three GC'ed objects: the Bytes slice that holds the flatbuf data, the vtables slice that holds already-seen vtables, and vtable which holds the current (if applicable) vtable.

(Note that to set any of these values directly would require making them publicly visible. I'd rather not make vtables and vtable publicly visible, since they are just for the internal state of the Builder.)

So, if users want this, we have a few paths.

Change the Builder constructor to take a Config-like object, then look inside of it to pull out any pre-existing buffers the user gave us. Each field would be optional. You'd use it like this:

myCfg := BuilderConfig{
        InitialBytesBuffer: mySlice1,
        InitialVTableBuffer: mySlice2,
        InitialVTablesBuffer: mySlice3,
}
bldr := NewBuilder(&myCfg)

(Simple use cases will just pass nil for the config.)

Make each alloc'ed field publicly visible, then call Reset on an already-existing Builder, so that users could pass in anything they want. It would look like this:

bldr := &Builder{Bytes: mySlice, VTables: mySlice2}
bldr.Reset() // Sets up bookkeeping

I'm not a fan of this because, like I said, we'd have to make the internal state fields public so that users could set them.

Feedback?

dgnorton · 2015-05-10T21:28:25Z

@rw how hard would it be for the builder to take just one buffer and segment it as needed internally so that implementation details are hidden from client code? If any one of the segments (bytes, vtable, vtables, etc.) needed to grow, allocate a whole new buffer, copy each of the segments into the new buffer, and continue.

rw · 2015-05-10T22:04:45Z

@dgnorton That's an interesting idea. As a caller, that's simpler (but less flexible) than using a Config. One issue I see is that it makes internal bookkeeping code more complicated. Another issue is that the Bytes slice (the largest of these three objects) will lose its exact power-of-2 growth behavior.

@gwvo Any thoughts?

ghost · 2015-05-11T21:28:11Z

In C++ we provide an allocator call-back, is there something similar that can be done in Go?

rw · 2015-05-11T23:58:03Z

An allocator callback in Go would be something like this:

bldr := NewBuilder(myAllocator)

func myAllocator(existingSlice []byte, desiredCapacity int) []byte {
  // Look in a sync.Pool or somewhere else for a suitable slice.

  // Or make a new one on the heap:
  if existingSlice == nil {
    return make([]byte, desiredCapacity)
  }

  existingSlice = existingSlice[:cap(existingSlice)]
  extension := make([]byte, desiredCapacity - cap(existingSlice))
  extended := append(existingSlice, extension...)
  return extended
}

(This was edited.)

This has the added advantage that we could use it for growing the byte buffer, too.

@gwvo Is this the kind of thing you mean?

@dgnorton Would you use this?

rw · 2015-05-11T23:58:50Z

(Keep in mind that at this point, you could just have a pool of Builder objects and reuse them.)

ghost · 2015-05-12T01:09:40Z

Yes.. though reusing FlatBufferBuilder objects seems preferable in most situations.

dgnorton · 2015-05-12T15:49:49Z

@rw I'm not sure a pool of builders would work. E.g., in our case...

get buf from a pool of []byte
read flat data from socket into buf
get builder from a pool of flatbuffer builders
traverse buf normalizing data into new flatbuffer using the builder
pass newly constructed buf down the data pipe and return builder to pool

We have a problem at that last step because the builder still owns the memory that we passed down the pipe. We could pass the builder down the pipe but that seems awkward.

Maybe I'm thinking about it wrong and there's a better way?

rw · 2015-05-12T21:42:09Z

I'd like to handle a public interface for allocations in a separate feature branch. Let's get these speed improvements shipped.

A workaround (for those who want it) is to manually manage the publicly-accessible Bytes buffer, combined with judicious use of the new Reset function.

Go speed improvements

…eted_script delete obsoleted scripts

bmharper and others added 7 commits April 1, 2015 16:47

Add byte slice accessor to Go code

8fb6c4f

chmod GoTest.sh +x

468124f

Benchmarks for building and parsing 'gold' data.

796be32

Identifies alloc-heavy codepaths.

Merge branch 'go-bytevector-getter' of github.com:benharper123/flatbu…

0a3a09a

…ffers into go-faster

Remove all string allocations during parsing.

f02646e

Change the signature for 'string' getters and settings to use byte slices instead of strings.

Reduce allocations when building strings.

ace7fa8

Builder has a new CreateByteString function that writes a null-terimnated byte slice to the buffer. This results in zero allocations for writing strings.

Reduce allocations when reusing a Builder.

d756efb

Add the function `Reset` to the Builder, which facilitates reuse of the underlying byte slice.

cbandy mentioned this pull request Apr 5, 2015

Go - reuse buffers #85

Closed

rw mentioned this pull request Apr 8, 2015

Port FlatBuffers to Python. #112

Merged

rw added 3 commits May 9, 2015 15:37

update CheckClash for string accesses

5d68493

remove remaining allocs during build

3dd5442

gofmt

e11da87

invoke many fewer growth events

e5c21ec

rw added a commit that referenced this pull request May 12, 2015

Merge pull request #165 from rw/go-faster

4d213c2

Go speed improvements

rw merged commit 4d213c2 into google:master May 12, 2015

kakikubo pushed a commit to kakikubo/flatbuffers that referenced this pull request Apr 19, 2016

Merge pull request google#165 from kiyoto-suzuki/feature/delete_obsol…

68d1253

…eted_script delete obsoleted scripts

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Go speed testing #165

Go speed testing #165

rw commented Apr 2, 2015

ghost commented Apr 8, 2015

rw commented Apr 19, 2015

ghost commented Apr 20, 2015

rw commented May 9, 2015

rw commented May 9, 2015

rw commented May 10, 2015

dgnorton commented May 10, 2015

rw commented May 10, 2015

ghost commented May 11, 2015

rw commented May 11, 2015

rw commented May 11, 2015

ghost commented May 12, 2015

dgnorton commented May 12, 2015

rw commented May 12, 2015

Go speed testing #165

Go speed testing #165

Conversation

rw commented Apr 2, 2015

ghost commented Apr 8, 2015

rw commented Apr 19, 2015

ghost commented Apr 20, 2015

rw commented May 9, 2015

rw commented May 9, 2015

rw commented May 10, 2015

dgnorton commented May 10, 2015

rw commented May 10, 2015

ghost commented May 11, 2015

rw commented May 11, 2015

rw commented May 11, 2015

ghost commented May 12, 2015

dgnorton commented May 12, 2015

rw commented May 12, 2015