internal/fuzz: revisit use of shared memory-mapped files for marshaled inputs #48163

jayconrod · 2021-09-02T20:47:15Z

Currently, the fuzzing coordinator process creates a 100MB temporary file for each worker process. Both the coordinator and worker read and write the file via a shared memory map. The file is currently used for 1) a few "header" fields such as iteration count and PRNG state, and 2) passing marshaled inputs for fuzzing in both directions.
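For illustration, here is a minimal sketch of what a small fixed-size header could look like. The `header` type, its field names, and the 24-byte layout are assumptions made for this example, not the actual internal/fuzz definitions, and a plain byte slice stands in for the memory-mapped region.

```go
package main

import (
	"encoding/binary"
	"fmt"
)

// header is a hypothetical layout for the small fixed-size fields that
// both processes share. The field names are illustrative only.
type header struct {
	Count     uint64 // number of fuzz target calls so far
	RandState uint64 // PRNG state at the start of the batch
	RandInc   uint64 // PRNG stream increment
}

const headerSize = 24 // three uint64 fields

// put writes h to the first headerSize bytes of mem.
func (h header) put(mem []byte) {
	binary.LittleEndian.PutUint64(mem[0:], h.Count)
	binary.LittleEndian.PutUint64(mem[8:], h.RandState)
	binary.LittleEndian.PutUint64(mem[16:], h.RandInc)
}

// readHeader decodes a header from the first headerSize bytes of mem.
func readHeader(mem []byte) header {
	return header{
		Count:     binary.LittleEndian.Uint64(mem[0:]),
		RandState: binary.LittleEndian.Uint64(mem[8:]),
		RandInc:   binary.LittleEndian.Uint64(mem[16:]),
	}
}

func main() {
	// A plain slice stands in for the memory-mapped region here.
	mem := make([]byte, headerSize)
	header{Count: 7, RandState: 0x1234, RandInc: 1}.put(mem)
	fmt.Printf("%+v\n", readHeader(mem))
}
```

A header like this needs only a few dozen bytes, which is the contrast with the 100MB file that also has to hold marshaled inputs.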

We still need to use shared memory for the header fields. If a worker process terminates unexpectedly, the coordinator needs to be able to reconstruct the input that caused the crash using the initial input (sent from the coordinator), the call count, and the PRNG state.

However, it's not strictly necessary to write marshaled inputs to shared memory. It would be simpler to pass these through the pipes we use for RPCs. If we only supported unmarshaled byte slices, there would be a performance advantage to writing and mutating those directly in shared memory without incurring the cost of marshaling and pipe I/O. Since we need to marshal inputs anyway, it's not clear the extra complexity is worthwhile.

We should investigate the performance difference and pass inputs through pipes if it's not too bad. This would simplify our implementation, and would let us use much smaller shared memory files.

Additionally, for inputs that can be read from files, such as those in testdata or the cache, we can pass file names over pipes instead of reading the data in the coordinator and sending it over pipes. The coordinator doesn't need to hold that data in virtual memory at all.

toothrot · 2021-09-22T16:13:27Z

Checking in on this issue as it's labeled a release blocker for Go 1.18. Is there any update?

katiehockman · 2021-10-12T18:55:51Z

@jayconrod do you still think this is a release blocker? Reading it over, it feels more like an optimization that we might not necessarily need right now, but there may be context I'm not seeing here.

jayconrod · 2021-10-12T19:09:55Z

This probably doesn't need to be a release blocker.

#48731 is related, though; that's mostly the one I'm worried about.

jayconrod · 2021-10-14T17:29:05Z

Unassigning, since I'm leaving.

I did benchmark whether there would be a significant slowdown from passing inputs through RPC pipes instead of shared memory: there wasn't a measurable difference for inputs of ~100 bytes. The marshaling / unmarshaling cost is far higher, so I don't think communication overhead is a reason to implement this or not.

Reading entries from corpus files in workers is a good idea regardless. Then the coordinator doesn't need to read them at all.

#48731 is my main concern. We need to be able to reconstruct the entry that caused a problem, whether we're fuzzing or minimizing. Writing the entry to shared memory before every call to the fuzz target is too expensive because of the marshaling overhead, so we need to be able to rebuild it from the input entry and some known sequence of transformations. For minimization, I think we should store the sequence of transformations in shared memory. For normal fuzzing, we already store initial RNG state and a count, and that works well enough. In any case, we do still need shared memory, but we could use a lot less of it since we don't need to store inputs there.
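As a sketch of the minimization idea, a log of transformations could look something like the following. Everything here is hypothetical: the `op` type models only one kind of step (removing a byte range), and the log is an ordinary slice rather than a region of shared memory.

```go
package main

import "fmt"

// op is a hypothetical record of one minimization step: remove the bytes
// in the half-open range [Start, End).
type op struct{ Start, End int }

// apply returns a copy of b with the range described by o removed.
func apply(b []byte, o op) []byte {
	out := make([]byte, 0, len(b)-(o.End-o.Start))
	out = append(out, b[:o.Start]...)
	return append(out, b[o.End:]...)
}

// rebuild replays the logged steps against the initial entry, which is
// what a coordinator could do after a worker dies mid-minimization.
func rebuild(initial []byte, log []op) []byte {
	b := initial
	for _, o := range log {
		b = apply(b, o)
	}
	return b
}

func main() {
	initial := []byte("abcdefgh")
	log := []op{{0, 2}, {3, 5}}
	fmt.Printf("%s\n", rebuild(initial, log)) // prints "cdeh"
}
```

Each logged step is a pair of integers, so the log stays small even when the entry being minimized is large.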

gopherbot · 2021-10-15T17:34:05Z

Change https://golang.org/cl/356229 mentions this issue: internal/fuzz: pass fuzz inputs over pipe instead of shared memory

gopherbot · 2022-03-18T01:58:53Z

Change https://go.dev/cl/393660 mentions this issue: internal/fuzz: cleanup usage of shared memory

jayconrod added the NeedsInvestigation, release-blocker, and fuzz labels Sep 2, 2021

jayconrod added this to the Go1.18 milestone Sep 2, 2021

rsc changed the title ~~[dev.fuzz] internal/fuzz: revisit use of shared memory-mapped files for marshaled inputs~~ internal/fuzz: revisit use of shared memory-mapped files for marshaled inputs Sep 21, 2021

katiehockman assigned jayconrod Oct 6, 2021

jayconrod removed the release-blocker label Oct 12, 2021

jayconrod removed their assignment Oct 14, 2021

katiehockman mentioned this issue Nov 10, 2021

internal/fuzz: improperly handling crash that occurs while minimizing interesting input #48731

Closed

katiehockman modified the milestones: Go1.18, Backlog Nov 10, 2021
