
Use multiprocessing to speed up bundletool #698

Closed
wants to merge 2 commits

Conversation

kastiglione
Contributor

In our experience, the bundling step is a bottleneck in tight edit/compile/{run,test} cycles. On my machine, it can take ~14s, which is almost as long as linking and can be longer than incremental compilation.

Part of the problem is Bazel overhead, and part of the problem is that bundletool copies files serially. This change makes use of the multiprocessing module to copy files into the bundle in parallel. Of the ~14s mentioned above, about ~5s is Bazel overhead and the remaining 8-9s is the bundletool. With this change, bundletool runs in about 5-6s (most of which is code signing).

In order to use apply_async, all stateless methods were moved off the Bundler class and converted to free functions, because using a bound method would require supporting pickling for the Bundler class. The change from methods to functions is in its own commit.
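A minimal sketch of the apply_async pattern described above (illustrative only, not the actual bundletool code; the function names are assumptions):

```python
# multiprocessing pickles the task function and its arguments to send them
# to worker processes, so the task must be a module-level (free) function:
# a bound method would drag the whole Bundler instance into the pickle.
import multiprocessing
import shutil


def copy_entry(src, dest):
    # Free function: pickles by reference, no Bundler state required.
    shutil.copy2(src, dest)
    return dest


def copy_all(entries):
    # Fan the copies out across a process pool and wait for all results.
    with multiprocessing.Pool() as pool:
        results = [pool.apply_async(copy_entry, args) for args in entries]
        return [r.get() for r in results]
```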

These functions use no Bundler object state and can be pure functions.
@kastiglione
Contributor Author

@thomasvl
Member

thomasvl commented Feb 3, 2020

Disclaimer - I have not yet looked at the patch, but from experience trying to move things into a parallel step based on the number of CPUs –

The danger here is going to be what happens if you need to build multiple things (bazel build //foo/... or just bazel build //foo:my_app where my_app has a few extensions that can thus be handled in parallel because they are each independent).

In cases like that, bazel can end up running one bundling job per core, because it also scales things out. If you then scale out to the CPU count within the script, you can end up at CPU count squared. That might not be too bad on a laptop or mini with, say, 4-8 cores, but as you get to machines with 16 or more cores, you can actually run out of process entries for the average user, causing the build to fail.
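The fan-out concern can be made concrete with a back-of-the-envelope calculation (the numbers are illustrative, not measurements):

```python
# If bazel runs one bundling action per core (its default --jobs is roughly
# the core count) and each action's multiprocessing.Pool() also defaults to
# one worker per core, the total worker count grows with the square of the
# core count.
def worst_case_workers(cores):
    bazel_actions = cores       # bundling actions bazel may run at once
    workers_per_action = cores  # Pool() default: os.cpu_count() workers
    return bazel_actions * workers_per_action

for cores in (4, 8, 16, 32):
    print("%2d cores -> up to %4d copy workers" % (cores, worst_case_workers(cores)))
```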

I believe there is an open FR on bazel where it would be nice if a rule/action could indicate it needs a resource budget (CPU, RAM, etc.) and then, as part of the action being run, it gets told its assigned budget.

@kastiglione
Contributor Author

Thanks for the fast reply. I did think about that; my reasoning is that file copying uses very little CPU and is mostly I/O. I wrote a jq script to turn the bundletool_control.json into a series of cp commands:

cp a out &
cp -R b out &
cp c out &
...

and I ran this for roughly 4000 files; it completed in something like 1s. This isn't an exact apples-to-apples comparison, but I believe it's not a problem in practice for file copying, since it doesn't seem to be a CPU-hungry operation.
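The experiment above could be reproduced with a small script along these lines (a hypothetical reconstruction: the key names "bundle_merge_files", "src", and "dest" are assumptions, and the real bundletool_control.json schema may differ):

```python
# Turn a bundletool control file into a shell script of backgrounded copies,
# mirroring the cp list shown above.
import json


def control_to_cp_script(control_path):
    with open(control_path) as f:
        control = json.load(f)
    lines = []
    for entry in control.get("bundle_merge_files", []):
        # Background each copy with "&" so they all run concurrently.
        lines.append("cp %s out/%s &" % (entry["src"], entry["dest"]))
    lines.append("wait")  # block until every backgrounded cp finishes
    return "\n".join(lines)
```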

We have a large number of resource bundles in our app, many of which compete with other bazel actions being run concurrently. These are much smaller than the .app bundle, but I can still check whether a clean build shows any performance regression with this.

@kastiglione
Contributor Author

I believe there is an open FR on bazel where it would be nice if a rule/action could indicate it need a resource budge (CPU, RAM, etc.) and then as part of of the action being run, it gets told it's assigned budget.

Yes that's bazelbuild/bazel#10443. It's marked as P2 and nobody has talked about it so I don't know if or when it will get addressed.

@kastiglione
Contributor Author

@thomasvl should this be behind a feature?

@thomasvl
Member

thomasvl commented Feb 3, 2020

How big are the files in your tests? (i.e., the copies may be so fast that you don't really get many copies going in parallel.) From experience, it will work until it doesn't, and when it doesn't it takes a while to figure out, because the next build works: some of the work was already done, so there are fewer jobs and things don't max out. Having an action where bazel fails and you just issue the command again isn't really living up to the bazel promise. 😃

@kastiglione
Contributor Author

Relatedly I've filed bazelbuild/bazel#10702 to find out about the Bazel overhead that shows in the trace around the action.

@kastiglione
Contributor Author

living up to the bazel promise

"{Fast, Correct} - Choose two" :trollface:

@kastiglione
Contributor Author

because the next build works because of the work was done so there are less jobs and things don't max out

I don't think this would happen with the bundletool because it deletes the output directory at the outset. I think any such performance issues would be repeatable.

@kastiglione
Contributor Author

For what it's worth, in a Swift build many or all SwiftCompile actions are using concurrency that bazel is unaware of: swiftc uses parallelism for batch mode, and rules_swift enables parallel object file generation (which is only a small portion of the compile).

@thomasvl
Member

thomasvl commented Feb 4, 2020

I don't think this would happen with the bundletool because it deletes the output directory at the outset. I think any such performance issues would be repeatable.

If there were multiple bundles going in parallel, some could succeed while others fail, so the next "try" would have fewer to do, and the rest would succeed. (been there, done that) 😄

For what it's worth, in a Swift build many or all SwiftCompile actions are using concurrency that bazel is unaware of: swiftc uses parallelism for batch mode, and rules_swift enables parallel object file generation (which is only a small portion of the compile).

Yup, and this is why the hardcoded values in there can't be touched even though we might be able to do better. They appear to be mostly working, and we can't really mess with them without risking something going wrong.

The things I had done like this before all worked great, in some cases for years; it wasn't until they were run on new hardware that things suddenly hit the limits, because the core counts/loads/RAM changed. Looking at the core/thread counts on some of the new Mac Pros, we could be in for some hiccups with local builds on those machines (even with the things currently being done in parallel).

@kastiglione
Contributor Author

Thanks for the discussion @thomasvl. Looks like we're going to use a custom bundletool, so I'll close this out.

brentleyjones pushed a commit that referenced this pull request Nov 18, 2020