OpenAPI/Swagger parsing is (?) performance bottleneck #3670
Comments
Thank you for filing the issue, will look into it
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with /close. Send feedback to sig-contributor-experience at kubernetes/community.
/remove-lifecycle stale
The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs. This bot triages issues and PRs according to the following rules:
- After 90d of inactivity, lifecycle/stale is applied
- After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
- After 30d of inactivity since lifecycle/rotten was applied, the issue is closed
You can:
- Mark this issue or PR as fresh with /remove-lifecycle stale
- Mark this issue or PR as rotten with /lifecycle rotten
- Close this issue or PR with /close
- Offer to help out with Issue Triage
Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle stale
The Kubernetes project currently lacks enough active contributors to adequately respond to all issues and PRs. This bot triages issues and PRs according to the following rules:
- After 90d of inactivity, lifecycle/stale is applied
- After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
- After 30d of inactivity since lifecycle/rotten was applied, the issue is closed
You can:
- Mark this issue or PR as fresh with /remove-lifecycle rotten
- Close this issue or PR with /close
- Offer to help out with Issue Triage
Please send feedback to sig-contributor-experience at kubernetes/community. /lifecycle rotten
/remove-lifecycle rotten
Discussed offline, one option would be to try to switch to using kube-openapi, which stores the openapi data in proto form. This could be worth looking into.
/assign
Only took a year, but I believe this is fixed now by #4568 :)
I was wondering why running kustomize (& kpt) is not effectively instant, and I tracked down a large performance bottleneck: initializing the OpenAPI data, i.e. parsing the swagger document.
I believe this is the most significant constant overhead for kustomize; obviously on very large kustomizations the data manipulation will eventually dominate.
I sent #3669, which adds a benchmark to quantify the impact; based on that, it takes about 900ms to JSON-parse the swagger itself, and about 20ms to un-gzip the data embedded in the binary. These numbers roughly tally with the performance overhead I see in the real world.
This particularly matters for e.g. kpt, where a kpt setter that uses kyaml also takes at least 1 second to run.
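For illustration, a standalone benchmark in the spirit of #3669 could separate the two costs like this (a sketch only: the fixture path and benchmark names are assumptions, not the actual benchmark from that PR, and kustomize itself embeds the gzipped swagger.json in the binary rather than reading it from disk):

```go
package openapi_test

import (
	"bytes"
	"compress/gzip"
	"encoding/json"
	"io"
	"os"
	"testing"
)

// swaggerGzPath is a hypothetical fixture location for the gzipped
// swagger document used by these sketch benchmarks.
const swaggerGzPath = "testdata/swagger.json.gz"

// BenchmarkUnGzipSwagger measures only the decompression step
// (~20ms per the numbers reported above).
func BenchmarkUnGzipSwagger(b *testing.B) {
	gz, err := os.ReadFile(swaggerGzPath)
	if err != nil {
		b.Skipf("fixture not present: %v", err)
	}
	b.ResetTimer()
	for i := 0; i < b.N; i++ {
		zr, err := gzip.NewReader(bytes.NewReader(gz))
		if err != nil {
			b.Fatal(err)
		}
		if _, err := io.Copy(io.Discard, zr); err != nil {
			b.Fatal(err)
		}
	}
}

// BenchmarkParseSwagger measures JSON parsing of the decompressed
// document (~900ms per the numbers reported above).
func BenchmarkParseSwagger(b *testing.B) {
	gz, err := os.ReadFile(swaggerGzPath)
	if err != nil {
		b.Skipf("fixture not present: %v", err)
	}
	zr, err := gzip.NewReader(bytes.NewReader(gz))
	if err != nil {
		b.Fatal(err)
	}
	raw, err := io.ReadAll(zr)
	if err != nil {
		b.Fatal(err)
	}
	b.ResetTimer()
	for i := 0; i < b.N; i++ {
		var doc map[string]interface{}
		if err := json.Unmarshal(raw, &doc); err != nil {
			b.Fatal(err)
		}
	}
}
```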
I think it would be interesting to cache the deserialized form.
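One possible shape for that, sketched under the assumption that the parsed form is persisted on disk and reloaded on later invocations (the function, the trimmed-down document type, and the cache location are all hypothetical, not kustomize or kpt API):

```go
package swaggercache

import (
	"encoding/gob"
	"encoding/json"
	"os"
	"path/filepath"
)

// swaggerDoc is a hypothetical, trimmed-down view of the swagger
// document; a concrete struct keeps both json.Unmarshal and gob happy.
type swaggerDoc struct {
	Definitions map[string]json.RawMessage `json:"definitions"`
}

// loadSwaggerCached pays the expensive JSON parse once, writes the
// result in a cheaper-to-decode format (gob here; proto would be
// another option), and reuses that file on subsequent invocations.
func loadSwaggerCached(rawJSON []byte, cacheDir string) (*swaggerDoc, error) {
	cachePath := filepath.Join(cacheDir, "swagger.gob")

	// Fast path: decode a previously cached copy.
	if f, err := os.Open(cachePath); err == nil {
		defer f.Close()
		var doc swaggerDoc
		if err := gob.NewDecoder(f).Decode(&doc); err == nil {
			return &doc, nil
		}
		// On any decode error, fall through and rebuild the cache.
	}

	// Slow path: full JSON parse, then write the cache for next time.
	var doc swaggerDoc
	if err := json.Unmarshal(rawJSON, &doc); err != nil {
		return nil, err
	}
	if f, err := os.Create(cachePath); err == nil {
		_ = gob.NewEncoder(f).Encode(&doc)
		f.Close()
	}
	return &doc, nil
}
```

Keeping the definitions as raw bytes only defers their individual parsing, so in practice the cached form would more likely be a fully typed structure or (as suggested above) a proto-encoded one; the sketch is just meant to show where a cross-invocation cache would slot in.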
But there's also some low-hanging fruit: for example, the swagger UnmarshalJSON code calls json.Unmarshal twice over the same data (once for SwaggerProps, once for VendorExtensible). But that only gets us 2x, whereas caching (I'd SWAG) could be 100-1000x.
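The pattern being described looks roughly like this; a simplified sketch in the style of go-openapi/spec, with trimmed-down stand-in types rather than the actual upstream source:

```go
package swaggersketch

import (
	"encoding/json"
	"strings"
)

// SwaggerProps stands in for the ordinary swagger fields; the real
// type carries many more of them.
type SwaggerProps struct {
	Swagger string `json:"swagger"`
	Info    struct {
		Title string `json:"title"`
	} `json:"info"`
}

// VendorExtensible collects the "x-..." extension keys.
type VendorExtensible struct {
	Extensions map[string]interface{}
}

// UnmarshalJSON decodes the whole document into a generic map and keeps
// only the extension keys -- this is one full parse of the data.
func (v *VendorExtensible) UnmarshalJSON(data []byte) error {
	var all map[string]interface{}
	if err := json.Unmarshal(data, &all); err != nil {
		return err
	}
	for k, val := range all {
		if strings.HasPrefix(strings.ToLower(k), "x-") {
			if v.Extensions == nil {
				v.Extensions = map[string]interface{}{}
			}
			v.Extensions[k] = val
		}
	}
	return nil
}

// Swagger embeds both halves of the document.
type Swagger struct {
	VendorExtensible
	SwaggerProps
}

// UnmarshalJSON is where the 2x shows up: the same byte slice is parsed
// once for the ordinary fields and once more for the extensions.
func (s *Swagger) UnmarshalJSON(data []byte) error {
	var sw Swagger
	if err := json.Unmarshal(data, &sw.SwaggerProps); err != nil {
		return err
	}
	if err := json.Unmarshal(data, &sw.VendorExtensible); err != nil {
		return err
	}
	*s = sw
	return nil
}
```

Collapsing the two passes into a single decode would recover roughly that 2x, which is why caching the already-deserialized form looks like the bigger win.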