
Improve image upload process #797

Closed
ipanova opened this issue May 25, 2022 · 0 comments

ipanova commented May 25, 2022

I've noticed that we are performing unnecessary steps that hurt performance, increase network traffic, and drive up S3/Azure costs when object storage is used.

It appears that a regular docker/podman push does not use chunked upload; at least I did not find any Range headers in the calls.
https://github.com/pulp/pulp_container/blob/main/pulp_container/app/registry_api.py#L624

This means that a 1 GB layer is uploaded as a single chunk.
In that case we do an extra read of the uploaded data just to turn it into a chunk, which is then sent to storage.
Later we retrieve that data again to assemble the chunks. Here it is a single chunk that is read back and assembled into an artifact, and the artifact (the same binary data as the chunk) is sent to storage again.
As a result the upload takes longer because of the extra reads, and we write the same bytes to storage twice, i.e. we occupy more space, which is billed on object storage like S3. On top of that, every request (GET, PUT, etc.) is billed as well.
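The double-handling described above can be sketched roughly like this (the storage helpers are hypothetical stand-ins, not the actual pulpcore API; they only count the operations that would be billed on S3/Azure):

```python
import hashlib
import io

# Simulated backing store: every put()/get() here would be a paid
# PUT/GET request against S3/Azure in a real deployment.
storage = {}

def put(key: str, data: bytes) -> None:
    storage[key] = data  # one paid PUT, plus billed stored bytes

def get(key: str) -> bytes:
    return storage[key]  # one paid GET

layer = b"x" * 1024  # stands in for a 1 GB layer body

# Current flow as described in this issue:
# 1. read the uploaded body and persist it as a "chunk"
put("chunks/0", io.BytesIO(layer).read())
# 2. later, read the chunk back, assemble it, and persist the artifact
assembled = get("chunks/0")
digest = hashlib.sha256(assembled).hexdigest()
put(f"artifacts/{digest}", assembled)

# The same bytes now live in storage twice (chunk + artifact)
# until the chunk is cleaned up.
assert storage["chunks/0"] == storage[f"artifacts/{digest}"]
```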

For a 1 GB layer, it takes 4.44 minutes to read the uploaded data, create a chunk out of it, and send it to storage.
https://github.com/pulp/pulp_container/blob/main/pulp_container/app/registry_api.py#L638
https://github.com/pulp/pulpcore/blob/main/pulpcore/app/models/upload.py#L32 Need to look into this in more detail, but it seems we even read the data twice here, unnecessarily. I am not sure why we create the ContentFile twice.
It takes another 30 seconds to read that data back, assemble the chunks, initialize and validate an artifact, and send that artifact to storage.

TL;DR: when the upload is performed in a single chunk, create an artifact directly from it and send that to storage.
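A minimal sketch of that shortcut, assuming a monolithic (single-chunk) upload; `save_artifact` and the `storage` dict are hypothetical names used only for illustration:

```python
import hashlib
import io

# Simulated object store: one dict entry == one stored object.
storage = {}

def save_artifact(body: io.BufferedIOBase) -> str:
    """Stream the uploaded body straight into an artifact:
    one read, one write, no intermediate chunk object
    (hypothetical helper, not the pulpcore API)."""
    data = body.read()
    digest = hashlib.sha256(data).hexdigest()
    storage[f"artifacts/{digest}"] = data  # single paid PUT
    return digest

# A monolithic push sends the whole layer in one request body.
layer = b"x" * 1024  # stands in for a 1 GB layer
digest = save_artifact(io.BytesIO(layer))

# Only the artifact is stored; no chunk copy ever exists.
assert storage == {f"artifacts/{digest}": layer}
```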

@ipanova ipanova self-assigned this May 25, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 14, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 14, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 14, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 14, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 14, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 14, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 15, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 21, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 21, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 21, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 23, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 23, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 23, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 23, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 23, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 23, 2022
ipanova added a commit to ipanova/pulp_container that referenced this issue Jun 24, 2022