Optimize memory usage for S3StreamMetadataImage. #618

superhx · 2023-12-29T07:27:18Z

Who is this for and what problem do they have today?

Why is solving this problem impactful?

In the scenario of a 10w partition, even if a stream object compaction is performed every 1 hour, running for a day will still generate 100000 * 24 stream objects, ultimately occupying at least 100+ MiB of metadata memory. If we consider longer retention time and multiple streams per Partition, the actual memory usage of metadata will be even higher.

Additional notes

superhx · 2024-01-03T02:51:01Z

Assuming the strategy of stream compaction is to compact the size of each individual object in a stream up to 1GiB, then for a cluster that stores 1PiB of data, the memory consumption per object would be around 50MiB, which is calculated as 1024 * 1024 * (memory usage per object's metadata).

Therefore, the optimization goals have changed to:

Optimize the memory structure of Image, saving memory overhead caused by data structures like Map.
Optimize the stream compaction strategy, compacting each stream object to 1GiB and eliminating unnecessary blocks based on the startOffset of the stream during the compaction process.

superhx · 2024-01-05T12:43:55Z

1PB of data, 10w partitions, how many stream objects will there be in the end?
If it's accumulated in 10GiB batches, then there will be 100000 stream objects.

Simulating 1w => s3streamobject, 100 partitions, write for 100s is sufficient.

Result: 5000 stream objects generated.
5000, multiplied by 3, Controller Image + Broker Image + Controller processing layer

15000 s3streamobjects occupy 1MiB
s3objects occupy 1.5MiB

S3StreamsMetadataImage occupies 1.5MiB
S3ObjectsImage occupies 1.3MiB

Calculating: Estimated memory consumption is 2MiB for every 1w stream objects generated, and 20MiB for 10w stream objects.

Signed-off-by: Shichao Nie <niesc@automq.com>

superhx added the enhancement New feature or request label Dec 29, 2023

superhx mentioned this issue Dec 29, 2023

10w partition & 1PiB data cluster support #600

Closed

superhx self-assigned this Jan 3, 2024

This was referenced Jan 3, 2024

feat(metadata): replace image map to delta map #629

Merged

feat(kafka_issues618): stream object compact support drop expired data AutoMQ/automq-for-rocketmq#881

Merged

superhx closed this as completed in AutoMQ/automq-for-rocketmq#881 Jan 4, 2024

superhx mentioned this issue Jan 4, 2024

feat(s3stream): copy write based on max part size AutoMQ/automq-for-rocketmq#883

Merged

daniel-y pushed a commit that referenced this issue Mar 14, 2024

feat(s3stream): add s3 operation timeout (#618)

d3545cd

Signed-off-by: Shichao Nie <niesc@automq.com>

ShadowySpirits pushed a commit that referenced this issue Mar 14, 2024

feat(s3stream): add s3 operation timeout (#618)

a26c38c

Signed-off-by: Shichao Nie <niesc@automq.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimize memory usage for S3StreamMetadataImage. #618

Optimize memory usage for S3StreamMetadataImage. #618

superhx commented Dec 29, 2023

superhx commented Jan 3, 2024

superhx commented Jan 5, 2024

Optimize memory usage for S3StreamMetadataImage. #618

Optimize memory usage for S3StreamMetadataImage. #618

Comments

superhx commented Dec 29, 2023

Who is this for and what problem do they have today?

Why is solving this problem impactful?

Additional notes

superhx commented Jan 3, 2024

superhx commented Jan 5, 2024