Server-side performance issue with small I/O #190

wangvsa · 2024-03-15T16:23:04Z

This issue documents the performance challenges observed during the Montage experiments. The high server-side overhead encountered when processing small I/O requests needs to be mitigated to make PDC beneficial for Montage or AI applications.

What does this feature solve or improve?

The Montage results suggest that the server may not be operating at its peak efficiency. In scenarios with numerous concurrent requests, the server could potentially become a bottleneck. We may observe similar I/O patterns from AI applications as well.

Describe the solution you'd like

Server side algorithm for processing I/O requests can be improved.
Server-side multi-threading should also be able to improve the efficiency.

Montage results on Perlmutter

The Montage components execute a large number of small reads and writes. Within the tested workflow, each I/O operation amounts to approximately 3000 bytes.
The performance of PDC, with or without cache, remains similar, indicating that the majority of the time was consumed by server processing

I did some further investigations on one component, mProjExecMPI. This component executes N small writes, followed by one read, and then another M writes. I implemented optimizations, including utilizing session consistency and combining all writes into one batched call. However, the performance remains suboptimal. Especially, the single read operation takes 2 seconds, suggesting it was awaiting processing on the server side.

wangvsa added the type: new feature Request for new feature label Mar 15, 2024

houjun self-assigned this Mar 18, 2024

jeanbez added the priority: medium Medium priority label Jun 4, 2024

This was referenced Jul 1, 2024

Wait all fix #204

Closed

Multi-thread fix and request merging #205

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Server-side performance issue with small I/O #190

Server-side performance issue with small I/O #190

wangvsa commented Mar 15, 2024

Server-side performance issue with small I/O #190

Server-side performance issue with small I/O #190

Comments

wangvsa commented Mar 15, 2024