
ZFS resilver can be very slow if there are other heavy disk IO requests, can the resilver priority be adjusted? #11777

Closed
wxiaoguang opened this issue Mar 21, 2021 · 6 comments
Labels
Type: Feature Feature request or new feature

Comments

@wxiaoguang

wxiaoguang commented Mar 21, 2021

Describe the feature you would like to see added to OpenZFS

Can the resilver IO priority be adjusted? It would give users a chance to decide how to allocate IO resources.

Old ZFS versions had module parameters like zfs_resilver_delay, but these parameters have been removed in the latest versions.

How will this feature improve OpenZFS?

ZFS resilver can be very slow if there are other heavy disk IO requests; it may starve and nearly stop working, so the resilver does not complete even after several days.

@wxiaoguang wxiaoguang added the Type: Feature Feature request or new feature label Mar 21, 2021
@justinianpopa

justinianpopa commented Mar 21, 2021

Have a look at zfs_resilver_min_time_ms and possibly zfs_scan_min_time_ms via https://openzfs.github.io/openzfs-docs/Performance%20and%20Tuning/Module%20Parameters.html?highlight=resilver#zfs-resilver-min-time-ms

EDIT: urlfix
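
On Linux both parameters can be read and changed at runtime through sysfs. A minimal sketch, assuming the zfs kernel module is loaded (the values below are only placeholders, not recommendations):

```sh
# Current values, in milliseconds (defaults are typically 3000 for resilver, 1000 for scrub)
cat /sys/module/zfs/parameters/zfs_resilver_min_time_ms
cat /sys/module/zfs/parameters/zfs_scan_min_time_ms

# Give the resilver more time per txg; takes effect immediately
echo 5000 > /sys/module/zfs/parameters/zfs_resilver_min_time_ms

# Persist across reboots (file name/path may differ per distribution)
echo "options zfs zfs_resilver_min_time_ms=5000" >> /etc/modprobe.d/zfs.conf
```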

@amotin
Member

amotin commented Mar 21, 2021

The latest change in this area was mine: #11166 . While the idea there was actually the opposite -- to throttle the resilver better so it does not affect payload latency -- there are a number of tunables to allow adjustment if needed.

@wxiaoguang
Author

wxiaoguang commented Mar 22, 2021

> Have a look at zfs_resilver_min_time_ms and possibly zfs_scan_min_time_ms via https://openzfs.github.io/openzfs-docs/Performance%20and%20Tuning/Module%20Parameters.html?highlight=resilver#zfs-resilver-min-time-ms

Thank you very much, justinianpopa ~

The document says "ZFS spends at least zfs_resilver_min_time_ms time working on a resilver between txg commits" and "ZFS spends at least zfs_scan_min_time_ms time working on a scrub between txg commits".

To my understanding, txg commits only affect "writes".

In my case, heavy read IO requests still slow down the ZFS resilver.

e.g.: while a disk is being resilvered, if an rsync is copying files out of the raidz, the resilver becomes very slow. If the rsync is killed, the resilver speeds up again.

I am not sure if my understanding is correct.

And I have tested the zfs_resilver_min_time_ms parameter (the commands used to observe this are sketched after the list):

  1. disk sdb in a raidz3 (data1) is being resilvered, while some (light) read/write IO requests are running.
  2. iostat shows that %util of sdb is about 100% (the disk is being written by the resilver)
  3. the otime of data1/txg is around 6s
  4. run rsync to copy large files from data1 to another raidz3 (data2)
  5. the otime of data1/txg grows to around 10-20s
  6. iostat shows that %util of sdb frequently drops to 0-10%, even though I set zfs_resilver_min_time_ms to 10s
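
For reference, a sketch of how the numbers above were observed (assuming a Linux host, the pool name data1 and the disk sdb from the steps):

```sh
# Per-disk utilization every second; watch the %util column for sdb
iostat -x 1 sdb

# Per-txg statistics for the pool; otime is how long each txg stayed open
tail -n 5 /proc/spl/kstat/zfs/data1/txgs

# Resilver progress and estimated completion time
zpool status data1
```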

About #11166: thank you very much, amotin. I am not sure whether the problem I am seeing is caused by 4K random reads (somehow related?). I am glad to continue investigating and to help.

@justinianpopa

justinianpopa commented Mar 22, 2021

While I'm unsure whether scrub/resilver IO is considered async or sync IO by the ZFS scheduler, you may also try (as mentioned in the issue above) tuning zfs_vdev_async_read_[min,max]_active / zfs_vdev_sync_read_[min,max]_active to lower values to reduce queue depth.

There may not be a tunable combination that exactly prioritises resilver reads, but you might be able to tune the scheduling of IO requests so that normal pool activity and resilvers are at least treated in a balanced manner for your specific drives. There is also zfs_vdev_scrub_[min,max]_active, which refers to "reads and scan IOs", but I think that only applies to scrubs and not resilvers. zfs_vdev_aggregation_limit might also help you gain a few more IOPS, assuming scrub/resilver requests get treated and scheduled alongside normal ones within the aggregation limit.

In my experience with ZFS on Linux performance tuning, you could also try setting the read_ahead_kb kernel tunable to 0 (for each block device in the pool, in /sys/block/*/queue/read_ahead_kb) to gain a few more useful IOPS, as data from readahead is rarely useful.

It's worth trying a few combinations.
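
A sketch of the knobs mentioned above (the device names and values are placeholders, not recommendations; defaults differ between releases):

```sh
# Lower the async read queue depth so payload reads crowd out the resilver less
echo 1 > /sys/module/zfs/parameters/zfs_vdev_async_read_min_active
echo 2 > /sys/module/zfs/parameters/zfs_vdev_async_read_max_active

# Allow more concurrent scrub/resilver IOs per vdev
echo 1 > /sys/module/zfs/parameters/zfs_vdev_scrub_min_active
echo 4 > /sys/module/zfs/parameters/zfs_vdev_scrub_max_active

# Disable kernel readahead on each pool member disk
for dev in sdb sdc; do
    echo 0 > /sys/block/$dev/queue/read_ahead_kb
done
```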

@rincebrain
Contributor

The man page for zfs-module-parameters explicitly cites zfs_vdev_async_[...] as affecting resilver performance.

It also explicitly suggests that tuning zfs_vdev_scrub_max_active "will cause the scrub or resilver to complete more quickly", so it should affect resilvers too.
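
To see what the running module actually exposes, and the current values, something like this should work (a sketch, assuming zfs is loaded as a kernel module):

```sh
# Parameter descriptions compiled into the module
modinfo -p zfs | grep -E 'scrub|resilver'

# Current values of the scrub/resilver queue tunables
grep -H . /sys/module/zfs/parameters/zfs_vdev_scrub_*_active
```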

@wxiaoguang
Author

Thanks rincebrain.

I will read the documentation on zfs_vdev_scrub_max_active and try it later.

It would help many users if the ZFS documentation had a topic about resilver performance.
