crl-release-20.1: vfs: Add vfs.Prefetch, use it for reading ahead data blocks #733

itsbilal · 2020-06-10T15:23:49Z

For sequential-like IO workload where we read
data blocks one after the other in quick succession,
signalling the OS to asynchronously bring them to
cache in advance can deliver significant savings in IOPS
dispatched. In IOPS-bound workloads such as backup on
an EBS disk, this delivers a 3x speedup. Presumably
aggregate queries and compactions will be faster
as well, though this hasn't been benchmarked in
practice yet.

This change maintains a counter for the number of data
block reads performed in a singleLevelIterator, and
when that count exceeds 2, a readahead system
call is made on Linux. RocksDB has almost
the exact same behaviour, including the same
min/max readahead sizes and read count thresholds.

Will address cockroachdb/cockroach#49710
when it lands in cockraoch.

For sequential-like IO workload where we read data blocks one after the other in quick succession, signalling the OS to asynchronously bring them to cache in advance can deliver significant savings in IOPS dispatched. In IOPS-bound workloads such as backup on an EBS disk, this delivers a 3x speedup. Presumably aggregate queries and compactions will be faster as well, though this hasn't been benchmarked in practice yet. This change maintains a counter for the number of data block reads performed in a singleLevelIterator, and when that count exceeds 2, a readahead system call is made on Linux. RocksDB has almost the exact same behaviour, including the same min/max readahead sizes and read count thresholds. Will address cockroachdb/cockroach#49710 when it lands in cockraoch.

petermattis · 2020-06-10T15:23:55Z

This change is

petermattis

Nit: for these manual backports, it is still nice for the PR title to be prefixed with release-20.1: as it reduces confusion.

LGTM

itsbilal · 2020-06-10T19:02:00Z

Ah right. I usually do that, but I forgot this time and ended up confusing myself as well. Thanks for the reminder!

itsbilal requested review from jbowens and petermattis June 10, 2020 15:23

itsbilal self-assigned this Jun 10, 2020

petermattis approved these changes Jun 10, 2020

View reviewed changes

itsbilal changed the title ~~vfs: Add vfs.Prefetch, use it for reading ahead data blocks~~ crl-release-20.1: vfs: Add vfs.Prefetch, use it for reading ahead data blocks Jun 10, 2020

itsbilal merged commit 7928b15 into cockroachdb:crl-release-20.1 Jun 10, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

crl-release-20.1: vfs: Add vfs.Prefetch, use it for reading ahead data blocks #733

crl-release-20.1: vfs: Add vfs.Prefetch, use it for reading ahead data blocks #733

itsbilal commented Jun 10, 2020

petermattis commented Jun 10, 2020

petermattis left a comment

itsbilal commented Jun 10, 2020

crl-release-20.1: vfs: Add vfs.Prefetch, use it for reading ahead data blocks #733

crl-release-20.1: vfs: Add vfs.Prefetch, use it for reading ahead data blocks #733

Conversation

itsbilal commented Jun 10, 2020

petermattis commented Jun 10, 2020

petermattis left a comment

Choose a reason for hiding this comment

itsbilal commented Jun 10, 2020