
[libc++][lit] Atomically update the persistent cache #66538

Merged: 1 commit into llvm:main on Sep 18, 2023

Conversation

arichardson (Member)
When running multiple shards in parallel, one shard might write to the cache while another is reading it. Instead of updating the file in place, write to a temporary file and swap it into place with os.replace(). Since os.replace() is atomic, shards will see either the old state or the new one, never a partially written file.

@arichardson arichardson requested a review from ldionne September 15, 2023 19:24
@llvmbot llvmbot added the libc++ libc++ C++ Standard Library. Not GNU libstdc++. Not libc++abi. label Sep 15, 2023
llvmbot (Member) commented Sep 15, 2023

@llvm/pr-subscribers-libcxx

Changes

Full diff: https://github.com/llvm/llvm-project/pull/66538.diff

1 file affected:

  • (modified) libcxx/utils/libcxx/test/dsl.py (+5-1)
diff --git a/libcxx/utils/libcxx/test/dsl.py b/libcxx/utils/libcxx/test/dsl.py
index 847cebf5962f6aa..7d4df6b01eabdae 100644
--- a/libcxx/utils/libcxx/test/dsl.py
+++ b/libcxx/utils/libcxx/test/dsl.py
@@ -69,8 +69,12 @@ def f(config, *args, **kwargs):
             if cacheKey not in cache:
                 cache[cacheKey] = function(config, *args, **kwargs)
                 # Update the persistent cache so it knows about the new key
-                with open(persistentCache, "wb") as cacheFile:
+                # We write to a temporary file and rename the result to ensure
+                # that the cache is not corrupted when running the test suite
+                # with multiple shards.
+                with open(persistentCache + ".tmp", "wb") as cacheFile:
                     pickle.dump(cache, cacheFile)
+                os.replace(persistentCache + ".tmp", persistentCache)
             return cache[cacheKey]
 
         return f
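The write-to-temp-then-rename idiom used in the diff above can be sketched in isolation. This is a minimal, self-contained illustration, not the exact dsl.py code; the helper name `atomic_pickle_dump` is hypothetical.

```python
import os
import pickle


def atomic_pickle_dump(obj, path):
    # Write to a sibling temporary file first, then atomically swap it
    # into place. os.replace() is atomic on both POSIX and Windows, so a
    # concurrent reader sees either the old file or the new one in full,
    # never a partially written file.
    tmp = path + ".tmp"
    with open(tmp, "wb") as f:
        pickle.dump(obj, f)
    os.replace(tmp, path)
```

Note that the temporary file must live on the same filesystem as the target, which is why it is created as a sibling of `path` rather than via `tempfile.gettempdir()`.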

@arichardson arichardson force-pushed the libcxx-lit-cache-atomic branch from d288487 to f3fc44c Compare September 15, 2023 19:27
@ldionne ldionne self-assigned this Sep 15, 2023
Review comment (Member) on the changed lines in libcxx/utils/libcxx/test/dsl.py:
Don't we have the same problem with the creation of the .tmp file now? Don't we need to generate a unique file name instead?

arichardson (Member, Author) replied:
Hmm, that's a good point. In my limited testing this was enough to avoid the race, but it really should be a per-shard suffix. I will update it to use the PID.
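The per-shard suffix suggested above could look like the following sketch, which embeds the process ID in the temporary file name so that concurrent shards never collide before the atomic rename. This is an illustration of the approach discussed, with hypothetical names, not the code that was merged.

```python
import os
import pickle


def atomic_pickle_dump_per_shard(obj, path):
    # Each lit shard runs in its own process, so os.getpid() gives a
    # suffix unique to this shard. Two shards can then write their
    # temporary files concurrently without clobbering each other; the
    # final os.replace() is still atomic, so the last writer wins.
    tmp = f"{path}.tmp.{os.getpid()}"
    with open(tmp, "wb") as f:
        pickle.dump(obj, f)
    os.replace(tmp, path)
```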

(Comment thread on libcxx/utils/libcxx/test/dsl.py marked resolved.)
@arichardson arichardson force-pushed the libcxx-lit-cache-atomic branch from f3fc44c to 5b36d6c Compare September 15, 2023 20:35
@arichardson arichardson requested a review from a team as a code owner September 15, 2023 20:35
@ldionne ldionne merged commit 14882d6 into llvm:main Sep 18, 2023
@arichardson arichardson deleted the libcxx-lit-cache-atomic branch September 18, 2023 16:11
ZijunZhaoCCK pushed a commit to ZijunZhaoCCK/llvm-project that referenced this pull request Sep 19, 2023
zahiraam pushed a commit to tahonermann/llvm-project that referenced this pull request Oct 24, 2023
4 participants