Investigate why TimeBasedDirectoryCleaner counts too many deletes on MacOS #143

chrisrohr · 2020-04-27T21:15:58Z

No description provided.

sleberknight · 2020-08-05T16:16:59Z

See the testScheduleCleanup_WithScheduledExecutor_UsingMultipleConcurrentCleaners_IntegrationTest which is @EnabledOnOs(OS.LINUX)

sleberknight · 2021-01-30T21:43:45Z

To clarify, when run on macOS (tested on Catalina and Big Sur), we see more deletes than the total number of directories created.

I modified waitUntilReachExpectedDeleteCount as follows and each time I run the tests I get slightly different results.

    private void waitUntilReachExpectedDeleteCount(TimeBasedDirectoryCleaner cleaner1,
                                                   TimeBasedDirectoryCleaner cleaner2,
                                                   int expectedDeleteCount) {
        await().atMost(TEN_SECONDS).until(() -> {
            long aggregateDeleteCount = cleaner1.getDeleteCount() + cleaner2.getDeleteCount();
            if (aggregateDeleteCount > expectedDeleteCount) {
                LOG.warn("expectedDeleteCount: {} ; aggregateDeleteCount: {}", expectedDeleteCount, aggregateDeleteCount);
            }
            return aggregateDeleteCount >= expectedDeleteCount;
        });
    }

Here is output from the most recent time I ran this test:

For totalFileCount=500:

expectedDeleteCount: 500 ; aggregateDeleteCount: 511

For totalFileCount=2000:

expectedDeleteCount: 2000 ; aggregateDeleteCount: 2053

chrisrohr · 2021-04-21T01:03:29Z

This test is in TimeBasedDirectoryCleanerConfigTest. Putting this here since I kept looking in TimeBasedDirectoryCleanerTest and couldn't find it. The error I consistently get is on line waitUntilReachExpectedDeleteCount(cleaner1, cleaner2, 300); and it timeout at 10 seconds and states that the file delete errors are ~150.

chrisrohr · 2021-04-21T01:18:05Z

Ok, I have a theory as to why this isn't working. When I introduce a small delay in between the two scheduleCleanup calls everything works correctly. I think that there must be something on the MacOS file system that is allowing our delete call to return true when 2 threads delete at the same time for both deletes. This causes our numbers to miscount. I know this test is ensuring that multiple cleaners can run at the same time, but I'm not sure that the use case of them starting at the exact same time is something we need to work around. I will submit a PR for this to at least have the tests pass and remove the @Enable

sleberknight · 2021-04-21T01:20:38Z

Yes, pretty sure the code works, given that we've used it in production for years now. So changing the code isn't necessary, which is what I think you are saying, only the test?

chrisrohr · 2021-04-21T01:21:25Z

Correct, this is only a test change.

chrisrohr · 2021-04-21T01:22:37Z

The small delay in setting up the 2 cleaners can be as low as 200ms for the test using 500 files and 400ms for the test using 2000 files. That is enough time for them both to delete but not step on each other.

sleberknight · 2021-04-21T01:28:10Z

And only add this delay on macOS? e.g. delayIfMacOs() and using "Mac OS X".equals(System.getProperty("os.name")) (or just using one of the many, many, many constants in SystemUtils in commons-lang3, e.g. SystemUtils.IS_OS_MAC

…urrentCleaners_IntegrationTest on all platforms The concurrent cleaners need to be created with a small buffer in between. See #143 for details Fixes #143

…urrentCleaners_IntegrationTest on all platforms (#554) * Re-enable testScheduleCleanup_WithScheduledExecutor_UsingMultipleConcurrentCleaners_IntegrationTest on all platforms * The concurrent cleaners need to be created with a small buffer in between on macOS. See #143 for details Fixes #143

Change the concurrent test in TimeBasedDirectoryCleanerConfigTest so that it uses '>=' instead of '==' on macOS when waiting for number of deletions and when making assertions about the total delete count. This accommodates (though certainly doesn't explain) the behavior we see on macOS in which there are more successful deletions than there are files created. On Linux (CentOS as well as Ubuntu) we have never seen that behavior, so for those OSes we retain the strict equality check. Relates to #143 (which is already closed)

* Refactor TimeBasedDirectoryCleanerConfigTest for macOS vs Linux Change the concurrent test in TimeBasedDirectoryCleanerConfigTest so that it uses '>=' instead of '==' on macOS when waiting for number of deletions and when making assertions about the total delete count. This accommodates (though certainly doesn't explain) the behavior we see on macOS in which there are more successful deletions than there are files created. On Linux (CentOS as well as Ubuntu) we have never seen that behavior, so for those OSes we retain the strict equality check. Relates to #143 (which is already closed)

sleberknight added the investigation Something that needs to be investigated before implementation can proceed label Apr 27, 2020

chrisrohr mentioned this issue Apr 21, 2021

Re-enable testScheduleCleanup_WithScheduledExecutor_UsingMultipleConcurrentCleaners_IntegrationTest on all platforms #554

Merged

sleberknight assigned chrisrohr Apr 21, 2021

sleberknight closed this as completed in #554 Apr 22, 2021

sleberknight mentioned this issue May 4, 2021

Refactor TimeBasedDirectoryCleanerConfigTest for macOS vs Linux #556

Merged

sleberknight linked a pull request May 4, 2021 that will close this issue

Refactor TimeBasedDirectoryCleanerConfigTest for macOS vs Linux #556

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Investigate why TimeBasedDirectoryCleaner counts too many deletes on MacOS #143

Investigate why TimeBasedDirectoryCleaner counts too many deletes on MacOS #143

chrisrohr commented Apr 27, 2020

sleberknight commented Aug 5, 2020

sleberknight commented Jan 30, 2021

chrisrohr commented Apr 21, 2021

chrisrohr commented Apr 21, 2021

sleberknight commented Apr 21, 2021

chrisrohr commented Apr 21, 2021

chrisrohr commented Apr 21, 2021

sleberknight commented Apr 21, 2021

Investigate why TimeBasedDirectoryCleaner counts too many deletes on MacOS #143

Investigate why TimeBasedDirectoryCleaner counts too many deletes on MacOS #143

Comments

chrisrohr commented Apr 27, 2020

sleberknight commented Aug 5, 2020

sleberknight commented Jan 30, 2021

chrisrohr commented Apr 21, 2021

chrisrohr commented Apr 21, 2021

sleberknight commented Apr 21, 2021

chrisrohr commented Apr 21, 2021

chrisrohr commented Apr 21, 2021

sleberknight commented Apr 21, 2021