
HDDS-9039. Added a test to verify that no compaction log entry is added to compactionLogTable and DAG when tarball creation is in progress. #6171

Closed

Conversation

hemantk-12 (Contributor)

What changes were proposed in this pull request?

There are two major changes in this PR.

  1. Added a new test to validate that no new compaction log entry is added to compactionLogTable and DAG while tarball creation is in progress.
  2. Made waitForTarballCreation a synchronized method so that the waiting and notifying threads use the same object's monitor.

What is the link to the Apache JIRA

HDDS-9039

How was this patch tested?

Ran the newly added test locally and on a fork branch.

hemantk-12 added the snapshot (https://issues.apache.org/jira/browse/HDDS-6517) label on Feb 5, 2024
swamirishi (Contributor) left a comment

The test case doesn't really test the race condition explained in the comments.

@@ -587,7 +587,7 @@ void addToCompactionLogTable(CompactionLogEntry compactionLogEntry) {
   /**
    * Check if there is any in_progress tarball creation request and wait till
    * all tarball creation finish, and it gets notified.
    */
-  private void waitForTarballCreation() {
+  private synchronized void waitForTarballCreation() {
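
For context, a minimal sketch (not the actual Ozone code) of the coordination this one-line change is meant to fix, assuming the tarballRequestCount AtomicInteger discussed later in this thread; the class name and the finishTarballRequest helper are hypothetical:

```java
import java.util.concurrent.atomic.AtomicInteger;

// Minimal sketch, not the Ozone implementation: tarballRequestCount tracks
// in-flight tarball creation requests (see the discussion below).
class TarballCoordinationSketch {
  private final AtomicInteger tarballRequestCount = new AtomicInteger(0);

  // Called before adding a compaction log entry. Must be synchronized so
  // that wait() runs while holding this object's monitor.
  private synchronized void waitForTarballCreation() {
    while (tarballRequestCount.get() > 0) {
      try {
        wait(); // releases the monitor until notifyAll() below
      } catch (InterruptedException e) {
        Thread.currentThread().interrupt();
        break;
      }
    }
  }

  // Hypothetical counterpart on the tarball-creation path; notifyAll()
  // also requires holding the same monitor.
  private synchronized void finishTarballRequest() {
    if (tarballRequestCount.decrementAndGet() == 0) {
      notifyAll();
    }
  }
}
```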
swamirishi (Contributor)

How would this change help in any way? I don't think RocksDB runs multiple compactions in parallel, and the onCompactionCompleted method will commit the compaction.

swamirishi (Contributor)

I believe we would need a lock on tarballRequestCount when incrementing and decrementing it, and the compaction log entry should be added within the same lock.

swamirishi (Contributor)

waitForTarballCreation wouldn't really ensure the count is zero after the method returns. There could be a race condition here.

hemantk-12 (Contributor, Author)

tarballRequestCount is an AtomicInteger, so incrementing and decrementing it is atomic.

waitForTarballCreation is synchronized because wait/notify must be called while holding the same object's monitor; otherwise the JVM throws IllegalMonitorStateException. Doc: wait/notifyAll.

There is more discussion on JIRA HDDS-9039 about why we are adding the test this way.
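
To illustrate that monitor rule in isolation (a standalone snippet, not Ozone code), calling notifyAll() without holding the object's monitor fails at runtime:

```java
public class MonitorDemo {
  public static void main(String[] args) {
    Object lock = new Object();
    try {
      lock.notifyAll(); // monitor not held: throws IllegalMonitorStateException
    } catch (IllegalMonitorStateException e) {
      System.out.println("without monitor: " + e);
    }
    synchronized (lock) {
      lock.notifyAll(); // fine: monitor is held
    }
  }
}
```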

swamirishi (Contributor)

What I didn't understand is that the getCheckpoint() method does not take the synchronized lock. So even after waitForTarballCreation() passes, compaction can still run. Isn't there a race condition here? While compaction is running, we could still end up starting tarball creation for the checkpoint.

hemantk-12 (Contributor, Author) commented Apr 17, 2024

I think you would be right if we were appending compaction log entries to a text file, but we don't use that anymore; entries are added to the compactionLog column family.

I gave it more thought, and I think this locking is no longer needed because we now use a RocksDB column family for the compaction log. A compaction entry will either be present in the table or absent from the ActiveFS snapshot, depending on the order of the entry append and the checkpoint creation. Hence we can simply remove this locking code and rely on RocksDB.

Original discussion to add lock: #4680 (comment)
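
A minimal sketch of that ordering argument using the RocksJava API (the paths and keys are illustrative, and the default column family stands in for Ozone's compactionLog column family):

```java
import org.rocksdb.Checkpoint;
import org.rocksdb.Options;
import org.rocksdb.RocksDB;
import org.rocksdb.RocksDBException;

// Sketch, not Ozone code: a RocksDB checkpoint is a consistent point-in-time
// view, so a compaction log entry either lands before the checkpoint (and is
// included) or after it (and is excluded). There is no torn intermediate
// state that an external lock would need to guard against.
public class CheckpointOrderingDemo {
  public static void main(String[] args) throws RocksDBException {
    RocksDB.loadLibrary();
    try (Options options = new Options().setCreateIfMissing(true);
         RocksDB db = RocksDB.open(options, "/tmp/demo-db")) {
      db.put("compaction-entry-1".getBytes(), "...".getBytes());
      // The checkpoint sees entry 1 but not entry 2.
      try (Checkpoint cp = Checkpoint.create(db)) {
        cp.createCheckpoint("/tmp/demo-checkpoint");
      }
      db.put("compaction-entry-2".getBytes(), "...".getBytes());
    }
  }
}
```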

swamirishi (Contributor)

Yeah, I agree we don't need the locks since we are writing into a RocksDB table. This is fine as long as we are using RocksDB checkpoints to take a checkpoint of the RocksDB instance.

hemantk-12 (Contributor, Author)

Closing it in favor of #6552

hemantk-12 closed this on Apr 17, 2024
hemantk-12 deleted the HDDS-9039 branch on October 28, 2024