Skip to content

Conversation

@earthling-amzn
Copy link
Contributor

@earthling-amzn earthling-amzn commented Oct 23, 2025

Keep track of promotion failures. Report the number of failures and total number of bytes that could not be promoted. These changes were hoisted out of #27632.


Progress

  • Change must be properly reviewed (1 review required, with at least 1 Reviewer)
  • Change must not contain extraneous whitespace
  • Commit message must refer to an issue

Issue

  • JDK-8370520: GenShen: Track and report on promotion failures (Enhancement - P4)

Reviewers

Reviewing

Using git

Checkout this PR locally:
$ git fetch https://git.openjdk.org/jdk.git pull/27962/head:pull/27962
$ git checkout pull/27962

Update a local copy of the PR:
$ git checkout pull/27962
$ git pull https://git.openjdk.org/jdk.git pull/27962/head

Using Skara CLI tools

Checkout this PR locally:
$ git pr checkout 27962

View PR using the GUI difftool:
$ git pr show -t 27962

Using diff file

Download this PR as a diff file:
https://git.openjdk.org/jdk/pull/27962.diff

Using Webrev

Link to Webrev Comment

@bridgekeeper
Copy link

bridgekeeper bot commented Oct 23, 2025

👋 Welcome back wkemper! A progress list of the required criteria for merging this PR into master will be added to the body of your pull request. There are additional pull request commands available for use with this pull request.

@openjdk
Copy link

openjdk bot commented Oct 23, 2025

@earthling-amzn This change is no longer ready for integration - check the PR body for details.

@openjdk openjdk bot added hotspot-gc hotspot-gc-dev@openjdk.org shenandoah shenandoah-dev@openjdk.org labels Oct 23, 2025
@openjdk
Copy link

openjdk bot commented Oct 23, 2025

@earthling-amzn The following labels will be automatically applied to this pull request:

  • hotspot-gc
  • shenandoah

When this pull request is ready to be reviewed, an "RFR" email will be sent to the corresponding mailing lists. If you would like to change these labels, use the /label pull request command.

@openjdk openjdk bot added the rfr Pull request is ready for review label Oct 23, 2025
@mlbridge
Copy link

mlbridge bot commented Oct 23, 2025

Webrevs

Copy link
Member

@shipilev shipilev left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks fine, with a nit.

Comment on lines 128 to 129
size_t get_promotion_failed_count() const { return _promotion_failure_count; }
size_t get_promotion_failed_words() const { return _promotion_failure_words; }
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Shouldn't these be AtomicAccess::load(...)-s? I don't think you need memory ordering, but if you are doing the updates atomically somewhere, it stands to reason you want to match the loads with atomics as well.

@openjdk openjdk bot added the ready Pull request is ready to be integrated label Oct 24, 2025
@openjdk openjdk bot removed the ready Pull request is ready to be integrated label Oct 24, 2025

const size_t gc_id = heap->control_thread()->get_gc_id();

AtomicAccess::inc(&_promotion_failure_count);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just noticing that in the next code block, we acquire the heap->lock(). Could we just use that same heap lock to protect adjustments to _promotion_failure_count and _promotion_failure_words and then we would need to use Atomic access operations?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should probably be ok to read these variables without lock. We're only logging these results when evacuation is no longer happening. Right?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Now that we have this new log message, can we get rid of the "promotion failure messages" for individual objects?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We only take the heap lock there conditionally when we haven't yet "squelched" the log message. We could change this to a log_debug(gc, plab) level message. The message is still useful when trying to understand the history and context for a how a thread became unable to promote. I'm not completely convinced there aren't still cases where a thread should be able to promote but can't for some (unknown) reason.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

If we made this a log_debug(gc, plab) message, we could make the whole block of code conditional on the log level being enabled.

@openjdk openjdk bot added the ready Pull request is ready to be integrated label Oct 24, 2025
And only do the work for the message if the log level is enabled.
@openjdk openjdk bot removed the ready Pull request is ready to be integrated label Oct 24, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

hotspot-gc hotspot-gc-dev@openjdk.org rfr Pull request is ready for review shenandoah shenandoah-dev@openjdk.org

Development

Successfully merging this pull request may close these issues.

4 participants