Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LG-293: Add ability for requestTiming hungRequest detection to not collect javacores #13694

Closed
28 tasks done
donbourne opened this issue Aug 27, 2020 · 13 comments
Closed
28 tasks done
Assignees
Labels
Design Approved Epic Used to track Feature Epics that are following the UFO process focalApproved:accessibility Focal Approval granted for Accessibility for the feature focalApproved:demo Approval that a Demo has been scheduled focalApproved:fat Focal Approval granted for FAT for the feature focalApproved:globalization Focal Approval granted for Globalization for the feature focalApproved:id Focal Approval granted for ID for the feature focalApproved:performance Focal Approval granted for Performance for the feature focalApproved:serviceability Focal Approval granted for Serviceability for the feature focalApproved:ste Focal Approval granted for STE for the feature focalApproved:svt Focal Approval granted for SVT for the feature ID Required release:21003-beta release:21004 target:21003-beta target:21004 team:Lumberjack

Comments

@donbourne
Copy link
Member

donbourne commented Aug 27, 2020

Provide a means to disable the collection of javacores during hungRequest detection in requestTiming-1.0.

When ready, add links to the Upcoming Feature Overview document and Feature Test Summary issue:


List of Steps to complete or get approvals / sign-offs for Onboarding to the Liberty release (GM date)

Instructions:

  • Do the actions below and mark them complete in the checklist when they are done.
  • Make sure all feature readiness approvers put the appropriate tag on the epic to indicate their approval.

Design

Before Development Starts or 8 weeks before Onboarding

  • POC Design / UFO Review Scheduled (David Chang) or N/A.
  • POC Design / UFO Reviewed (Feature Owner) or N/A.
  • Complete any follow-ons from the POC Review.
  • Design / UFO Approval (Alasdair Nottingham) or N/A.
  • No Design / No UFO Approval (Arthur De Magalhaes - cloud / Alasdair Nottingham - server) or N/A.
  • SVT Requirements identified. (Epic owner / Feature owner with SVT focal point)
  • ID Requirements identified. (Epic owner / Feature owner with ID focal point)
  • Create a child task of this epic entitled "FAT Approval Test Summary". Add the link in above.

Legal

3 weeks before Onboarding

  • Identify all open source libraries that are changing or are new. Work with Legal Release Services (Cass Tucker or Release PM) to get open source cleared and approved. Or N/A. (Epic Owner). New or changed open source impacts license and Certificate of Originality.

Translation

3 weeks before Onboarding

  • All new or changed PII messages are checked into the integration branch, before the last translation shipment out. (Epic Owner)

Feature Complete

2 weeks before Onboarding

  • Implementation complete. (Epic owner / Feature owner)
  • All function tests complete. Ready for FAT Approval. (Epic owner / Feature owner)
  • Review all known issues for Stop Ship. (Epic owner / Feature owner / PM)

Focal Point Approvals

2 to 1 week before Onboarding

You MUST have the Design Approved or No Design Approved label before requesting focal point approvals.

All features (both "Design Approved" and "No Design Approved")

  • FAT - (Kevin Smith). SOE FATS are running successfully or N/A . Approver adds label focalApproved:fat to the Epic in Github.
  • Demo - (Tom Evans or Chuck Bridgham). Demo is scheduled for an upcoming EOI. Approver adds label focalApproved:demo to the Epic in Github.
  • Globalization (Sam Wong - Liberty / Simy Cheeran - tWAS). Translation is complete or N/A. TVT - complete or N/A. Approver adds label focalApproved:globalization to the Epic in Github.

"Design Approved" features

  • Accessibility - (G Scott Johnston). Accessibility testing is complete or N/A. Approver adds label focalApproved:accessibility to the Epic in Github.
  • ID - (Kareen Deen). Documentation work is complete or N/A . Approver adds label focalApproved:id to the Epic in Github.
  • Performance - (Jared Anderson). Performance testing is complete with no high severity defects or N/A . Approver adds label focalApproved:performance to the Epic in Github.
  • Serviceability - (Don Bourne). Serviceability has been addressed.
  • STE - (Swati Kasundra). STE chart deck is complete or N/A . Approver adds label focalApproved:ste to the Epic in Github.
  • SVT - (Brian Hanczaryk- APS). SVT is complete or N/A . Approver adds label focalApproved:svt to the Epic in Github.

Ready for GA

1 week before Onboarding

  • No Stop Ship issues for the feature. (Epic owner / Feature owner / Release PM)
  • Ship Readiness Review and Release Notes completed (Epic owner / Feature owner / Release PM)
  • Github Epic and Epic's issues are closed / complete. All PRs are committed to the master branch. (Epic owner / Feature owner / Backlog Subtribe PM)

Other deliverbles

  • OL Guides - (Yee-Kang Chang). Assessment for OL Guides is complete or N/A.
  • WDT - (Leonard Theivendra). WDT work complete or N/A.
  • Blog - (Laura Cowen) Blog article writeup (Epic owner / Feature owner / Laura Cowen)
@donbourne donbourne added Epic Used to track Feature Epics that are following the UFO process team:Lumberjack labels Aug 27, 2020
@donbourne donbourne added the In Progress Items that are in active development. label Oct 4, 2020
@cthigh
Copy link

cthigh commented Dec 7, 2020

UFO review - Monday Dec 7, 2020.

Presenter: Prashanth Gunapalasingam
Moderator: Cindy High

Notes:

  1. Slide 2 - What do we do on non-OpenJ9? Will this work on any Java? OpenJ9 has Java cores. Hotspot has thread dumps.

    Suggestion: Rename enableJavacore to enableThreadDumps to be more generic for both OpenJ9 and Hotspot.

  2. Slide 2 - Add default behavior to the end user overview. Enabled by default. (default is described on slide 5)

  3. Slide 6 - All messages on this slide include “Java dump”. Should the attribute be “Java dump”. Can the message text be changed to say “Java thread dump” or something else? The message is from kernel (framework manager) and there are questions whether it should change but it would be nice to have consistency between message and configuration parameter. Take off-line.

  4. Page 7 - By default, this feature will not help the first problem of filling the file system. Jared noted that a concept was created to manage file system space using a rolling list of JavaCores (https://github.com/OpenLiberty/open-liberty/pull/3363/files ) that might help manage the first issue.

  5. Slide 18 - Confusion: appName, ServletName are bold but not new. Perhaps clarify that bold indicates changed use or behavior.

  6. Slide 19 - Comment - JDBCTiming - “enableJavacore=“true” is redundant since true is the default

  7. Slide 30 - The new message says “A series of java cores will not be created for the hung request…..”. Update the message using the final terminology for “thread dump” to have generic context for OpenJ9 and Hotspot. Resolve with issue Update local JDK wiring #3.

@pgunapal
Copy link
Member

pgunapal commented Dec 9, 2020

UFO has been updated with comments from the UFO socialization on Dec 7th and is ready for Design Review Approval.

@pgunapal pgunapal added target:21003-beta target:beta The Epic or Issue is targetted for the next beta labels Jan 27, 2021
@pgunapal pgunapal added target:ga The Epic is ready for focal approvals, after which it can GA. and removed target:beta The Epic or Issue is targetted for the next beta labels Mar 1, 2021
@pgunapal
Copy link
Member

pgunapal commented Mar 1, 2021

@samwatibm The new messages have been translated and will be checked in this week. May i get the globalization approval for this feature?

@pgunapal
Copy link
Member

pgunapal commented Mar 1, 2021

@steven1046 There are no UI-related changes/updates for this feature, as it does not have any UI. May I get the Accessibility approval for this feature?

@steven1046 steven1046 added the focalApproved:accessibility Focal Approval granted for Accessibility for the feature label Mar 1, 2021
@pgunapal
Copy link
Member

pgunapal commented Mar 1, 2021

@jhanders34 Performance testing is not needed for this feature, as we are providing an option to disable the creation of thread dumps for hung requests in the requestTiming-1.0 feature and will not regress performance. May I get the Performance approval for this feature?

@pgunapal
Copy link
Member

@samwatibm The translated messages are in master. May I have the Globalization approval for this feature?

@samwatibm samwatibm added the focalApproved:globalization Focal Approval granted for Globalization for the feature label Mar 16, 2021
@pgunapal
Copy link
Member

@hanczaryk It was decided in the UFO review that we don't need SVT for this feature, as the capability is easily tested and covered from the existing FAT tests. Would you mind reviewing the SVT approval for this feature?

@hanczaryk hanczaryk added the focalApproved:svt Focal Approval granted for SVT for the feature label Mar 17, 2021
@pgunapal
Copy link
Member

pgunapal commented Mar 18, 2021

@donbourne Serviceability Approval Comment :

UFO -- does the UFO identify the most likely problems customers will see and identify how the feature will enable them to diagnose and solve those problems without resorting to raising a PMR? Have these issues been addressed in the implementation?
- Yes, if the enableThreadDumps="false" is configured and a hung request is detected. a debug trace message will be logged, indicating that a series of java thread dumps will not be generated for that particular hung request. Also, if users incorrectly specify the enableThreadDumps attribute in the server.xml, a warning message will also be logged, and the default value will be used. Also, we will thoroughly document the behaviour of how the sub-element configurations will be override the root element configurations in the requestTiming-1.0.

Test and Demo -- As part of the serviceability process we're asking feature teams to test and analyze common problem paths for serviceability and demo those problem paths to someone not involved in the development of the feature (eg. L2, test team, or another development team).
a) What problem paths were tested and demonstrated?
   - If users specify the attribute name or value incorrectly, a warning message was logged and the default value was used.
   - If the attribute is not specified in server.xml, it will use the default value, and create thread dumps.
   - If both the root and sub elements have the same attribute, but different values, the sub-element value will be used.
b) Who did you demo to?
   - Observability squad
c) Do the people you demo'd to agree that the serviceability of the demonstrated problem scenarios is sufficient to avoid PMRs for any problems customers are likely to encounter, or that L2 should be able to quickly address those problems without need to engage L3?
   - Yes, they did. Only concern was that they found that the sub-element configuration overriding the root element was irregular, however, the other requestTiming configuration attributes behave similarly, hence, we decided to thoroughly document this.

SVT -- SVT team is often the first team to try new features and often encounters problems setting up and using them. Note that we're not expecting SVT to do full serviceability testing -- just to sign-off on the serviceability of the problem paths they encountered.
a) Who conducted SVT tests for this feature?
   - It was decided in the UFO review that we don't need SVT for this feature, as the capability is easily tested and covered from the existing FAT tests.
b) Do they agree that the serviceability of the problems they encountered is sufficient to avoid PMRs, or that L2 should be able to quickly address those problems without need to engage L3?
 - N/A

Which L2 / L3 queues will handle PMRs for this feature? Ensure they are present in the contact reference file and in the queue contact summary, and that the respective L2/L3 teams know they are supporting it. Ask Don Bourne if you need links or more info.
- WAS L3: Logging

@donbourne donbourne added the focalApproved:serviceability Focal Approval granted for Serviceability for the feature label Mar 18, 2021
@skasund
Copy link

skasund commented Mar 23, 2021

L2 has requested a live STE presentation for this feature. The STE templates can be found at the links below. You can use either one to create the education.

Slide Template: https://ibm.box.com/s/1an42g7zdgmaj84w7dft0indqfgi8ffm

Github Template: https://pages.github.ibm.com/WASL3/site/STE/about

Please upload the completed slides to the same 'STE Archive' BOX folder or provide me the Github link, and I will provide the STE approval. The live presentation can be scheduled with the L2 team at a later date. Thanks!

@chirp1
Copy link
Contributor

chirp1 commented Mar 24, 2021

Info for the the ID issue at OpenLiberty/docs#3764 contains the information needed to write the documentations. Approving.

@chirp1 chirp1 added the focalApproved:id Focal Approval granted for ID for the feature label Mar 24, 2021
@pgunapal
Copy link
Member

@skasund The completed STE slides are uploaded to the STE Archive BOX folder.

@tevans78 tevans78 added the focalApproved:demo Approval that a Demo has been scheduled label Mar 25, 2021
@skasund skasund added the focalApproved:ste Focal Approval granted for STE for the feature label Mar 25, 2021
@skasund
Copy link

skasund commented Mar 25, 2021

I've provided the STE approval. L2 will work with you to schedule a live session.

@samwatibm samwatibm added release:21004 and removed target:ga The Epic is ready for focal approvals, after which it can GA. labels Apr 8, 2021
@pgunapal
Copy link
Member

Closing as this feature has been released and GA'ed in 21.0.0.4 (https://openliberty.io/downloads/#runtime_releases)

@NottyCode NottyCode removed the In Progress Items that are in active development. label Jul 20, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Design Approved Epic Used to track Feature Epics that are following the UFO process focalApproved:accessibility Focal Approval granted for Accessibility for the feature focalApproved:demo Approval that a Demo has been scheduled focalApproved:fat Focal Approval granted for FAT for the feature focalApproved:globalization Focal Approval granted for Globalization for the feature focalApproved:id Focal Approval granted for ID for the feature focalApproved:performance Focal Approval granted for Performance for the feature focalApproved:serviceability Focal Approval granted for Serviceability for the feature focalApproved:ste Focal Approval granted for STE for the feature focalApproved:svt Focal Approval granted for SVT for the feature ID Required release:21003-beta release:21004 target:21003-beta target:21004 team:Lumberjack
Projects
None yet
Development

No branches or pull requests