From 495d9fa5d6b60bc627e984efbfbb6ff7d995e0cf Mon Sep 17 00:00:00 2001
From: Nicholas Kumia <85196563+nickumia-reisys@users.noreply.github.com>
Date: Mon, 20 Mar 2023 17:09:27 -0400
Subject: [PATCH 1/2] docs: update o&m checklists

---
 .github/ISSUE_TEMPLATE/o-and-m.md | 48 +++++++++++++++++++------------
 1 file changed, 30 insertions(+), 18 deletions(-)

diff --git a/.github/ISSUE_TEMPLATE/o-and-m.md b/.github/ISSUE_TEMPLATE/o-and-m.md
index 4a7257e8e..4c33a5a5d 100644
--- a/.github/ISSUE_TEMPLATE/o-and-m.md
+++ b/.github/ISSUE_TEMPLATE/o-and-m.md
@@ -14,21 +14,33 @@ As part of day-to-day operation of Data.gov, there are many [Operation and Maint
 
 ## Acceptance criteria
 
-You are responsible for all [O&M responsibilities](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities) this week. We've highlighted a few so they're not forgotten.
-
-| Task | Friday | Monday | Tuesday | Wednesday | Thursday | Friday | Monday | Tuesday | Wednesday | Thursday | Weekly/Monthly |
-|---------------------------|-----------------|-----------------|-----------------|-----------------|-----------------|-----------------|-----------------|-----------------|-----------------|-----------------|-----------------|
-| Check Deployments | | | | | | | | | | | ➖ |
-| Check Restarts | | | | | | | | | | | ➖ |
-| Check [Snyk Scans](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities#automated-dependency-updates-ad-hoc-github-prs) | | | | | | | | | | | ➖ |
-| Check Catalog Auto Tasks | | | | | | | | | | | ➖ |
-| Check [Harvesting Emails](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities#harvest-job-report-daily-email-report) | | | | | | | | | | | ➖ |
-| [New Relic Alerts](https://alerts.newrelic.com/accounts/1601367/incidents) Triaged | | | | | | | | | | | ➖ |
-| Triage DMARC Report from Google | | | | | | | | | | | ➖ |
-| Check [Catalog Solr](https://github.com/GSA/data.gov/wiki/Operation-and-Maintenance-Responsibilities#solr) | ➖ | ➖ | ➖ | ➖ | ➖ | ➖ | ➖ | ➖ | ➖ | ➖ | |
-| [Audit Log](https://docs.google.com/spreadsheets/d/1z6lqmyNxC7s5MiTt9f6vT41IS2DLLJl4HwEqXvvft40/edit) [*AU-6*](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities#au-6-log-auditing) | | ➖ | ➖ | ➖ | ➖ | | ➖ | ➖ | ➖ | ➖ | ➖ |
-| [Catalog Dupe Check](https://github.com/GSA/data.gov/wiki/Operation-and-Maintenance-Responsibilities#duplicate-check) | ➖ | ➖ | ➖ | ➖ | ➖ | ➖ | ➖ | ➖ | ➖ | ➖ | |
-| [Invicti Scan](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities#netsparker-compliance-scan-report-from-isso) | ➖ | ➖ | ➖ | ➖ | ➖ | ➖ | ➖ | ➖ | ➖ | ➖ | |
-
-- [ ] Weekly [resources.data.gov link scan](https://app.circleci.com/pipelines/github/GSA/resources.data.gov?branch=main)
-- [ ] Finishing the shift: Log the [number of alerts](https://docs.google.com/spreadsheets/d/1u1hSUAQW6FWzphog122stfB6MB9Wiq0NROT3PeicRoM/edit#gid=939071144)
+You are responsible for all [O&M responsibilities](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities) this week. We've highlighted a few so they're not forgotten. You can copy each checklist into your daily report.
+
+```
+## Daily Checklist
+Check [Production States](https://github.com/GSA/data.gov/wiki/Automated-O&M-Tasks)
+- [ ] Catalog
+- [ ] Inventory
+
+Check [Snyk Scans](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities#automated-dependency-updates-ad-hoc-github-prs)
+- [ ] Catalog
+- [ ] Inventory
+
+Check [Harvesting Emails](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities#harvest-job-report-daily-email-report)
+- [ ] Catalog
+
+Other
+- [ ] [New Relic Alerts](https://alerts.newrelic.com/accounts/1601367/incidents) Triaged
+- [ ] Triage DMARC Report from Google
+```
+
+```
+## Weekly Checklist
+- [ ] [Audit Log](https://docs.google.com/spreadsheets/d/1z6lqmyNxC7s5MiTt9f6vT41IS2DLLJl4HwEqXvvft40/edit) (more info on [AU-3 and AU-6 Log auditing](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities#au-3-and-au-6-log-auditing))
+- [ ] Check [Catalog Solr](https://github.com/GSA/data.gov/wiki/Operation-and-Maintenance-Responsibilities#solr)
+- [ ] [Catalog Dupe Check](https://github.com/GSA/data.gov/wiki/Operation-and-Maintenance-Responsibilities#duplicate-check)
+- [ ] [Invicti Scan](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities#netsparker-compliance-scan-report-from-isso)
+- [ ] [resources.data.gov link scan](https://app.circleci.com/pipelines/github/GSA/resources.data.gov?branch=main)
+```
+
+Finishing the shift: Log the [number of alerts](https://docs.google.com/spreadsheets/d/1u1hSUAQW6FWzphog122stfB6MB9Wiq0NROT3PeicRoM/edit#gid=939071144)

From 2c3eaf68d9130ced51ea58b83066e2f56e129857 Mon Sep 17 00:00:00 2001
From: Tyler Burton
Date: Thu, 30 Mar 2023 10:04:05 -0500
Subject: [PATCH 2/2] add updated O&M checklist

---
 .github/ISSUE_TEMPLATE/o-and-m.md | 31 +++++++++++++++++--------------
 1 file changed, 17 insertions(+), 14 deletions(-)

diff --git a/.github/ISSUE_TEMPLATE/o-and-m.md b/.github/ISSUE_TEMPLATE/o-and-m.md
index 4c33a5a5d..96c3b6226 100644
--- a/.github/ISSUE_TEMPLATE/o-and-m.md
+++ b/.github/ISSUE_TEMPLATE/o-and-m.md
@@ -7,40 +7,43 @@ assignees: ''
 ---
 
 As part of day-to-day operation of Data.gov, there are many [Operation and Maintenance (O&M) responsibilities](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities). Instead of having the entire team watching notifications and risking some notifications slipping through the cracks, we have created an [O&M Triage role](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities#om-triage-rotation). One person on the team is assigned the Triage role which rotates each sprint. _This is not meant to be a 24/7 responsibility, only East Coast business hours. If you are unavailable, please note when you will be unavailable in Slack and ask for someone to take on the role for that time._
-
 ## Miscs
-- Watch for user email requests
-- Watch in [#datagov-alerts](https://gsa-tts.slack.com/archives/C4RGAM1Q8) and [Vulnerable dependency notifications (daily email reports)](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities#vulnerable-dependency-notifications-daily-email-reports) for critical alerts.
-
+- Watch for user email requests
+- Watch in [#datagov-alerts](https://gsa-tts.slack.com/archives/C4RGAM1Q8) and [Vulnerable dependency notifications (daily email reports)](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities#vulnerable-dependency-notifications-daily-email-reports) for critical alerts.
+- Monitor and improve [Data.gov O&M Dashboard](https://onenr.io/0LREMrzdrRa)
 ## Acceptance criteria
 
 You are responsible for all [O&M responsibilities](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities) this week. We've highlighted a few so they're not forgotten. You can copy each checklist into your daily report.
 
-```
 ## Daily Checklist
-Check [Production States](https://github.com/GSA/data.gov/wiki/Automated-O&M-Tasks)
-- [ ] Catalog
+Check Deployments
+- [ ] Catalog
+- [ ] Inventory
+
+Check Restarts
+- [ ] Catalog
 - [ ] Inventory
 
 Check [Snyk Scans](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities#automated-dependency-updates-ad-hoc-github-prs)
-- [ ] Catalog
+- [ ] Catalog
 - [ ] Inventory
 
+Check Catalog Auto Tasks
+> **Note**
+> You will need to update the chart values manually. Click the Action link in each issue and grab the values from `monitor task output` and `check runtime`.
+
+- [ ] [DB-Solr Sync](https://github.com/GSA/catalog.data.gov/issues/848)
+- [ ] [Tracking Update](https://github.com/GSA/catalog.data.gov/issues/847)
+
 Check [Harvesting Emails](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities#harvest-job-report-daily-email-report)
 - [ ] Catalog
 
 Other
 - [ ] [New Relic Alerts](https://alerts.newrelic.com/accounts/1601367/incidents) Triaged
 - [ ] Triage DMARC Report from Google
-```
 
-```
 ## Weekly Checklist
 - [ ] [Audit Log](https://docs.google.com/spreadsheets/d/1z6lqmyNxC7s5MiTt9f6vT41IS2DLLJl4HwEqXvvft40/edit) (more info on [AU-3 and AU-6 Log auditing](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities#au-3-and-au-6-log-auditing))
 - [ ] Check [Catalog Solr](https://github.com/GSA/data.gov/wiki/Operation-and-Maintenance-Responsibilities#solr)
 - [ ] [Catalog Dupe Check](https://github.com/GSA/data.gov/wiki/Operation-and-Maintenance-Responsibilities#duplicate-check)
 - [ ] [Invicti Scan](https://github.com/gsa/data.gov/wiki/Operation-and-Maintenance-Responsibilities#netsparker-compliance-scan-report-from-isso)
-- [ ] [resources.data.gov link scan](https://app.circleci.com/pipelines/github/GSA/resources.data.gov?branch=main)
-```
-
-Finishing the shift: Log the [number of alerts](https://docs.google.com/spreadsheets/d/1u1hSUAQW6FWzphog122stfB6MB9Wiq0NROT3PeicRoM/edit#gid=939071144)