-
Notifications
You must be signed in to change notification settings - Fork 27
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Implement events acknowledgement #13
Comments
replaces Particular/ServiceControl#43 |
The more I think about this, the more this feature does not make sense to me In the following use case or in any other use case, where the indicators can self correct, for example: Also the top 10 events on the dashboard, shows that heartbeats got restored at xx:yy:zz. So Opie has context for both the failure and the restored event. If they happen close enough, he'll still see it. I'd like to downvote this feature for the minimum viable product or what am I missing here? @andreasohlund @johnsimons @dannycohen - thoughts? |
IMO - because it was down at 16:04:33 From a burden perspective - I'd say its the same as having emails sent and marked as read or deleted. Same applies to custom check: if there was a change, a custom check failed and then returned to operate, it may be buried well below the 10 most recent events. What then ? how would Opie know about it ? (by actively looking in the events list ? that will not happen...) |
So it seems like we need to classify the events that Opie should need to Or should we mark events as "seen" as Opie scrolls through the list of On Wed, Oct 2, 2013 at 1:00 PM, Danny Cohen notifications@github.comwrote:
|
I'd say we need to ack everyhting we currently have:
In the future we may (/will) add events that are less critical, like SLR attempts trends, or configuration changes etc.
This is a "implicit ack", which may be good enough. I'd say keep it as simple as possible for Alpha and have an explicit check box to ack (like the Read / Unread marker in email apps: It expects you to open or click on an email to be marked as read). |
On Wed, Oct 2, 2013 at 1:11 PM, Danny Cohen notifications@github.comwrote:
|
Lets go with the explicit "checkbox and click ack", and get feedback on that. We can always change to "implicit ack" later or have it be configurable, based on event criticality, Opie's preferences settings etc. |
@dannycohen - I think your concern is valid. But I think, its more of a notification concern. I'd argue that it belongs with the neurelic plug in. i..e Heartbeats went down for an endpoint. Notify Opie if the condition persists for x time, etc. via email, sms, etc. Notify Opie's super, if Opie hasn't taken action, etc etc. So, I am still not seeing value in the ack. When Opie acts, let's say fixes the endpoint, the indicator will get the right status. In which case, it is fixed. Opie acting on it is ack enough, without forcing Opie to go click a ack button. |
For beta and RC, there will be no new relic plugin, so we need to provide something that allows SP to stand on its own. |
Asking Opie to click a button after the fact is still not helping Opie. The way I see it is this: |
Notifications will be provided by 3rd party tools.
That's the wrong description for the scenario.
|
Here's the alternative scenario, without the requirement to ack:
I agree that SMS / Email notifications are missing, but this will be provided via 3rd party integration. Still this scenario is not a valid one for SP. |
I think notifications and acks go hand in hand. Implementing something partially here without the other is not making sense to me. |
I'd also strongly argue that this is not a critical feature for a MVP. |
+1. Can we put this onhold until we get some more user feedback? Sent from my iPhone On 2 okt 2013, at 19:27, Indu Alagarsamy notifications@github.com wrote:
|
NP. Lets do that. |
@dannycohen - is this still targeted for beta? I thought we moved this out? |
@indualagarsamy - Given the schedule, I do not see how we can complete this for Beta. Moving to RC. We;ve had some discussions about this in the past, so I would like to make sure that we agree (or not) that from a requirements perspective, this makes sense to you. //CC @andreasohlund |
@indualagarsamy - |
@dannycohen - can we talk about this? |
I'll schedule a chat for this evening (your morning) |
@indualagarsamy - untill we manage to discuss this topic, please advise regarding the following issue: This is how my Custom Checks look right now:
So, how Can I, as Opie, make the Custom Check indicator green again, given that it is currently flashing red for failed custom checks that are obsolete and not fixable ? //CC @andreasohlund / @johnsimons |
Make sure the custom check id is the same accross different revisions On Friday, November 8, 2013, Danny Cohen wrote:
Regards |
That's not a realistic solution. Another similar scenario, is when a custom check needs to be retired (e.g. it checks something that always fails). |
@indualagarsamy / @johnsimons - I have these 3 failed messages which cannot succeed on retry due to compatibility issues. They are basically poison messages. How can I make them disappear ? |
We need a Archive error message command Sent from my iPhone
|
Seem like we need a manage custom checks page where you can delete old definitions? Sent from my iPhone
|
We can call it archive, hide, "mark as obsolete" etc.
Same as for error messages. |
I see multiple requirements in play here. How about:
|
Closed and replaced by per-indicator GH issue using the terminology proposed by @andreasohlund in #13 (comment) |
As Opie, I want to view alerts details and mark an event as "acknowledged" after I took corrective actions.
Visualizations:
Notes:
Demo / Acceptance tests:
Case 1:
Case 2:
The text was updated successfully, but these errors were encountered: