-
Notifications
You must be signed in to change notification settings - Fork 18
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Signed-off-by: Bartlomiej Plotka <bwplotka@gmail.com>
- Loading branch information
Showing
18 changed files
with
4,916 additions
and
602 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,5 @@ | ||
# Red Hat Observability Service | ||
|
||
Status: Open to internal Red Hat Teams Only. | ||
|
||
TBD |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,3 @@ | ||
# RHOBS: Telemeter | ||
|
||
TBD |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,61 @@ | ||
# Thanos | ||
|
||
> In progress. | ||
Thanos is a horizontally scalable, multi-tenant monitoring system in a form of distributed time series database that supports Prometheus data format. | ||
|
||
### Official Documentation | ||
|
||
https://thanos.io/tip/thanos/getting-started.md | ||
|
||
### APIs | ||
|
||
* Querying: Prometheus APIs, Remote Read | ||
* Series: Prometheus APIs, gRPC SeriesAPI | ||
* Metric Metadata: Prometheus API, gRPC MetricMetadataAPI | ||
* Rules, Alerts: Prometheus API, gRPC RulesAPI | ||
* Targets: Prometheus API, gRPC TargetsAPI | ||
* Exemplars: Prometheus API, gRPC ExemplarsAPI | ||
* Receiving: Prometheus Remote Write | ||
|
||
### Tutorials | ||
|
||
https://katacoda.com/thanos | ||
|
||
### Notable Talks/Blog Posts | ||
|
||
* 12.2020: [Absorbing Thanos Infinite Powers for Multi-Cluster Telemetry](https://www.youtube.com/watch?v=6Nx2BFyr7qQ) | ||
* 12.2020: [Turn It Up to a Million: Ingesting Millions of Metrics with Thanos Receive](https://www.youtube.com/watch?v=5MJqdJq41Ms) | ||
* 02.2019: [FOSDEM + demo](https://fosdem.org/2019/schedule/event/thanos_transforming_prometheus_to_a_global_scale_in_a_seven_simple_steps/) | ||
* 03.2019: [Alibaba Cloud user story](https://www.youtube.com/watch?v=ZS6zMksfipc) | ||
* [CloudNative Deep Dive](https://www.youtube.com/watch?v=qQN0N14HXPM) | ||
* [CloudNative Intro](https://www.youtube.com/watch?v=m0JgWlTc60Q) | ||
* [Prometheus in Practice: HA with Thanos](https://www.slideshare.net/ThomasRiley45/prometheus-in-practice-high-availability-with-thanos-devopsdays-edinburgh-2019) | ||
|
||
* [Banzai Cloud user story](https://banzaicloud.com/blog/multi-cluster-monitoring/) | ||
|
||
### Bug Trackers | ||
|
||
https://github.com/thanos-io/thanos/issues | ||
|
||
### Communication Channels | ||
|
||
The CNCF Slack workspace's ([join here](https://cloud-native.slack.com/messages/CHY2THYUU)) channels: | ||
|
||
* `#thanos` for user related things. | ||
* `#thanos-dev` for developer related things. | ||
|
||
### Proposal Process | ||
|
||
https://thanos.io/tip/contributing/contributing.md/#adding-new-features--components | ||
|
||
### Our Usage | ||
|
||
We use Thanos in many places within Red Hat, notably: | ||
|
||
* In [Prometheus Operator (sidecar)](prometheusOp.md) | ||
* In Openshift Platform Monitoring (PM) (see [CMO](openshiftcmo.md)) | ||
* In Openshift User Workload Monitoring (UWM) | ||
* In [RHOBS](RHOBS) (so [Observatorium](observatorium.md)) | ||
|
||
### Maintainers | ||
|
||
https://thanos.io/tip/thanos/maintainers.md/#core-maintainers-of-this-repository |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +1,7 @@ | ||
# Accepted | ||
|
||
> In progress. | ||
This is a list of accepted proposals. This means proposal was accepted, but not yet implemented. | ||
|
||
## Internal Accepted Proposals | ||
|
||
* ... |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,112 @@ | ||
# 2021-06: Proposal Process | ||
|
||
* **Owners:**: | ||
* [`@bwplotka`](https://github.com/bwplotka) | ||
|
||
* **Other docs:** | ||
* [KEP Process](https://github.com/kubernetes/enhancements/blob/master/keps/README.md) | ||
* [Observability Team Process (Internal)](https://docs.google.com/document/d/1eojXStPdq1hYwv36pjE-vKR1q3dlBbpIx5w_L_v2gNo/edit) | ||
|
||
> TL;DR: We would like to propose an improved, official proposal process for Monitoring Group that clearly states when, where and how to create proposal/enhancement/design documents. | ||
## Why | ||
|
||
More extensive architectural, process, or feature decisions are hard to explain, understand and discuss. It takes a lot of time to describe the idea, to motivate interested parties to review it, give feedback and approve. That's why it is essential to streamline the proposal process. | ||
|
||
Given that we work in highly distributed teams and work with multiple communities, we need to allow asynchronous discussions. This means it's essential to structure the talks into shared documents. Persisting in those decisions, once approved or rejected, is equally important, allowing us to understand previous motivations. | ||
|
||
There is a common saying [`"I've just been around long enough to know where the bodies are buried"`](https://twitter.com/AlexJonesax/status/1400103567822835714). We want to ensure the team related knowledge is accessible to everyone, every day, no matter if the team member is new or part of the team for ten years. | ||
|
||
### Pitfalls of the current solution | ||
|
||
Currently, the Observability Platform team have the process defined [here (internal)](https://docs.google.com/document/d/1eojXStPdq1hYwv36pjE-vKR1q3dlBbpIx5w_L_v2gNo/edit#heading=h.kpdg1wrd3pcc), whereas the In-Cluster part were not defining any official process ([as per here (internal)](https://docs.google.com/document/d/1vbDGcjMjJMTIWcua5Keajla9FzexjLKmVk7zoUc0_MI/edit#heading=h.n0ac5lllvh13)). | ||
|
||
In practice, both teams had somehow similar flow: | ||
|
||
* For upstream: Follow the upstream project's contributing guide, e.g Thanos | ||
* For downstream: | ||
* Depending on the size: | ||
* Small features can be proposed during the bi-weekly team-sync or directly in Slack. | ||
* If the team can reach consensus in this time, then document the decision somewhere written, e.g. an email, Slack message to which everyone can add an emoji reaction, etc. | ||
* Add a JIRA ticket to plan this work. | ||
* Large features might need a design doc: | ||
1. Add a JIRA ticket for creating the design doc | ||
2. Create a new Google Doc in the team folder based on [this template](https://docs.google.com/document/d/1ddl_dLxjoIvWQuRgLdzL2Gd1EX1mkJQUZ-rgUh-T4d8/edit) | ||
3. Fill sections | ||
4. Announce it on the team mailing list and Slack channel | ||
5. Address comments / concerns 6 Define what "done" means for this proposal, i.e. what is the purpose of this design document: | ||
* Knowledge sharing / Brain dump: This kind of document may not need a thorough review or any official approval | ||
* Long term vision and Execution & Implementation: If approved (with LGTM comments, or in an approved section) by a majority of the team and no major concerns consider it approved. NOTE: The same applies to rejected proposals. | ||
1. If the document has no more offline comments and no consensus was reached, schedule a meeting with interested parties. | ||
2. When the document changes status, move it to the appropriate status folder in the design docs directory of the team folder. If an approved proposal concerns a component with its own directory, e.g. Telemeter, then create a shortcut to the proposal document in the component-specific directory. This helps us find design documents by topic and by status. | ||
|
||
It served us well, but it had the following issues (really similar to ones stated in [handbook proposal](202106-handbook.md#pitfalls-of-the-current-solution)): | ||
|
||
* Even if our Google Design docs organized in our team drive, those Google documents are not easily discoverable. | ||
* Existing Google doc-based documents are hard to consume. The formatting is widely different. Naming is inconsistent. | ||
* Document creation is rarely actionable. There is no review process, so the effort of creating a relevant document might be wasted, as the document is lost. This also leads to docs being in the half-completed state, demotivating readers to look at it. | ||
* It's hard to track previous discussions around proposals, who approved them (e.g. proposals). | ||
* It's not public, and it's hard to share good proposals with other external and internal teams. | ||
|
||
## Goals | ||
|
||
Goals and use cases for the solution as proposed in [How](#how): | ||
|
||
* Allow easy collaboration and decision making on design ideas. | ||
* Have a consistent design style that is readable and understandable. | ||
* Ensure design docs are discoverable for better awareness and knowledge sharing about past decisions. | ||
* Define a clear review and approval process. | ||
|
||
## Non-Goals | ||
|
||
* Define process for other documents (see [handbook proposal](202106-handbook.md#pitfalls-of-the-current-solution)) | ||
|
||
## How | ||
|
||
We want to propose an improved, official proposal process for Monitoring Group that clearly states *when, where and how* to create proposal/enhancement/design documents. | ||
|
||
Everything starts with a problem statement. It might be a missing functionality, confusing existing functionality or broken one. It might be an annoying process, performance or security issue (or potential one). | ||
|
||
### What Process to Follow? | ||
|
||
As defined in [handbook proposal](202106-handbook.md#pitfalls-of-the-current-solution), our Handbook should tell you that Handbook is meant to be an index for our team resources and linking point to other distributed projects we maintain or contribute to. | ||
|
||
First, we need to identify if the idea we have is something we can contribute to the upstream project, or it does not fit anywhere else, so we can leverage the [Handbok Proposal directory](..) and the [process](#handbook-proposal-process). See the below algorithm to find it out: | ||
|
||
![where](../../assets/proposal-where.png) | ||
|
||
[Internal Team Drive for Public and Confidential Proposals](https://drive.google.com/drive/folders/1WGqC3gMCxIQlrnjDUYfNUTPYYRI5C) | ||
|
||
### Handbook Proposal Process | ||
|
||
If there is no problem, there is no need for changing anything, no need for a proposal. This might feel trivial, but we should first ask ourselves this question before even thinking about writing a proposal. | ||
|
||
It takes time to propose an idea, find consensus and implement more significant concepts, so let's not waste time before it's worth it. But, unfortunately, even good ideas sometimes have to wait for a good moment to discuss them. | ||
|
||
Let's assume the idea sounds interesting to you; what to do next, where to propose it? How to review it? Follow the algorithm below: | ||
|
||
![where](../../assets/proposal-how.png) | ||
|
||
> Note: It's totally ok to reject a proposal if a team member feels the idea is wrong. It's better to explicitly oppose it than to ignore it and leave it in limbo. | ||
> NOTE: We would love to host Logging and Tracing Teams if they choose to follow our process, but we don't want to enforce it. We are happy to extend this process from the Monitoring Group handbook to Observability Group. Still, it has to grow organically (if the Logging, Tracing team will see the value of joining us here). | ||
## Alternatives | ||
|
||
1. Organize Team Google Drive with all Google docs we have. | ||
|
||
Pros: | ||
* Great for initial collaboration | ||
|
||
Cons: | ||
* Inconsistent format | ||
* Hard to track approvers | ||
* Never know when the doc is "completed." | ||
* Hard to maintain over time | ||
* Hard to share and reuse outside | ||
|
||
## Action Plan | ||
|
||
* [ ] Move existing up-to-date public design docs over to the Handbook (deadline: End of July). | ||
* TIP: You can use [Google Chrome Plugin](https://workspace.google.com/marketplace/app/docs_to_markdown/700168918607) to convert Google Doc into markdown easily. | ||
* [ ] Propose a similar process to projects that do not have it. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,7 @@ | ||
# Done | ||
|
||
This is a list of implemented proposals. | ||
|
||
## Internal Implemented Proposals | ||
|
||
* ... |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,9 @@ | ||
# Rejected | ||
|
||
This is a list of rejected proposals. | ||
|
||
> NOTE: This does not mean we can return to them and accept! | ||
## Internal Rejected Proposals | ||
|
||
* ... |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.