forked from GoogleContainerTools/skaffold
-
Notifications
You must be signed in to change notification settings - Fork 1
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
skaffold trace wrapping of critical functions and skaffold trace expo…
…rters What is the problem being solved? Part of GoogleContainerTools#5756, adding opentelemetry trace information to skaffold commands. Added trace information to specific performance critical skaffold functions (identified in go/cloud-trace-skaffold). Also added 4 trace exporters - gcp-skaffold, gcp-adc, stdout, and jaeger. This PR uses env var based enabling/disabling for the trace for simplicity and to hide it from users directly. Why is this the best approach? Using opentelemetry tracing is the obvious choice as we use open telemetry libs for metrics and it is becoming the metrics/tracing standard. Using an env var in this PR and later integrating the flag setup was considered optimal as currently skaffold tracing will be used for benchmarking/bottleneck-identifying for select use cases while the user facing UX w/ jaeger, etc. is still being worked out. What other approaches did you consider? There was the possibility of building tracing directly into skaffold events but I think with the current wrapper setup in pkg/skaffold/instrumentation/trace.go (w/ the minimal code required) and the fact that many trace locations will not be event locations (eg: how long to hash a file, etc.) it makes sense to not integrate them. What side effects will this approach have? There shouldn't be any side effects w/ this approach as the default "off" for tracing and the minimal user visibility for now should mean that it used only for select use cases experimentally. I have done timing tests with the no-op/empty trace (SKAFFOLD_TRACE unset) and it does not change the performance of skaffold. What future work remains to be done? Future work includes wiring up a --trace flag through dev, build, deploy, etc. and working on how skaffold might be able to do distributed tracing w/ other tools (minikube, buildpacks, etc.). Additionally the ability to allow for more sporadic sampling (vs AlwaysSample) should be added. Some future work mentioned in PR review included: - OTEL_TRACES_EXPORTER=* support (vs SKAFFOLD_TRACE)
- Loading branch information
1 parent
f7289a5
commit 56f8fd4
Showing
36 changed files
with
1,052 additions
and
42 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,52 @@ | ||
### Example: Skaffold Command Tracing with Jaeger | ||
|
||
|
||
_**WARNING: Skaffold's trace functionality is experimental and may change without notice.**_ | ||
|
||
In this example: | ||
|
||
* Use Skaffold to deploy a local/remote Jaeger instance | ||
* Enable Skaffold tracing functionality to get trace information about skaffold `dev`, `build`, `deploy`, etc. timings | ||
* Send Skaffold trace information to Jaeger and have that information be visible in the Jaeger UI, | ||
|
||
In this example, we'll walk through enabling Skaffold trace information that can be used to explore performance bottlenecks and to get a more in depth view of user's local dev loop. | ||
|
||
_**WARNING: If you're running this on a cloud cluster, this example will create a service and expose a webserver. | ||
It's highly recommended that you only run this example on a local, private cluster like minikube or Kubernetes in Docker for Desktop.**_ | ||
|
||
#### Setting up Jaeger locally | ||
|
||
Use docker to start a local jaeger instance using the Jaeger project's [all-in-one docker setup](https://www.jaegertracing.io/docs/getting-started/#all-in-one): | ||
```bash | ||
docker run -d --name jaeger \ | ||
-e COLLECTOR_ZIPKIN_HOST_PORT=:9411 \ | ||
-p 5775:5775/udp \ | ||
-p 6831:6831/udp \ | ||
-p 6832:6832/udp \ | ||
-p 5778:5778 \ | ||
-p 16686:16686 \ | ||
-p 14268:14268 \ | ||
-p 14250:14250 \ | ||
-p 9411:9411 \ | ||
jaegertracing/all-in-one:1.22 | ||
``` | ||
|
||
Now, in a different terminal, go to another Skaffold example (eg: microservices), enable SKAFFOLD_TRACE with Jaeger and start dev session there: | ||
```bash | ||
cd ../microservices | ||
export SKAFFOLD_TRACE=jaeger | ||
skaffold dev | ||
``` | ||
|
||
Now go to the Jaeger UI that Skaffold will port-forward to localhost at http://127.0.0.1:16686/ | ||
|
||
Select service:`skaffold-trace` in the left bar and then click `Find Traces` on the bottom of the left bar. | ||
|
||
From here you should be able to view all of the relevant traces | ||
|
||
#### Cleaning up | ||
To cleanup Jaeger all-in-one setup, run the following: | ||
``` | ||
docker kill jaeger # stops the running jaeger container | ||
docker rm jaeger #removes the container image | ||
``` |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,40 @@ | ||
### Example: Skaffold Command Tracing with Jaeger | ||
|
||
|
||
_**WARNING: Skaffold's trace functionality is experimental and may change without notice.**_ | ||
|
||
In this example: | ||
|
||
* Use Skaffold to deploy a local/remote Jaeger instance | ||
* Enable Skaffold tracing functionality to get trace information about skaffold `dev`, `build`, `deploy`, etc. timings | ||
* Send Skaffold trace information to Jaeger and have that information be visible in the Jaeger UI, | ||
|
||
In this example, we'll walk through enabling Skaffold trace information that can be used to explore performance bottlenecks and to get a more in depth view of user's local dev loop. | ||
|
||
_**WARNING: If you're running this on a cloud cluster, this example will create a service and expose a webserver. | ||
It's highly recommended that you only run this example on a local, private cluster like minikube or Kubernetes in Docker for Desktop.**_ | ||
|
||
#### Running the example on minikube | ||
|
||
Start minikube (or verify that minikube is up and running locally) | ||
```bash | ||
minikube start | ||
``` | ||
|
||
From this directory, run | ||
```bash | ||
skaffold dev --port-forward | ||
``` | ||
|
||
Now, in a different terminal, go to another Skaffold example (eg: microservices), enable SKAFFOLD_TRACE with Jaeger and start dev session there: | ||
```bash | ||
cd ../microservices | ||
export SKAFFOLD_TRACE=jaeger | ||
skaffold dev | ||
``` | ||
|
||
Now go to the Jaeger UI that Skaffold will port-forward to localhost at http://127.0.0.1:16686/ | ||
|
||
Select service:`skaffold-trace` in the left bar and then click `Find Traces` on the bottom of the left bar. | ||
|
||
From here you should be able to view all of the relevant traces |
168 changes: 168 additions & 0 deletions
168
integration/examples/jaeger-skaffold-trace/jaeger-all-in-one-template.yaml
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,168 @@ | ||
# Copyright 2017-2019 The Jaeger Authors | ||
# Copyright 2021 The Skaffold Authors | ||
|
||
# Licensed under the Apache License, Version 2.0 (the "License"); | ||
# you may not use this file except in compliance with the License. | ||
# You may obtain a copy of the License at | ||
|
||
# http://www.apache.org/licenses/LICENSE-2.0 | ||
|
||
# Unless required by applicable law or agreed to in writing, software | ||
# distributed under the License is distributed on an "AS IS" BASIS, | ||
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. | ||
# See the License for the specific language governing permissions and | ||
# limitations under the License. | ||
|
||
|
||
# adapted from https://github.com/jaegertracing/jaeger-kubernetes/blob/master/all-in-one/jaeger-all-in-one-template.yml | ||
# changes made include: | ||
# - updating Deployment to correct apiVersion for newer k8s API - extensions/v1beta1 -> apps/v1 | ||
# - fixing deployment to have required `selector` field in newer k8s Deployment API | ||
# - changed Service - query-http to not use port 80 -> port 16686 which is the export local jaeger port (vs web hosted) | ||
apiVersion: v1 | ||
kind: List | ||
items: | ||
- apiVersion: apps/v1 | ||
kind: Deployment | ||
metadata: | ||
name: jaeger | ||
labels: | ||
app: jaeger | ||
app.kubernetes.io/name: jaeger | ||
app.kubernetes.io/component: all-in-one | ||
spec: | ||
replicas: 1 | ||
strategy: | ||
type: Recreate | ||
selector: | ||
matchLabels: | ||
app: jaeger | ||
app.kubernetes.io/name: jaeger | ||
app.kubernetes.io/component: all-in-one | ||
template: | ||
metadata: | ||
labels: | ||
app: jaeger | ||
app.kubernetes.io/name: jaeger | ||
app.kubernetes.io/component: all-in-one | ||
annotations: | ||
prometheus.io/scrape: "true" | ||
prometheus.io/port: "16686" | ||
spec: | ||
containers: | ||
- env: | ||
- name: COLLECTOR_ZIPKIN_HTTP_PORT | ||
value: "9411" | ||
image: jaegertracing/all-in-one | ||
name: jaeger | ||
ports: | ||
- containerPort: 5775 | ||
protocol: UDP | ||
- containerPort: 6831 | ||
protocol: UDP | ||
- containerPort: 6832 | ||
protocol: UDP | ||
- containerPort: 5778 | ||
protocol: TCP | ||
- containerPort: 16686 | ||
protocol: TCP | ||
- containerPort: 9411 | ||
protocol: TCP | ||
readinessProbe: | ||
httpGet: | ||
path: "/" | ||
port: 14269 | ||
initialDelaySeconds: 5 | ||
- apiVersion: v1 | ||
kind: Service | ||
metadata: | ||
name: jaeger-query | ||
labels: | ||
app: jaeger | ||
app.kubernetes.io/name: jaeger | ||
app.kubernetes.io/component: query | ||
spec: | ||
ports: | ||
- name: query-http | ||
port: 16686 | ||
protocol: TCP | ||
targetPort: 16686 | ||
selector: | ||
app.kubernetes.io/name: jaeger | ||
app.kubernetes.io/component: all-in-one | ||
type: LoadBalancer | ||
- apiVersion: v1 | ||
kind: Service | ||
metadata: | ||
name: jaeger-collector | ||
labels: | ||
app: jaeger | ||
app.kubernetes.io/name: jaeger | ||
app.kubernetes.io/component: collector | ||
spec: | ||
ports: | ||
- name: jaeger-collector-tchannel | ||
port: 14267 | ||
protocol: TCP | ||
targetPort: 14267 | ||
- name: jaeger-collector-http | ||
port: 14268 | ||
protocol: TCP | ||
targetPort: 14268 | ||
- name: jaeger-collector-zipkin | ||
port: 9411 | ||
protocol: TCP | ||
targetPort: 9411 | ||
selector: | ||
app.kubernetes.io/name: jaeger | ||
app.kubernetes.io/component: all-in-one | ||
type: ClusterIP | ||
- apiVersion: v1 | ||
kind: Service | ||
metadata: | ||
name: jaeger-agent | ||
labels: | ||
app: jaeger | ||
app.kubernetes.io/name: jaeger | ||
app.kubernetes.io/component: agent | ||
spec: | ||
ports: | ||
- name: agent-zipkin-thrift | ||
port: 5775 | ||
protocol: UDP | ||
targetPort: 5775 | ||
- name: agent-compact | ||
port: 6831 | ||
protocol: UDP | ||
targetPort: 6831 | ||
- name: agent-binary | ||
port: 6832 | ||
protocol: UDP | ||
targetPort: 6832 | ||
- name: agent-configs | ||
port: 5778 | ||
protocol: TCP | ||
targetPort: 5778 | ||
clusterIP: None | ||
selector: | ||
app.kubernetes.io/name: jaeger | ||
app.kubernetes.io/component: all-in-one | ||
- apiVersion: v1 | ||
kind: Service | ||
metadata: | ||
name: zipkin | ||
labels: | ||
app: jaeger | ||
app.kubernetes.io/name: jaeger | ||
app.kubernetes.io/component: zipkin | ||
spec: | ||
ports: | ||
- name: jaeger-collector-zipkin | ||
port: 9411 | ||
protocol: TCP | ||
targetPort: 9411 | ||
clusterIP: None | ||
selector: | ||
app.kubernetes.io/name: jaeger | ||
app.kubernetes.io/component: all-in-one | ||
|
Oops, something went wrong.