Scrape related annotations only in deployment and not in services caused prometheus to ignore seldon analytics metric endpoints #1705

JulianBarr · 2020-04-17T07:51:00Z

Prometheus does service discovery with k8s annotations. However, it will check for annotations prometheus.io/scrape etc in service, I noticed that when creating Seldondeployment, seldon only add such annotations to deployment but not service. This caused prometheus to ignore the endpoint. I once did a test to manually create the deployment and service with those annotations configured for service and it will work. Am I missing anything?

My k8s is 1.14. Prometheus 9.7.4.

The text was updated successfully, but these errors were encountered:

ukclivecox · 2020-04-17T08:26:13Z

Are you using your own Prometheus config our ours?

Our config has

seldon-core/helm-charts/seldon-core-analytics/files/prometheus/prometheus-config.yaml

Lines 97 to 99 in 8610119

    
           - source_labels: [__meta_kubernetes_pod_container_port_name] 
        
             action: keep 
        
             regex: metrics(-.*)?

JulianBarr · 2020-04-17T09:19:22Z

I am using seldon's default, however I am using a version of helm-chart a few months back probably in Dec19. The configuration file doesn't have these three lines. I noticed that this is added in 16 Mar.

I am new to prometheus too and ignorant on its discovery mechanism. So I am trying to decipher these lines. Is it asking prometheus to look for pods with containers that expose a port with port name starting with metrics? So seldon operator actually registers ports (e.g. 8000) some where with name metrics?

ukclivecox · 2020-04-17T09:21:51Z

It will scrape containers in pods that have a port names "metrics*"

JulianBarr · 2020-04-17T09:27:50Z

got you. Seems promising. Thanks. I'll try that.

JulianBarr · 2020-04-17T10:12:34Z

Oops. I recheck my configuration. It does have the lines you mentioned above. I was using 1.0.3, but somehow I forgot.

My yaml configuration exported shows that seldon-container-engine does have a port with name metrics. However there's something fishy that the port 8000 appears twice, once with name and the other without a name. I don't know whether that may cause a problem.

  name: seldon-container-engine
  ports:
  - containerPort: 8000
     protocol: TCP
  - containerPort: 8000
    name: metrics
    protocol: TCP

ukclivecox · 2020-04-17T10:29:28Z

Which version of Seldon have you installed? Are you able to use 1.1.0?

JulianBarr · 2020-04-17T10:42:33Z

I'll try seldon-core-analytics 1.1.0. I am in a bank and we have many restrictions, so to try a new version, I may have to download newer prometheus images and bring it in. I'll try that later.

Besides that, anything else I could do to identify the problem?

ukclivecox · 2020-04-17T12:32:20Z

The way metrics is done has changed between 1.0.2 and 1.1.0
So with 1.0.2 the Java Seldon engine is used so there should be no "metrics" endpoint on your containers if you built them with the previous version of the python wrapper.

From 1.1.0 you can see the upgrade and difference to metrics discussed: https://docs.seldon.io/projects/seldon-core/en/v1.1.0/reference/upgrading.html

JulianBarr · 2020-04-20T07:34:37Z

@cliveseldon
Hi Clive,

Thanks.

1.1.0 is working, so the problem is again related to the Java engine.

I've got another issue with 1.1. I'll raise separately.

ukclivecox · 2020-04-20T07:38:55Z

The java engine would work as previously and expose the metrics itself.
Can we close this issue?

JulianBarr · 2020-04-20T08:13:05Z

OK.

JulianBarr added bug triage Needs to be triaged and prioritised accordingly labels Apr 17, 2020

JulianBarr mentioned this issue Apr 17, 2020

web client metrics not working #1685

Closed

ukclivecox removed the triage Needs to be triaged and prioritised accordingly label Apr 17, 2020

ukclivecox closed this as completed Apr 20, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Scrape related annotations only in deployment and not in services caused prometheus to ignore seldon analytics metric endpoints #1705

Scrape related annotations only in deployment and not in services caused prometheus to ignore seldon analytics metric endpoints #1705

JulianBarr commented Apr 17, 2020

ukclivecox commented Apr 17, 2020

JulianBarr commented Apr 17, 2020

ukclivecox commented Apr 17, 2020

JulianBarr commented Apr 17, 2020

JulianBarr commented Apr 17, 2020

ukclivecox commented Apr 17, 2020

JulianBarr commented Apr 17, 2020

ukclivecox commented Apr 17, 2020

JulianBarr commented Apr 20, 2020

ukclivecox commented Apr 20, 2020

JulianBarr commented Apr 20, 2020

Scrape related annotations only in deployment and not in services caused prometheus to ignore seldon analytics metric endpoints #1705

Scrape related annotations only in deployment and not in services caused prometheus to ignore seldon analytics metric endpoints #1705

Comments

JulianBarr commented Apr 17, 2020

ukclivecox commented Apr 17, 2020

JulianBarr commented Apr 17, 2020

ukclivecox commented Apr 17, 2020

JulianBarr commented Apr 17, 2020

JulianBarr commented Apr 17, 2020

ukclivecox commented Apr 17, 2020

JulianBarr commented Apr 17, 2020

ukclivecox commented Apr 17, 2020

JulianBarr commented Apr 20, 2020

ukclivecox commented Apr 20, 2020

JulianBarr commented Apr 20, 2020