
Query returns service (and checks still pass) after node was terminated #811

Closed
discordianfish opened this issue Mar 24, 2015 · 6 comments

@discordianfish (Contributor)

Hi,

I'm running a Consul cluster with 3 server nodes and several client nodes.
Some of the clients run Consul with a JSON service definition like this:

{
  "service": {
    "name": "infra-haproxy-stats",
    "port": 8000,
    "check": {
      "script": "curl -o /dev/null localhost:8000",
      "interval": "60s"
    }
  }
}

Now if I shut down such a node, the serfHealth check fails but the service check still passes:

[screenshot: the serfHealth check failing while the service's script check still shows passing]

And most importantly, Consul still returns those nodes in queries:

$ curl consul:8500/v1/catalog/service/infra-haproxy-stats | jq '.[]|select(.Address == "10.128.13.32")'
{
  "Node": "ip-10-128-13-32",
  "Address": "10.128.13.32",
  "ServiceID": "infra-haproxy-stats",
  "ServiceName": "infra-haproxy-stats",
  "ServiceTags": null,
  "ServiceAddress": "",
  "ServicePort": 8000
}

I'm not sure whether the still-passing service check is by design (the check never reported 'down', though I expected it to be marked as failed if no 'ok' arrives for some time), but at the least I expected Consul not to return unhealthy nodes.
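The same state is visible through the HTTP API; a minimal sketch, assuming the node name from the catalog output above (the output below is abbreviated and illustrative):

$ curl consul:8500/v1/health/node/ip-10-128-13-32
[
  {
    "Node": "ip-10-128-13-32",
    "CheckID": "serfHealth",
    "Name": "Serf Health Status",
    "Status": "critical"
  },
  {
    "Node": "ip-10-128-13-32",
    "CheckID": "service:infra-haproxy-stats",
    "Name": "Service 'infra-haproxy-stats' check",
    "Status": "passing",
    "ServiceID": "infra-haproxy-stats",
    "ServiceName": "infra-haproxy-stats"
  }
]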

discordianfish changed the title from "Query return service and checks still pass after node was terminated" to "Query returns service (and checks still pass) after node was terminated" on Mar 24, 2015
@grobie commented Mar 24, 2015

Do you use leave_on_terminate? For an intentional shutdown of a node, you probably want to make sure it also leaves the cluster.

For anything health-related, I believe you should use the Health API instead: https://consul.io/docs/agent/http/health.html
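For the graceful-shutdown part, a minimal agent-config sketch; leave_on_terminate is a standard agent option, and enabling it is the only change shown here:

{
  "leave_on_terminate": true
}

With this set, the agent performs a graceful leave when it receives SIGTERM, so the node is removed from the catalog rather than relying on failure detection to catch up.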

@pearkes (Contributor) commented Mar 24, 2015

I'll note that this is a known UI issue, and expected per the Consul API. The UI should special-case nodes that aren't responding to the serfHealth check and mark them as unreachable.

@ryanuber (Member)

This is expected behavior. The script/interval check runs locally on the agent, so if that node goes away, its result is never updated; that was indeed a design decision. This is where the serfHealth check smooths things over for you by quickly detecting the node failure and updating the catalog. As pointed out by @grobie, you will want to use the /v1/health endpoint to query for services in a passing state. The equivalent API call in your use case would have been curl consul:8500/v1/health/service/infra-haproxy-stats?passing.
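A minimal sketch of that query, assuming a local agent on port 8500; the address in the output is illustrative, standing in for a still-healthy instance:

# With ?passing set, the terminated node is filtered out because its
# serfHealth check is critical; only instances whose checks all pass
# are returned.
$ curl 'consul:8500/v1/health/service/infra-haproxy-stats?passing' | jq '.[].Node.Address'
"10.128.42.17"

The entry for 10.128.13.32 no longer appears, even though its service-level check still reads "passing", because serfHealth is included in the ?passing filter.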

@ryanuber (Member)

Created #813 for the UI issue; let's track that separately.

@discordianfish (Contributor, Author)

OK, got it. But is it also expected for the DNS API to return unhealthy nodes? I think I saw that happen, but I will verify.

@ryanuber (Member)

@discordianfish definitely not; the DNS interface should only return healthy results. Please do let us know if you see otherwise.
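A quick way to verify this, as a minimal sketch; it assumes the agent's DNS interface on its default port 8600:

# Unhealthy instances should be omitted from the answer section entirely.
$ dig @127.0.0.1 -p 8600 infra-haproxy-stats.service.consul +short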
