Skip to content
This repository has been archived by the owner on Dec 4, 2024. It is now read-only.

Commit

Permalink
Document lack of guarantees with respect to the contents of diagnosti…
Browse files Browse the repository at this point in the history
  • Loading branch information
janisz committed May 10, 2019
1 parent 0e5e327 commit ebbcee3
Show file tree
Hide file tree
Showing 2 changed files with 11 additions and 2 deletions.
6 changes: 5 additions & 1 deletion pages/1.12/monitoring/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -92,9 +92,13 @@ You can sort system health by `systemd` unit. However, this search can bring up

The system health API relies on Mesos-DNS to know about all the cluster hosts. It finds these hosts by combining a query from `mesos.master` A records as well as `leader.mesos:5050/slaves` to get the complete list of hosts in the cluster. This system has a known bug, in which an agent will not show up in the list returned from `leader.mesos:5050/slaves` if the Mesos agent service is not healthy. This means the system health API will not show this host. If you experience this behavior it is most likely that your Mesos agent service on the missing host is unhealthy.

### Diagnostics bundle contents

The contents of the generated bundle are not stable over time and any internal or third party bundle analysis tooling should be programmed very defensively in this regard.

## Troubleshooting

If you have any problems, you can check if the diagnostics service is running by SSHing to the Mesos leading master and checking the `systemd` status of the diagnostics component (`dcos-d3t.service`).
If you have any problems, you can check if the diagnostics service is running by SSHing to the Mesos leading master and checking the `systemd` status of the diagnostics component (`dcos-disgnostics.service`).

[4]: https://www.freedesktop.org/wiki/Software/systemd/
[5]: http://erlang.org/doc/man/epmd.html
Expand Down
7 changes: 6 additions & 1 deletion pages/1.13/monitoring/index.md
Original file line number Diff line number Diff line change
Expand Up @@ -92,9 +92,14 @@ You can sort system health by `systemd` unit. However, this search can bring up

The system health API relies on Mesos-DNS to know about all the cluster hosts. It finds these hosts by combining a query from `mesos.master` A records as well as `leader.mesos:5050/slaves` to get the complete list of hosts in the cluster. This system has a known bug, in which an agent will not show up in the list returned from `leader.mesos:5050/slaves` if the Mesos agent service is not healthy. This means the system health API will not show this host. If you experience this behavior it is most likely that your Mesos agent service on the missing host is unhealthy.

### Diagnostics bundle contents

The contents of the generated bundle are not stable over time and any internal or third party bundle analysis tooling should be programmed very defensively in this regard.

## Troubleshooting

If you have any problems, you can check if the diagnostics service is running by SSHing to the Mesos leading master and checking the `systemd` status of the diagnostics component (`dcos-d3t.service`).
If you have any problems, you can check if the diagnostics service is running by SSHing to the Mesos leading master and checking the `systemd` status of the diagnostics component (`dcos-diagnostics.service`).


[4]: https://www.freedesktop.org/wiki/Software/systemd/
[5]: http://erlang.org/doc/man/epmd.html
Expand Down

0 comments on commit ebbcee3

Please sign in to comment.