-
Notifications
You must be signed in to change notification settings - Fork 2
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Add troubleshooting guide to docs site based on user questions.
- Loading branch information
1 parent
f585459
commit 9028af0
Showing
3 changed files
with
80 additions
and
5 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,57 @@ | ||
# Troubleshooting | ||
|
||
For general K8s and Inspect sandbox debugging, see the [Debugging K8s | ||
Sandboxes](debugging-k8s-sandboxes.md) guide. | ||
|
||
## Capture Inspect `SANDBOX`-level logs | ||
|
||
A good starting point to most issues is to capture the output of the Python `logging` | ||
module at `SANDBOX` level. See the [`SANDBOX` log level | ||
section](debugging-k8s-sandboxes.md#sandbox-log-level). | ||
|
||
## View cluster events | ||
|
||
Certain cluster events may impact your eval, for example, a node failure. | ||
|
||
```sh | ||
kubectl get events --sort-by='.metadata.creationTimestamp' | ||
``` | ||
|
||
To also see timestamps: | ||
|
||
```sh | ||
kubectl get events --sort-by='.metadata.creationTimestamp' \ | ||
-o custom-columns=LastSeen:.lastTimestamp,Type:.type,Object:.involvedObject.name,Reason:.reason,Message:.message | ||
``` | ||
|
||
To filter to a particular release or pod, either pipe into `grep` or use the | ||
`--field-selector` flag: | ||
|
||
```sh | ||
kubectl get events --sort-by='.metadata.creationTimestamp' \ | ||
--field-selector involvedObject.name=agent-env-xxxxxxxx-default-0 | ||
``` | ||
|
||
Find the Pod name (including the random 8-character identifier) in the `SANDBOX`-level | ||
logs or the stack trace. | ||
|
||
To specify a specific namespace, use the `-n` flag. | ||
|
||
## I'm seeing "Helm uninstall failed" errors | ||
|
||
These are likely because the Helm chart was never installed. This typically happens if | ||
you cancel an eval, or an eval fails before a certain sample's Helm chart was installed. | ||
|
||
Check to see if any Helm releases were left behind: | ||
|
||
```sh | ||
helm list | ||
``` | ||
|
||
## I'm seeing "Handshake status 404 Not Found" errors from Pod operations | ||
|
||
This typically indicates that the Pod has been killed. This may be due to cluster issues | ||
(see how to view cluster events above), or because the eval had already failed and the | ||
Helm releases were uninstalled whilst some operations were queued or in flight. | ||
|
||
Check the `.json` or `.eval` log produced by Inspect to see the underlying error. |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters