Crash Analysis capability #1309

Closed
9 tasks done
seanvaleo opened this issue Feb 2, 2023 · 3 comments
Labels
enhancement New feature or request

Comments

Collaborator

seanvaleo commented Feb 2, 2023

Add the capability for AppScope to produce useful information following an application crash, whether or not the application was scoped.

What scenarios are we intending to support?

  • Daemon running in container ; Process crashes in same container [container pid provided]
  • Daemon running on host ; Process crashes on host [host pid provided]
  • Daemon running on host ; Process crashes in below container [host pid provided]
  • Daemon running in container ; Process crashes on host or in another container (requires --privileged where daemon is run)

How will the data be accessed?

  • From the file system (via the daemon)
  • From a network destination (via the daemon)

Main Components

check == merged

Collaborator Author

seanvaleo commented Feb 14, 2023

Todo

  • when attached to redis-server, we are unable to stop it without it exiting (resolved by replacing SIGSTOP with sleep 5)
  • empty lines in json output from daemon when writing to a tcp destination
  • conditional inclusion of signal info in "snapshot" file (when from daemon-received signal)
  • don't create a snapshot from SIGILL created by libscope/openssl
  • demo crash analysis (see below repro instructions)
  • update scope daemon integration test
  • top goes into background (resolved by replacing SIGSTOP with sleep 5)
  • execute three scenarios (described above)
  • we have new go modules after 1315. Take a look at snyk output w.r.t. these.
  • validate Go panic output after signal handling (panic does not raise a signal)
  • validate integ tests run on Ubuntu 18
  • update our license
  • test container pid in ubuntu 18, 20 & 22 (John blocked on daemon running in container below)
  • process name not formatted correctly in snapshot file
  • daemon running in container ; Process crashes in same container : doesn't generate a snapshot file (integ test)
    • error determining namespace
  • daemon running on host ; Process crashes in below container : error generating crash files
  • check for admin privileges when running scope daemon. return if not
  • resolve bug - hostname getting copied to host /etc directory
  • environment variable list contains lots of empty items in redis-server snapshot
  • can't get core file from a container when running the daemon on the host
  • we need to not exit when there's an error getting files
  • two signals received when we are attached to a process and send it one signal
  • what do we do about a crashing go app?
  • delete hostname file after copy from container
  • test with node and crash
  • add a suffix to snapshot files so that we support multiple signals
  • fix snapshotsigsegv unit test
  • test crashing java apps

Collaborator Author

seanvaleo commented Feb 15, 2023

Demo/Repro Instructions

Daemon running in container ; Process crashes in same container

On host:

docker run -it --rm --cap-add=SYS_ADMIN --privileged -v /home/ubuntu/jrc/appscope2:/opt/appscope -v /sys/kernel/debug:/sys/kernel/debug:ro ubuntu:20.04

In container:

./bin/linux/x86_64/scope daemon
top
./bin/linux/x86_64/scope attach --backtrace --coredump top
kill -s SIGSEGV `pidof top`

Expected results:
In the container, the following should exist in /tmp/appscope/<pidof top>/ :

  • core
  • info
  • cfg
  • backtrace
  • snapshot
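To confirm the artifacts landed, a small helper along these lines could be used (the `/tmp/appscope/<pid>/` layout and file names are taken from the expected results above; the function and script names are hypothetical, not part of AppScope):

```python
# Hypothetical helper to verify the crash artifacts listed above exist.
# The path layout /tmp/appscope/<pid>/ and the file names come from the
# expected results; nothing here is AppScope's own code.
import os

EXPECTED_FILES = ["core", "info", "cfg", "backtrace", "snapshot"]

def missing_artifacts(pid, base="/tmp/appscope"):
    """Return the expected crash files absent from base/<pid>/."""
    artifact_dir = os.path.join(base, str(pid))
    return [name for name in EXPECTED_FILES
            if not os.path.isfile(os.path.join(artifact_dir, name))]

if __name__ == "__main__":
    import sys
    pid = sys.argv[1] if len(sys.argv) > 1 else "0"
    missing = missing_artifacts(pid)
    print("all artifacts present" if not missing
          else "missing: %s" % ", ".join(missing))
```

For example, after the kill above, something like `python3 check_artifacts.py $(pidof top)` (hypothetical script name) should report all artifacts present.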

Daemon running on host ; Process crashes on host

On host:

sudo ./bin/linux/x86_64/scope daemon
top
sudo ./bin/linux/x86_64/scope attach --backtrace --coredump top
sudo kill -s SIGSEGV `pidof top`

Expected results:
On the host, the following should exist in /tmp/appscope/<pidof top>/ :

  • core
  • info
  • cfg
  • backtrace
  • snapshot

Daemon running on host ; Process crashes in below container

On host:

sudo ./bin/linux/x86_64/scope daemon
docker run --rm -it redis
sudo ./bin/linux/x86_64/scope attach --backtrace --coredump redis-server
sudo kill -s SIGSEGV `pidof redis-server`

Expected results:
On the host, the following should exist in /tmp/appscope/<pidof redis-server>/ :

  • core
  • info
  • cfg
  • backtrace
  • snapshot

@criblio criblio deleted a comment from jrcheli Feb 16, 2023
Collaborator Author

seanvaleo commented Feb 17, 2023

Top

We observe two signals being received (and thus two snapshots being created) for top after sending it a SIGILL or similar signal.

We think our behavior is correct in terms of signal handling (in the library).

Top behaves as follows:

  • Register a valid signal handler with sigaction to catch SIGILL/etc.
  • When a signal is received:
    • It prints the signal to the console.
    • It then registers a second (null) signal handler (supposedly to allow things like coredump to work).
    • It then raises the original signal again, for it to be caught by the second signal handler.
    • (It does not print the repeated signal to the console.)
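The handler sequence described above can be sketched in Python (a hypothetical illustration, not top's or AppScope's actual C code; SIGUSR1 stands in for SIGILL so the script survives, and a do-nothing Python handler stands in for top's null handler):

```python
# Sketch of top's double-handler pattern: catch the signal, install a
# second (do-nothing) handler, then re-raise the same signal at ourselves.
# Hypothetical illustration only; uses SIGUSR1 so the process is not killed.
import os
import signal

events = []

def second_handler(signum, frame):
    # The repeated signal lands here and is not reported, as top does.
    events.append("second handler caught signal %d" % signum)

def first_handler(signum, frame):
    events.append("first handler caught signal %d" % signum)
    # Swap in the second handler, then raise the original signal again
    # so that it is caught by the second handler.
    signal.signal(signum, second_handler)
    os.kill(os.getpid(), signum)

signal.signal(signal.SIGUSR1, first_handler)
os.kill(os.getpid(), signal.SIGUSR1)

# Give the interpreter a few bytecodes to run any pending handler.
for _ in range(100000):
    if len(events) == 2:
        break

print(events)
```

Run once, both handlers fire for a single externally sent signal, which mirrors why we see two signals (and two snapshots) for top.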

Since not all apps behave like top, we think it's best to act on both signals (rather than, for example, shutting off our signal handler after the first one).

Worth noting: We send a SIGABRT when the second signal is received with the null handler (which is why top shows that it also received SIGABRT).

Reference: the function sig_abexit() in the top source code.
