CP-39894: Move most of the daemon management code to the service module and make pid locations explicit #4720

psafont · 2022-05-24T09:33:50Z

The PR is better reviewed commit-by-commit. The biggest change (10c0291) should only move around the code and only modify the callers' side.

Xenops and the daemons until now have used 2 ways of communicating the pid to the former: a xenstore key and optionally a file. We want to move to a model where we can use only a pidfile for some daemons (SWTPM, mainly).

Previously the path pidfile was implicit and the code to fetch it was shared among all the daemons meaning that even if the daemon always provided a pidfile, their clients always had to take into account the case where there was no file to process.

These changes force all the damon modules to declare the locations where the pidfile is expected to be, and only expose the pidfile in the way the daemon actually uses it.

Having all this logic in a separate module also help us to maintain an interface and not allow clients to access all the internal workings, facilitating future refactors (on how to wait for a daemon, for example)

vTPM suite: SR 170394 green
ring3 BVT+BST: SR170395 green

lindig · 2022-05-24T10:04:29Z

ocaml/xenopsd/lib/xenops_utils.ml

+    let path x = "unix:" ^ x
+
+    let rm x =
+      let dbg = debug "error cleaning unix socket %s: %s" x in


I'd use "removing" as cleaning implies something like flushing.

ocaml/xenopsd/xc/service.ml

lindig · 2022-05-24T10:27:40Z

ocaml/xenopsd/xc/service.ml

+    | None ->
+        false
+    | Some p -> (
+      try Unix.kill p 0 ; (* This checks the existence of pid p *)


Is this guaranteed behavior? I understand it works with the kill(1) command but does this work with the system call as well?

The man page for kill(2) is missing from centos7 hosts (and CH), but on my dev machine seems to confirm this behaviour:

If sig is 0, then no signal is sent, but existence and permission checks are still performed; this can be used to check for the existence of a process ID or process group ID that the caller is permitted to signal.

psafont · 2022-05-24T16:09:49Z

Sorry, I missed a lot of commits here

robhoes

This generally looks good to me, and a useful exercise to consolidate the code for all of these mechanisms. A lot of churn though, so it needs careful testing, including upgrade scenarios with running VMs on the host.

edwintorok

Reviewing this PR on github is next to impossible: github won't show whether all the moved code is the same or not.
Locally with git I think it is possible with these git config settings:

[diff]
    algorithm = histogram
    colorMoved = dimmed_zebra
    renames = copies

Then identically moved lines will be kind of greyish (i.e. easy to visually filter out on a black background as they're barely visible, but still there for context if needed).

And then use something like this (after having added paus as a remote and fetched it), ignoring whitespace is a must otherwise indentation changes won't detect the moved code as identical:
git log --reverse --color-moved-ws=ignore-all-space -w -p origin/feature/vtpm..psafont/private/paus/pid-wait

edwintorok · 2022-06-27T17:06:52Z

I think some of the code moves got mixed together with also reformatting the moved code, e.g. git shows the vnc port path and pid path lines as having changed even if they're functionally the same.

Not sure whether we want all the arg building code moved out of device, that seems to belong there (it tells you how to launch the device, and is specific to it, e.g. varstored and qemu) and results in a lot of code churn.

I think we may be overcomplicating service launch detection here: the usual way a Unix process signals that it is "ready" is to open all sockets and everything it needs, fork off a child to run the actual daemon and then exit the parent, and this exit is detected by systemd to transition it from starting to started state. In more complex situations sd_notify can be used (though read the manpage there is also an sd_notify_barrier to avoid race conditions with the former, because systemd may just ignore the former completely and still end up with launch race conditions).
Also looking at the various places in the code that create the pidfile or the xenstore entry they might be created too soon (i.e. the parent can still fail for other reasons, e.g. wrong cmdline args).

I haven't checked whether all our processes fork off a child, but perhaps that'd be a lot simpler way: no need to wait for xenstore or a pidfile, just wait for the forked process from forkexecd to exit successfully, which already has a function and would be a single mechanism instead of two. We could also delete quite a lot of this code then.

If not all processes would support this "forking" detection method of launch we could perhaps add a few small patches implementing that to the daemons, which overall would be less complex code to maintain. (There might be a performance justification for avoiding the extra fork, especially in a PV Dom0 forking is expensive as it triggers a lot page map copies, but I don't think the added complexity here would justify the tiny perf advantage we might have by avoiding the fork, setting up the inotify is probably not exactly free either.)
Although of course introducing a fork into a code that never forked before can also lead to bugs, e.g. if you initialized some state in the parent, and you forked that state may not necessarily be valid in the child (especialy if it is related to threads, you can't fork if you already created threads, etc.)

edwintorok · 2022-06-27T17:39:53Z

I can't spot anything particularly wrong with the changes here though, but it'll be difficult to merge this to master without also merging the vTPM code itself (since it depends on it).

edwintorok

We should probably merge the xenopsd/ changes of feature/vtpm to master fairly soon, we've diverged quite a lot and if xenopsd changes on master we'll have lots of conflicts to solve. The swtpm daemon is not yet part of rt-next and if we don't merge the xapi side of the code this code should be mostly inactive, so we should just do some regression testing on a branch with ocaml/xenopsd commits cherry-picked.

This way it's harder to for code elsewhere to get a reference to inactive / non-existent sandboxes. Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

This needed some bindings to move to xenops_utils or the PV_Vnc module in service. The change helps set up a baseline for the more complex changes around moving the arg building and reducing the surface of the service submodules. Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

This allows to remove the waiting function from the interface Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

This forces to move the wait_path function there as well as the efivar variables. They all need to be exposed in the interface so they can be used from device.ml. Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

This allows to hide the arg-building into the module, as well as removing wait_path from the interface. Not all starting code can be moved, as this creates a module dependency loop because of the use of the PCI module. This means that the usage of the xenstore key as a cancellable watch has to leak through the interface. Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

This forces the daemon modules to define which of the two paths they use to advertise availability. The translation is as follows: use_pidfile = false means that only xenstore is used use_pidfile = true means that both xenstore and the filesystem are used, depending on situation. Note that this exposes a current issue in swtpm: it is forced to define a xenstore path when it is not used at all and the filepath is hardcoded in the waiting function. Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

This allows for clearer meaning in the paths offered by the module, as well as removing the xenstore path by integrating the stop code in it and simplify building of the arguments, which hasn't been pulled into the module due to complexity. The Vusb code has been simplified, it was checking twice the pid of Qemu, this removes a race condition that could lead to a spurious error. Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

This allows for clearer logic regarding which conditions trigger each cleanup. Also quite a bit of code from the functions can be removed as the client modules take control of the code. Prevent error messages when trying to shutdown swtpm and varstored for a domain that does not need them so they are not running. Instead print an informational message about not bothering to shut it down. Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

This means the code will not try to read or write to xenstore for any operation regarding swtpm and the filesystem will be cleaned up as needed as well. Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

Also replaces the code to read the pidfile to use the common function in Unixext. Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

psafont · 2022-06-28T09:12:46Z

We should probably merge the xenopsd/ changes of feature/vtpm to master fairly soon

We'll probably need some retouches around the interface for this, or be conservative about the changes brought into master

psafont requested review from edwintorok and lindig May 24, 2022 09:33

lindig approved these changes May 24, 2022

View reviewed changes

psafont requested a review from lindig May 24, 2022 16:10

lindig approved these changes May 24, 2022

View reviewed changes

psafont force-pushed the private/paus/pid-wait branch from 47a1c7a to d269958 Compare May 25, 2022 08:44

robhoes approved these changes Jun 8, 2022

View reviewed changes

psafont force-pushed the private/paus/pid-wait branch from d269958 to a5c2ad0 Compare June 8, 2022 16:25

psafont marked this pull request as draft June 13, 2022 10:11

psafont force-pushed the private/paus/pid-wait branch 2 times, most recently from a78ddb8 to 5306b89 Compare June 21, 2022 13:18

psafont marked this pull request as ready for review June 21, 2022 13:25

edwintorok reviewed Jun 27, 2022

View reviewed changes

edwintorok approved these changes Jun 28, 2022

View reviewed changes

psafont added 11 commits June 28, 2022 10:11

xenops_sandbox: only get references by creating the sandbox

5ff4c2f

This way it's harder to for code elsewhere to get a reference to inactive / non-existent sandboxes. Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

xenopsd/xc/service: add licensing header

c516727

Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

CP-39894: Move all swtpm starting code to service module

500bbab

This allows to remove the waiting function from the interface Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

CP-39894: move all varstored starting code to service module

816d4ab

This forces to move the wait_path function there as well as the efivar variables. They all need to be exposed in the interface so they can be used from device.ml. Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

CP-39894: Add Pid.File and use it in Swtpm

584bf46

This means the code will not try to read or write to xenstore for any operation regarding swtpm and the filesystem will be cleaned up as needed as well. Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

CA-366479: Remove Qemu's pidfile on domain shutdown

6e4a3ca

Also replaces the code to read the pidfile to use the common function in Unixext. Signed-off-by: Pau Ruiz Safont <pau.safont@citrix.com>

psafont force-pushed the private/paus/pid-wait branch from 5306b89 to 6e4a3ca Compare June 28, 2022 09:11

psafont merged commit d0c9a18 into xapi-project:feature/vtpm Jun 30, 2022

psafont deleted the private/paus/pid-wait branch June 30, 2022 15:00

psafont mentioned this pull request Jul 1, 2022

CP-39894: move xenopsd's daemon modules from device to service #4744

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CP-39894: Move most of the daemon management code to the service module and make pid locations explicit #4720

CP-39894: Move most of the daemon management code to the service module and make pid locations explicit #4720

psafont commented May 24, 2022 •

edited

Loading

lindig May 24, 2022

lindig May 24, 2022

psafont May 24, 2022

psafont commented May 24, 2022

robhoes left a comment

edwintorok left a comment •

edited

Loading

edwintorok commented Jun 27, 2022 •

edited

Loading

edwintorok commented Jun 27, 2022

edwintorok left a comment

psafont commented Jun 28, 2022

CP-39894: Move most of the daemon management code to the service module and make pid locations explicit #4720

CP-39894: Move most of the daemon management code to the service module and make pid locations explicit #4720

Conversation

psafont commented May 24, 2022 • edited Loading

lindig May 24, 2022

Choose a reason for hiding this comment

lindig May 24, 2022

Choose a reason for hiding this comment

psafont May 24, 2022

Choose a reason for hiding this comment

psafont commented May 24, 2022

robhoes left a comment

Choose a reason for hiding this comment

edwintorok left a comment • edited Loading

Choose a reason for hiding this comment

edwintorok commented Jun 27, 2022 • edited Loading

edwintorok commented Jun 27, 2022

edwintorok left a comment

Choose a reason for hiding this comment

psafont commented Jun 28, 2022

psafont commented May 24, 2022 •

edited

Loading

edwintorok left a comment •

edited

Loading

edwintorok commented Jun 27, 2022 •

edited

Loading