
Formalize update metadata #103

Closed
iliana opened this issue Jul 25, 2019 · 16 comments

@iliana
Contributor

iliana commented Jul 25, 2019

Currently updog is loading some very basic metadata (currently called manifest.json) and some hardcoded target names. In order to support multiple architectures, flavors, and update waves, we need to formalize what the metadata that sits between the crunchy TUF shell and the gooey partition data + migrations looks like.

@sam-aws
Contributor

sam-aws commented Aug 12, 2019

The most current example of what this metadata could include mentions objects containing

  • Flavour - e.g. "thar-aws-eks-1.13"
  • Status - e.g. "Active"
  • Latest - whether this is the latest update
  • Waves (Start time, stop time, lower & upper seed bounds)
  • Image - image name to download

Some of the more obvious things to think about:

  • Should the system architecture be in its own field, or could it be folded into the Flavour?
  • Should latest simply read true/false, or should it specify an exact version?
  • Should we have a separate Version field?
  • Depending on the Flavour, should Image be a list (e.g. of partitions)?
  • Aside from implicit rules between versions, we should probably have something to describe any particular migrations required for this update. (These could also just be listed in Images with a predictable naming scheme.)

@sam-aws
Contributor

sam-aws commented Aug 14, 2019

A loose example:

[
  {
    "flavor": "thar-aws-eks",
    "arch": "x86_64",
    "version": "1.13",
    "status": "ACTIVE",
    "max_version": "1.20",
    "waves": [
        { "start": "2019-10-06T15:00:00Z", "stop": "2019-10-06T23:00:00Z",
          "lower_bound": 0, "upper_bound": 858993459 },
        { "start": "2019-10-07T15:00:00Z", "stop": "2019-10-07T23:00:00Z",
          "lower_bound": 858993459, "upper_bound": 1717986918 }
    ],
    "images": [
        { "part": "boot", "target": "stuff-boot-thar-aws-eks-1.13-m1.20191006.img" },
        { "part": "root", "target": "stuff-boot-thar-aws-eks-1.13-m1.20191006.img" },
        { "part": "hash", "target": "stuff-boot-thar-aws-eks-1.13-m1.20191006.img" }
    ]
  }
]

Splitting out architecture and version from the "flavor" lets the client filter through a list of these faster. Similarly, with "images" we're likely to have a few distinct images required for an upgrade, so it makes sense to name them.

I like the wave example, although it would require making sure that the metadata-writer agrees with the clients about what the random seed range is, lest all clients end up in the first wave, for example. Maybe something like a "bound_max" field, telling clients what the full range is and letting them scale themselves in the event of a mismatch, avoids that. Or maybe it's just easier to set it as a Thar-wide constant and never change it :)

Specifying the latest version is a bit interesting. Is it enough to mark one of the updates in the list as "latest", or does an exact "max_version" need to be specified (or does "latest" + "version" solve that)? @iliana probably has TUF opinions about this.
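The rescaling idea can be sketched in a few lines of std-only Rust. This is purely illustrative, not updog code: `CLIENT_SEED_MAX` and `scale_seed` are hypothetical names, and the assumption is that the client knows its own seed range and learns the metadata's range (the proposed "bound_max") at update time.

```rust
// Hypothetical sketch of the rescaling idea: if the client's notion of the
// full seed range disagrees with the metadata's "bound_max", the client can
// rescale its own seed into the metadata's range instead of every client
// landing in the first wave.
const CLIENT_SEED_MAX: u64 = 2048;

fn scale_seed(seed: u64, metadata_bound_max: u64) -> u64 {
    // Map seed from [0, CLIENT_SEED_MAX) onto [0, metadata_bound_max).
    seed * metadata_bound_max / CLIENT_SEED_MAX
}

fn main() {
    // A seed at the midpoint of the client's range stays at the midpoint of
    // the metadata's range, whatever that range happens to be.
    assert_eq!(scale_seed(1024, 4_294_967_296), 2_147_483_648);
    println!("scaled seed: {}", scale_seed(1024, 4_294_967_296));
}
```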

@iliana
Contributor Author

iliana commented Aug 14, 2019

Direct access is preferable to having to filter through an iterator.

          { "part": "boot", "target": "stuff-boot-thar-aws-eks-1.13-m1.20191006.img"},    
          { "part": "root", "target": "stuff-boot-thar-aws-eks-1.13-m1.20191006.img"},    
          { "part": "hash", "target": "stuff-boot-thar-aws-eks-1.13-m1.20191006.img"}    

should probably be

    {
        "boot": {"target": "stuff-boot-thar-whatever-blah-lskjafd-df.img"},
        ...
    }

@iliana
Contributor Author

iliana commented Aug 14, 2019

The waves can similarly be more direct / less verbose. Maybe something like:

"waves": {
    "0": "2019-10-06T15:00:00Z",
    "858993459": "2019-10-06T23:00:00Z",
    "1717986918": "2019-10-07T23:00:00Z"
}

since they're just points on a graph. (Not a fan of having to stringify the bounds since JSON only permits strings for object keys, though.)

"bound_max" doesn't feel like something we need to provide, it can be inferred by the client.

@sam-aws
Contributor

sam-aws commented Aug 14, 2019

"boot": {"target": "stuff-boot-thar-whatever-blah-lskjafd-df.img"},

Yeah, if we're not worried about seeing arbitrary partitions then we could even just have

"images": {
    "boot": "blah.img",
    "root": "blah.img",
    "hash": "blah.img",
}

@sam-aws
Contributor

sam-aws commented Aug 14, 2019

"waves": {
"0": "2019-10-06T15:00:00Z",

Ack, but I'm assuming we still need distinct start and end times per wave

@iliana
Contributor Author

iliana commented Aug 14, 2019

Ack, but I'm assuming we still need distinct start and end times per wave

Wouldn't the start/end times just be the boundaries?

Unless you actively want gaps in between waves, which I'm not certain is necessary.

@sam-aws
Contributor

sam-aws commented Aug 15, 2019

(Not a fan of having to stringify the bounds since JSON only permits strings for object keys, though.)

I agree with you here, and it makes parsing it a little nasty. Would a better compromise be something like this?

"waves": [
    {"bound" : 0, "time" : "2019-10-06T15:00:00Z"},
    ...

@iliana
Contributor Author

iliana commented Aug 15, 2019

I would +1 once I figure out if Serde can decode something like that directly into a BTreeMap<u64, DateTime<Utc>> :)
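Serde can't decode a JSON array of objects straight into a map, but decoding into a `Vec` of pairs and collecting is a one-liner. A std-only sketch of just the conversion step (the serde layer is omitted, and `waves_from_pairs` is a hypothetical helper name):

```rust
use std::collections::BTreeMap;

// Pretend serde has already decoded the `waves` array of {"bound", "time"}
// objects into (bound, time) pairs; collecting them yields the ordered map
// the client wants, keyed by a numeric bound rather than a stringified one.
fn waves_from_pairs(pairs: Vec<(u64, String)>) -> BTreeMap<u64, String> {
    pairs.into_iter().collect()
}

fn main() {
    let decoded = vec![
        (0, "2019-10-06T15:00:00Z".to_string()),
        (858993459, "2019-10-06T23:00:00Z".to_string()),
    ];
    let waves = waves_from_pairs(decoded);
    assert_eq!(waves.len(), 2);
    assert_eq!(waves.get(&0).map(String::as_str), Some("2019-10-06T15:00:00Z"));
    println!("{} waves", waves.len());
}
```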

@jahkeup
Member

jahkeup commented Aug 19, 2019

In addition to the upgrade wave architecture and facilities, the update metadata, as proposed, overlaps with what we'll likely want in order to integrate other "update orchestrators" (for cooperating with the likes of Kubernetes and ECS).

Let me re-read and digest some of the proposed metadata here a bit more. I've already identified data I had in mind for such an implementation, and I think this metadata can accommodate much of what's drafted. As for the update orchestrator itself, I'll open an issue to drive its integration concerns.

@sam-aws
Contributor

sam-aws commented Aug 19, 2019

A quick thought: do we need to specify the datastore version associated with a "thar" version as well, given that they are not necessarily in lock-step?
Edit: I suppose this is covered by specifying what migrations are required for a given update, however.

@jahkeup
Member

jahkeup commented Aug 20, 2019

The metadata regarding datastore version for a given thar version is useful information, but I think the slated migration & datastore-upgrade design mitigates the need for the update process to consider it at all - barring a nifty policy to control such upgrades.

That mapping does offer insight for responding, as in the case of traditional software upgrades that may or may not require a database migration. That is to say: an upgrade with a migration is likely more intrusive and risky than one without, so extra care may be taken. In our case, I would hazard a guess that an upgrade which doesn't include a datastore migration is only marginally less dangerous than one that does, given that the current designs account for some of the known-unknowns and will, tentatively, be tested for each logically possible migration path.

@sam-aws
Contributor

sam-aws commented Aug 20, 2019

...design mitigates the need for the update process to consider it at all

Yep, these should be in-image already, but the design does mention the possibility of specifying migration "fix-ups" that are separately downloaded. These could be covered by just releasing a new image version as well, though.

@tjkirch
Contributor

tjkirch commented Aug 20, 2019

migration "fix-ups" that are separately downloaded

Yeah, we need migrations listed in the update metadata to be able to roll back. The "old" image won't have those migrations.

These could be covered by just releasing a new image version as well, though.

Not if you've broken the update or migration process! We need rollbacks.

I haven't been involved in this discussion yet, but the basic idea is covered in the (internal, hopefully in-repo soon) migration doc. I've been assuming that whatever metadata we decide here would either include that or would be extensible enough to easily add it...

@sam-aws
Contributor

sam-aws commented Aug 21, 2019

Following on from #91 we may have a metadata format that looks something like:

{
  "updates": [
    {
      "flavor": "thar-aws-eks",
      "arch": "x86_64",
      "version": "1.13.0",
      "status": "Ready",
      "max_version": "1.20.0",
      "waves": {
        "0": "2019-10-06T15:00:00Z",
        "500":"2019-10-07T15:00:00Z",
        "1024":"2019-10-08T15:00:00Z"
      },
      "images": {
        "boot": "stuff-boot-thar-aws-eks-1.13-m1.20191006.img",
        "root": "stuff-boot-thar-aws-eks-1.13-m1.20191006.img",
        "hash": "stuff-boot-thar-aws-eks-1.13-m1.20191006.img"
      }
    }
  ],
  "migrations": {
    "(0.1, 1.0)": ["migrate_1.0_foo"],
    "(1.0, 1.1)": ["migrate_1.1_foo", "migrate_1.1_bar"]
  },
  "datastore_versions": {
    "1.11.0": "0.1",
    "1.12.0": "1.0",
    "1.13.0": "1.1"
  }
}

Where we have

  • An array of update structures describing versions, waves, state, images, etc.
  • A migration structure that gives migrations required for some (from, to) pair.
  • A version structure that maps Thar image versions to datastore versions.

So for a given update the client would look up the target datastore version, and then look up the series of migrations needed to get from the current version to the target version.

As mentioned in #91 these mappings could also be nested arrays rather than pretend-tuples to make the JSON parsing easier; either way I think we're in for some #[serde(deserialize_with = "de::deserialize_thing")] to sort out the keys.
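That two-step lookup can be sketched with std maps. This is illustrative only: `migrations_for` is a hypothetical name, the keys are plain string tuples rather than the `"(0.1, 1.0)"` pretend-tuple form, and chaining across intermediate datastore versions is left out for brevity.

```rust
use std::collections::HashMap;

// Sketch of the lookup described above: resolve the current and target image
// versions to datastore versions, then fetch the migration list for that
// (from, to) pair. Returns None if any step of the lookup fails.
fn migrations_for<'a>(
    datastore_versions: &HashMap<String, String>,
    migrations: &'a HashMap<(String, String), Vec<String>>,
    current_image: &str,
    target_image: &str,
) -> Option<&'a Vec<String>> {
    let from = datastore_versions.get(current_image)?;
    let to = datastore_versions.get(target_image)?;
    migrations.get(&(from.clone(), to.clone()))
}

fn main() {
    let datastore_versions: HashMap<String, String> =
        [("1.11.0", "0.1"), ("1.12.0", "1.0"), ("1.13.0", "1.1")]
            .into_iter()
            .map(|(k, v)| (k.to_string(), v.to_string()))
            .collect();

    let migrations: HashMap<(String, String), Vec<String>> = [
        (("0.1", "1.0"), vec!["migrate_1.0_foo"]),
        (("1.0", "1.1"), vec!["migrate_1.1_foo", "migrate_1.1_bar"]),
    ]
    .into_iter()
    .map(|((f, t), ms)| {
        ((f.to_string(), t.to_string()),
         ms.into_iter().map(|m| m.to_string()).collect())
    })
    .collect();

    // Updating 1.12.0 -> 1.13.0 crosses datastore 1.0 -> 1.1.
    let needed = migrations_for(&datastore_versions, &migrations, "1.12.0", "1.13.0");
    assert_eq!(
        needed.unwrap(),
        &["migrate_1.1_foo".to_string(), "migrate_1.1_bar".to_string()]
    );
    println!("migrations: {:?}", needed.unwrap());
}
```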

@sam-aws
Contributor

sam-aws commented Sep 30, 2019

With the base metadata now described in https://github.com/amazonlinux/PRIVATE-thar/blob/develop/workspaces/updater/updog/src/main.rs#L109 and its surroundings, I'll close this issue. There may be some extensions that build upon this, but let's cover those in their own issues.

@sam-aws sam-aws closed this as completed Sep 30, 2019