
(shortfin-sd) Adds iree.build artifact fetching. #411

Merged: 11 commits into main on Nov 5, 2024

Conversation

@monorimet (Contributor) commented Nov 1, 2024:

Also adds two slow tests that exercise larger SDXL server loads; these will not trigger in any workflows yet.

This is missing a few things:

  • ad-hoc artifact fetching (e.g., someone initializes the server with only 1024x1024 and wants to fetch and load modules for other output shapes on demand when a client requests them)
  • compile integration (currently it pulls precompiled vmfbs and weights)

The builder isn't very smart yet; it just blindly tries to download composed filenames from sharkpublic buckets (see the sketch after this list).
It should eventually cover:

  • compile (short-term)
  • export (short/medium-term)
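
A minimal sketch of the "blind" composition referenced above, assuming get_url_map behaves as its call site later in this PR suggests; the bucket address here is illustrative, not the real sharkpublic URL:

# Illustrative only: compose bucket URLs from filenames, with no
# existence or integrity checks before a download is attempted.
SDXL_WEIGHTS_BUCKET = "https://sharkpublic.example.com/sdxl/weights/"

def get_url_map(filenames: list[str], bucket: str) -> dict[str, str]:
    return {f: bucket + f for f in filenames}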

@monorimet marked this pull request as draft on November 1, 2024 at 20:15
clip_dtype: sfnp.DType = sfnp.float16
unet_dtype: sfnp.DType = sfnp.float16
vae_dtype: sfnp.DType = sfnp.float16

use_i8_punet: bool = False
@monorimet (Contributor, Author) commented:

really don't like this but the module expects fp16 I/O...

@monorimet (Contributor, Author) commented Nov 1, 2024:

Maybe a specific place for IO dtype / params type is in order, but it's quite a distinction to start making over one inconsistency. One (*_dtype) is used for instructing device array creation, and the other (use_i8_punet) is used when inferring artifact names. Perhaps the filename convention should account for these cases, i.e., keep the precision spec for I/O and add a _pi8_ to denote "int8 params" or whatever fnuz924v83 datatype we need to parametrize for.
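
A hypothetical sketch of the convention floated here: keep the I/O precision tag and append a separate params tag when the weights precision differs. The filename grammar and names are illustrative, not the repository's actual convention:

def compose_punet_filename(io_dtype: str, use_i8_punet: bool) -> str:
    # Hypothetical grammar: I/O precision tag plus "_pi8" for int8 params.
    params_tag = "_pi8" if use_i8_punet else ""
    return f"sdxl_punet_{io_dtype}{params_tag}.vmfb"

# compose_punet_filename("fp16", True) -> "sdxl_punet_fp16_pi8.vmfb"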

Contributor:

Drive-by comment: I agree with the above. We have multiple punet models to support, like int8 and fp8, so it would be better to keep them separate.

@monorimet marked this pull request as ready for review on November 1, 2024 at 20:36
parent = os.path.dirname(this_dir)
default_config_json = os.path.join(parent, "examples", "sdxl_config_i8.json")

dtype_to_filetag = {
@monorimet (Contributor, Author) commented Nov 1, 2024:

Maybe we rename the artifacts to match sfnp.DType attributes instead of doing little workarounds like this for old naming conventions. Once the exports are spinning and publishing regularly, we can make changes with control.
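
For context, the kind of mapping under discussion plausibly looks like the following; the entries are assumptions, not the PR's exact table:

import shortfin.array as sfnp  # assumed import alias

# Legacy filename tags keyed by sfnp dtype. Renaming artifacts to match
# the sfnp.DType attribute names would make this table unnecessary.
dtype_to_filetag = {
    sfnp.float16: "fp16",
    sfnp.float32: "fp32",
    sfnp.int8: "i8",
    sfnp.bfloat16: "bf16",
}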

@monorimet (Contributor, Author) commented:
NOTE: needs rebase on main once #413 lands

params_urls = get_url_map(params_filenames, SDXL_WEIGHTS_BUCKET)
ctx = executor.BuildContext.current()
for f, url in params_urls.items():
out_file = os.path.join(ctx.executor.output_dir, f)
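
The loop body is truncated in this excerpt; a hedged sketch of how it might continue, assuming iree.build exposes a fetch_http action and using a plain existence check as a placeholder for real change detection:

import os
from iree.build import fetch_http  # assumed entrypoint

for f, url in params_urls.items():
    out_file = os.path.join(ctx.executor.output_dir, f)
    # Placeholder guard: fetch only when the artifact is absent locally.
    # The thread below discusses stronger md5/stamp-based checks.
    if not os.path.exists(out_file):
        fetch_http(name=f, url=url)
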
@monorimet (Contributor, Author) commented Nov 2, 2024:

This isn't very robust. We should have an md5sum checklist fetched from the bucket when downloads are enabled, and compare it against a local checklist to determine which artifacts, if any, need updating.

Contributor:

Yeah, I hadn't yet gotten to stamp and change detection... Will in a bit.

Do you already have file hashes stored in the bucket somewhere?

@monorimet (Contributor, Author) commented Nov 2, 2024:

Not yet. Right now, the mlir/vmfbs are always downloaded from a bucket versioned by date only.

Contributor:

That can work. You basically need something to derive a stamp value from. That can come from some part of the URL.

Contributor:

For mammoth files, a manual version of some kind can be best anyway: it can take a long time to compute a hash of such things.

@monorimet (Contributor, Author) commented Nov 2, 2024:

Does it seem too heavyweight to keep an md5sums.json in each bucket, and have the builder generate and keep a local set of hashes for its outputs? That way we can filter exactly what's needed before doing fetch_http. (edit: I'm pretty sure that's the same thing, just more fine-grained and expensive, I suppose -- I just never liked having to download a new set of HF weights because someone added a completely unrelated file to the repo.)
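
A sketch of that scheme, assuming the bucket publishes an md5sums.json manifest mapping filename to hash; every name here is hypothetical:

import json
import urllib.request

def stale_files(bucket_url: str, local_manifest: str) -> list[str]:
    # Fetch the bucket's {filename: md5} manifest.
    with urllib.request.urlopen(bucket_url + "md5sums.json") as resp:
        remote = json.load(resp)
    # Load the hashes recorded for previously fetched outputs.
    try:
        with open(local_manifest) as f:
            local = json.load(f)
    except FileNotFoundError:
        local = {}
    # Only files whose hash is missing locally or has changed need fetching.
    return [f for f, md5 in remote.items() if local.get(f) != md5]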

@stellaraccident (Contributor) commented Nov 2, 2024:

Yeah, that's the basic mechanism. We wouldn't actually compute the hash in the builder in typical use. Instead, you would tell it how to get the stamp artifact (i.e. some fixed string, a hash file, etc.). If a hash file, we compute a running hash only during download and store the result, erroring if it mismatches. But just an opaque stamp value drives the up-to-date check.

It's better for everyone if such artifacts are in write-once storage (i.e. the same URL produces the same content for all of time). Then the stamp is just the URL, and any hash checking is just for verifying the integrity of the transfer. That avoids several kinds of update race issues, and it means that you can do the up-to-date check without network access.
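
A sketch of that write-once scheme, where the URL itself is the stamp and the up-to-date check needs no network access; names are illustrative:

import os

def is_up_to_date(out_file: str, url: str) -> bool:
    stamp_file = out_file + ".stamp"
    if not (os.path.exists(out_file) and os.path.exists(stamp_file)):
        return False
    with open(stamp_file) as f:
        # Write-once contract: the same URL always yields the same
        # content, so a matching stamp proves freshness offline.
        return f.read().strip() == url

def record_stamp(out_file: str, url: str) -> None:
    with open(out_file + ".stamp", "w") as f:
        f.write(url)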

@monorimet enabled auto-merge (squash) November 5, 2024 01:59
@monorimet disabled auto-merge November 5, 2024 01:59
@monorimet enabled auto-merge (squash) November 5, 2024 02:00
@monorimet merged commit e282fbc into main on Nov 5, 2024
11 checks passed
@monorimet deleted the sdxl-ireebuild branch November 5, 2024 02:57