Conversation

@keturn (Contributor) commented Jul 29, 2023

InvokeAI 3.0.1 tends to download full-width tensor files even when it's configured to run them in float16: #4127

One step toward diagnosing and correcting this is to make the data types of the tensor files more visible. The presence of .fp16 in the file name is a good hint, but I think we should verify it against the file contents.
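
As an illustration (not part of this PR), here's a minimal sketch of how the stored dtypes in a .safetensors file can be read straight from its JSON header, without loading any tensors, assuming the standard safetensors layout:

import json
import struct
from pathlib import Path

def safetensors_dtypes(path: Path) -> set:
    """Collect the dtypes recorded in a .safetensors header, without loading tensors."""
    with path.open("rb") as f:
        # The first 8 bytes are a little-endian uint64 giving the JSON header length.
        (header_len,) = struct.unpack("<Q", f.read(8))
        header = json.loads(f.read(header_len))
    # Every entry except the optional "__metadata__" block describes one tensor.
    return {entry["dtype"] for key, entry in header.items() if key != "__metadata__"}

# e.g. {"F16"} for an fp16 checkpoint, {"F32"} for a full-width one
print(safetensors_dtypes(Path("diffusion_pytorch_model.fp16.safetensors")))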

Ultimately, there should be UI for this in the web interface. For now, I've hacked it into invokeai-model-install --list-models.

[code example]

If you want, you can see a per-file breakdown (for submodels) instead of a single item for the whole diffusers multi-model:

import itertools
from pathlib import Path

from invokeai.app.services.config import InvokeAIAppConfig
from invokeai.backend.model_management import ModelManager
from invokeai.backend.model_management.models.base import calc_file_format_and_dtype

config = InvokeAIAppConfig.get_config()
mm = ModelManager(config.model_conf_path)

# TODO: How do we figure out which tensor files will actually be loaded?

def print_tensor_types_in_directory(model_path: Path):
    extensions = ['safetensors', 'ckpt', 'bin']
    if model_path.is_dir():
        # diffusers-style models keep their submodels in subdirectories
        tensor_paths = list(itertools.chain.from_iterable(model_path.glob(f"**/*.{ext}") for ext in extensions))
    else:
        tensor_paths = [model_path]

    for path in tensor_paths:
        file_format, dtype = calc_file_format_and_dtype(path)

        if model_path.is_dir():
            relative_path = path.relative_to(model_path)
        else:
            relative_path = path.name

        # e.g. "S float16" for a safetensors file, "P float32" for a pickle-based one
        type_str = str(dtype).rsplit('.', 1)[-1]
        print(f"  {file_format[0].upper()} {type_str}: {relative_path}")

# each entry from model_names() is a (name, base_model, model_type) tuple
for name in mm.model_names():
    print(f"{name[0]} [{name[1]}/{name[2]}]:")
    model = mm._instantiate(*name)
    print_tensor_types_in_directory(model.model_path)
    print()

Output:

stable-diffusion-xl-refiner-1-0 [BaseModelType.StableDiffusionXLRefiner/ModelType.Main]:
  S float32: text_encoder_2/model.safetensors
  S float32: vae/diffusion_pytorch_model.safetensors
  S float32: unet/diffusion_pytorch_model.safetensors

normal_bae [BaseModelType.StableDiffusion1/ModelType.ControlNet]:
  S float16: diffusion_pytorch_model.fp16.safetensors

tile [BaseModelType.StableDiffusion1/ModelType.ControlNet]:
  P float32: diffusion_pytorch_model.bin

This PR is a proof of concept and is based on #4059. I expect it will have to change somewhat after the other model-manager fixes for 3.0.1 land.

TODO

  • handle non-safetensor files? (are pickles banned yet?) See the probing sketch below.
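
For reference, a rough sketch (not this PR's implementation) of one way to probe the dtypes stored in a pickle-based checkpoint (.bin/.ckpt). Unlike a safetensors header read, it loads the whole state dict onto the CPU, and unpickling can execute arbitrary code, which is why the "banned" question matters:

import torch
from pathlib import Path

def pickle_checkpoint_dtypes(path: Path) -> set:
    # Loads the full state dict into CPU memory; much heavier than a header-only probe.
    checkpoint = torch.load(path, map_location="cpu")
    # .ckpt files usually nest the weights under "state_dict"; diffusers .bin files are flat.
    state_dict = checkpoint.get("state_dict", checkpoint)
    return {t.dtype for t in state_dict.values() if isinstance(t, torch.Tensor)}

print(pickle_checkpoint_dtypes(Path("diffusion_pytorch_model.bin")))  # e.g. {torch.float32}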

What type of PR is this? (check all applicable)

  • Refactor
  • Feature
  • Bug Fix
  • Optimization
  • Documentation Update
  • Community Node Submission

Have you discussed this change with the InvokeAI team?

  • Hello

Have you updated all relevant documentation?

  • Yes
  • No

Related Tickets & Documents

  • Related Issue #
  • Closes #

QA Instructions, Screenshots, Recordings

Added/updated tests?

  • Yes
  • No : please replace this line with details on why tests have not been included

@keturn (Contributor, Author) commented Jul 29, 2023

Some questions about the UI & API for this, too. For example, what do we do about submodels? Is there ever a time when, say, a text encoder is at float32 while the unet is at float16?

If so, how should the supermodel represent its type?

@keturn added the "enhancement (New feature or request)" and "model manager" labels on Jul 29, 2023
@lstein (Collaborator) commented Jul 30, 2023

> Some questions about the UI & API for this, too. For example, what do we do about submodels? Is there ever a time when, say, a text encoder is at float32 while the unet is at float16?
>
> If so, how should the supermodel represent its type?

I haven't seen mixed-precision models, but in theory there's no reason why they couldn't exist. I've created them by accident and they were fully functional. I guess if you had to, you'd create a type called "mixed" for the supermodel.
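
To illustrate the "mixed" idea (hypothetical helper, not an existing InvokeAI API), a supermodel's reported dtype could be rolled up from its submodels and fall back to "mixed" when they disagree:

import torch

def summarize_dtype(submodel_dtypes: dict) -> str:
    # Collapse the submodels' dtypes into a single label for the parent model.
    unique = set(submodel_dtypes.values())
    if len(unique) == 1:
        return str(unique.pop()).rsplit(".", 1)[-1]  # e.g. "float16"
    return "mixed"

print(summarize_dtype({"unet": torch.float16, "text_encoder": torch.float32}))  # -> mixed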

@keturn (Contributor, Author) commented Aug 1, 2023

Updated to support pickles and single-file models.

@keturn (Contributor, Author) commented Aug 22, 2023

API Questions

  • I ended up including serialization format (pickle vs safetensors) as well. What's the desired interface for this?
    • Different methods for each?
    • Combine serialization format + data type into one struct? (see the rough sketch after this list)
    • Combine also with things that return a “model format”?
  • Should we stick with returning a torch.dtype, or do we need something more runtime-agnostic in anticipation of having non-PyTorch models?
  • Does needing to describe a type as "mixed" point toward using a new type?
  • How do we avoid running dtype-detection code when we don't want to? I put it in ModelManager.list_models so it would show up in the CLI --list-models output, and that method already returned a loose dict of fields. But other things use that method too. The dtype-detection code is pretty fast, but it does require reading at least a bit from each file, which is slower than operations that only need a stat or directory listing.
  • How should this be exposed in the web API?
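
On the "combine into one struct" option, here is a rough sketch of what such a record could look like (illustrative names only, not an existing InvokeAI type); using plain string enums also sidesteps the torch.dtype question and leaves room for a "mixed" value:

from dataclasses import dataclass
from enum import Enum

class SerializationFormat(str, Enum):
    SAFETENSORS = "safetensors"
    PICKLE = "pickle"

class StorageDType(str, Enum):
    # runtime-agnostic labels rather than torch.dtype values
    FLOAT32 = "float32"
    FLOAT16 = "float16"
    BFLOAT16 = "bfloat16"
    MIXED = "mixed"

@dataclass(frozen=True)
class TensorFileInfo:
    serialization: SerializationFormat
    dtype: StorageDType

info = TensorFileInfo(SerializationFormat.SAFETENSORS, StorageDType.FLOAT16)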

@bghira commented Sep 1, 2023

I can state with certainty that I've never bothered to label fp16 versions of my models, and I've only ever created full-width versions, as they're intended to be fine-tuned from.

@Millu (Contributor) commented Nov 3, 2023

@keturn this would be reliant on the MM refactor, right?

I have concerns about the slowdown, but if it's only slow during model download, that wouldn't be an issue.
