add graph cache key #576

Merged: 9 commits merged into main from fix_graph_management_utils on Jan 29, 2024
Conversation

@ccssu (Contributor) commented Jan 26, 2024

No description provided.

Comment on lines 34 to 35
cache_key = calculate_model_hash(model) + "_" + flow.__version__
return f"{file_path}_{count}_{cache_key}.graph"
@ccssu (Contributor Author) commented Jan 26, 2024:

Add a key based on the oneflow model structure to avoid conflicts caused by custom class registrations, for example registering a custom CrossAttention1f but then loading a previously saved graph file.

torch2of_class_map = {
    comfy.ldm.modules.attention.CrossAttention: CrossAttention1f,
    comfy.ldm.modules.attention.SpatialTransformer: SpatialTransformer1f,
    comfy_ops_Linear: Linear1f,
    AttnBlock: AttnBlock1f,
}

Collaborator:

This feature supports caching of the compiled graph file; so, if the model structure changes, a key derived from the model structure is added to the file name to avoid reusing a stale cache?

Collaborator:

This way of constructing the key seems rather expensive: it takes the repr string of the entire module to generate it?

@ccssu (Contributor Author):

Yes, the key is generated from the repr string of the entire module. The overhead is about 0.1 to 0.2 s on SDXL 1.0, which seems acceptable, and the main benefit is safety. As highlighted by the red box in the screenshot below, the real bottleneck here is the torch2flow conversion.

[screenshot: profiling output]
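A minimal sketch of how such a repr-based hash could be computed (assuming hashlib; the actual calculate_model_hash in the PR may differ):

import hashlib

def calculate_model_hash(model):
    # Hash the repr of the whole module: registering a custom class such as
    # CrossAttention1f changes the repr, and therefore the cache key.
    return hashlib.sha256(repr(model).encode("utf-8")).hexdigest()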

Collaborator:

graph_file_management is executed on every graph call; a 100 ms overhead is unacceptable, and even 1 ms deserves careful thought.

@ccssu (Contributor Author) commented Jan 29, 2024:

> graph_file_management is executed on every graph call; a 100 ms overhead is unacceptable, and even 1 ms deserves careful thought.

This is not executed on every graph call; it only runs the first time the graph is loaded.
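As a rough illustration of that control flow, a hypothetical run-once guard (the flag name and decorator shape are assumptions; the real graph_file_management in onediff may be structured differently):

def graph_file_management(func):
    def wrapper(self, *args, **kwargs):
        # Only handle the graph file (cache key, load/save) on the first call.
        if not getattr(self, "_graph_file_handled", False):
            self._graph_file_handled = True
            # ... compute the cache key and load or save the graph file here ...
        return func(self, *args, **kwargs)
    return wrapper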

options={},
graph_path=None,
graph_device=None,
self, torch_module, oneflow_module, use_graph=True, dynamic=True, options={}
@ccssu (Contributor Author):

Removed the leftover graph_path=None and graph_device=None parameters.

Collaborator:

Please add interface documentation for these two parameters here, in the style of:

- 'size' which configures the cache size when the cache is enabled. Note that after onediff v0.12, the cache is disabled by default.

@ccssu (Contributor Author):

done

        - 'graph_file' which configures the graph file path, default None.
        - 'graph_file_device' which configures the device of the graph file, default None.

Collaborator:

This could be made more complete: if graph_file is configured, a cache of the compilation result is generated; if graph_file_device is also configured, the compilation result is moved to that device when it is loaded, so the device can be changed.

@ccssu (Contributor Author):

  • 'graph_file' (None) enables a compilation cache file. If the file exists, it is loaded; if not, the compilation result is saved there after the first compilation.
  • 'graph_file_device' (None) sets the device for the graph file. The loaded compilation result is moved to the specified device, so compilation can be reused across devices.
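A hypothetical usage sketch of these two options (the option names follow the docstring above; the file name and device string are made-up examples):

options = {
    "graph_file": "unet_sdxl.graph",   # save/load the compiled graph here
    "graph_file_device": "cuda:0",     # move the loaded graph to this device
}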

@ccssu ccssu requested a review from strint January 26, 2024 06:52
Comment on lines 31 to 47
with cost_time(
    debug=transform_mgr.debug_mode, message="calculate model input count"
):
    args_tree = ArgsTree((args, kwargs), False, tensor_type=torch.Tensor)
    count = len(
        [v for v in args_tree.iter_nodes() if isinstance(v, flow.Tensor)]
    )

with cost_time(debug=transform_mgr.debug_mode, message="get model"):
    model = self._deployable_module_model.oneflow_module

with cost_time(
    debug=transform_mgr.debug_mode,
    message="calculate model hash for cache key",
):
    cache_key = calculate_model_hash(model) + "_" + flow.__version__
return f"{file_path}_{count}_{cache_key}.graph"
@ccssu (Contributor Author) commented Jan 28, 2024:

DEBUG [2024-01-28 02:34:06] - calculate model input count run time 7.200241088867188e-05 seconds
DEBUG [2024-01-28 02:34:06] - Convert <class 'comfy.ldm.modules.diffusionmodules.openaimodel.UNetModel'> ...
DEBUG [2024-01-28 02:34:09] - Convert id(self._torch_module)=140361274282560 done!
DEBUG [2024-01-28 02:34:09] - get model run time 3.119259834289551 seconds
DEBUG [2024-01-28 02:34:09] - calculate model hash for cache key run time 0.014719009399414062 seconds
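For context, a minimal sketch of what a cost_time helper like the one in the excerpt above could look like, assuming it is a simple timing context manager; the actual implementation in onediff may differ:

import time
from contextlib import contextmanager

@contextmanager
def cost_time(debug=False, message=""):
    # Hypothetical sketch: time the wrapped block and log it when debug is on.
    start = time.time()
    try:
        yield
    finally:
        if debug:
            print(f"DEBUG - {message} run time {time.time() - start} seconds")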

Collaborator:

Why does "get model" take 3 s? Isn't this just a simple attribute access?

@ccssu (Contributor Author) commented Jan 29, 2024:

> Why does "get model" take 3 s? Isn't this just a simple attribute access?

Frequent enabling and disabling of mock_torch accounts for nearly half of that time. Caching all of the classes in the model up front here should save more than 1 s.

[screenshot: profiling output]
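A hedged sketch of the up-front class caching idea (the helper name and the use of torch2of_class_map as the lookup table are assumptions, not the PR's actual code):

from functools import lru_cache

@lru_cache(maxsize=None)
def cached_class_transform(torch_cls):
    # Hypothetical: memoize the torch -> oneflow class lookup so that
    # mock_torch does not need to be toggled for every module.
    return torch2of_class_map.get(torch_cls, torch_cls)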

@strint strint merged commit ca046d1 into main Jan 29, 2024
1 check passed
@strint strint deleted the fix_graph_management_utils branch January 29, 2024 13:18