Does OmniGen support changing the sdxl model? #24

XueniLuo · 2024-11-15T02:28:19Z

Hi, thanks for this stunning work!
If I understood the paper correctly, the SDXL and VAE parts are frozen and only the transformer is trained. Would it be possible to provide a ComfyUI node that handles only the transformer conditioning, so that an OmniGen workflow could use the common checkpoint loader and KSampler nodes? That way we could swap in different SDXL checkpoints, and perhaps even Flux checkpoints.

set-soft · 2024-11-23T13:57:16Z

OmniGen is derived from Phi-3, which is an LLM (technically an SLM).
Specifically, it derives from the variant that includes vision: https://huggingface.co/microsoft/Phi-3-vision-128k-instruct
The fact that it uses the SDXL VAE doesn't make it similar to SDXL at all.
I created some nodes that separate the functionality: https://github.com/set-soft/ComfyUI_OmniGen_Nodes/
You can use the VAE loader and decoder from the core nodes; internally, I use the VAE class from ComfyUI to encode the input images.
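
To illustrate why only the VAE is interchangeable, here is a minimal shape-only sketch. It is not OmniGen or ComfyUI code: `Latent`, `latent_shape_for`, `denoise`, and `generate` are hypothetical names, and `denoise` is a placeholder standing in for the Phi-3-derived transformer. The only facts taken from the real model are that the SDXL VAE downsamples by 8 in each spatial dimension and uses 4 latent channels.

```python
from dataclasses import dataclass
from typing import Tuple

# Properties of the SDXL VAE: 8x spatial downsampling, 4 latent channels.
VAE_DOWNSCALE = 8
VAE_LATENT_CHANNELS = 4


@dataclass(frozen=True)
class Latent:
    """Shape-only stand-in for a latent tensor (hypothetical helper)."""
    channels: int
    height: int
    width: int


def latent_shape_for(image_height: int, image_width: int) -> Latent:
    """Latent shape the VAE works with for an image of this size."""
    if image_height % VAE_DOWNSCALE or image_width % VAE_DOWNSCALE:
        raise ValueError("image sides must be multiples of 8")
    return Latent(
        VAE_LATENT_CHANNELS,
        image_height // VAE_DOWNSCALE,
        image_width // VAE_DOWNSCALE,
    )


def decoded_size(latent: Latent) -> Tuple[int, int]:
    """Image size (height, width) the VAE would decode this latent to."""
    return (latent.height * VAE_DOWNSCALE, latent.width * VAE_DOWNSCALE)


def denoise(latent: Latent, steps: int) -> Latent:
    """Placeholder for the denoiser (assumption: it preserves latent shape).

    In OmniGen this role is played by a transformer derived from Phi-3,
    not by the UNet stored in an SDXL checkpoint, so SDXL checkpoint
    weights have nothing to plug into here.
    """
    for _ in range(steps):
        pass  # a real model would predict and remove noise at each step
    return latent


def generate(image_height: int, image_width: int, steps: int = 50) -> Tuple[int, int]:
    """Shape flow of one generation: latent -> denoiser -> VAE decode."""
    latent = latent_shape_for(image_height, image_width)
    latent = denoise(latent, steps)
    return decoded_size(latent)
```

The VAE functions at either end depend only on the latent format, which is why a standard VAE loader and decoder can be reused, while the middle stage is specific to the model.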
