Skip to content

Latest commit

 

History

History
34 lines (25 loc) · 1.16 KB

gaudi-xl.md

File metadata and controls

34 lines (25 loc) · 1.16 KB

GAUDI-XL

i've been thinking about decoupling pose and scene for a while, this is basically a demonstration of that idea.

fit a separate model as a prior over the (scene) latent. maybe an adaptor layer/module/network over a frozen TTI mode like stable diffusion or maybe even CLIP. I bet noised CLIP could work. simple way to regularize over the semantic content of a whole scene? that might be a separate idea to explore.

https://github.com/apple/ml-gaudi

throw some of these in the pot too, baby we got a stew goin!

from the citations