mist over green hills
shattered plates on the grass
cosmic love and attention
a time traveler in the crowd
Simple command line tool for text to image generation using OpenAI's CLIP and Siren. Credit goes to Ryan Murdock for the discovery of this technique (and for coming up with the great name)!
This will require that you have an Nvidia GPU
$ pip install deep-daze
$ imagine "a house in the forest"
That's it.
If you have enough memory, you can get better quality by adding a --deeper
flag
$ imagine "shattered plates on the ground" --deeper
In true deep learning fashion, more layers will yield better results. Default is at 16
, but can be increased to 32
depending on your resources.
$ imagine "stranger in strange lands" --num-layers 32
If you would like to invoke it in code.
from deep_daze import Imagine
imagine = Imagine(
text = 'cosmic love and attention',
num_layers = 24
)
imagine()
This is just a teaser. We will be able to generate images, sound, anything at will, with natural language. The holodeck is about to become real in our lifetimes.
Please join replication efforts for DALL-E for Pytorch or Mesh Tensorflow if you are interested in furthering this technology.
@misc{unpublished2021clip,
title = {CLIP: Connecting Text and Images},
author = {Alec Radford, Ilya Sutskever, Jong Wook Kim, Gretchen Krueger, Sandhini Agarwal},
year = {2021}
}
@misc{sitzmann2020implicit,
title = {Implicit Neural Representations with Periodic Activation Functions},
author = {Vincent Sitzmann and Julien N. P. Martel and Alexander W. Bergman and David B. Lindell and Gordon Wetzstein},
year = {2020},
eprint = {2006.09661},
archivePrefix = {arXiv},
primaryClass = {cs.CV}
}