The project utilizes Stable Diffusion, a state-of-the-art model that converts text descriptions into high-quality images. It incorporates:
- CLIP: To understand and process the text input for better alignment with the generated images.
- U-Net Encoder/Decoder: For the image generation pipeline ensuring detailed and accurate imagery.
- Generate images from text.
- Integration of robust models like CLIP and U-Net.
- Scalable architecture for further enhancements.
To set up this project, follow these steps:
# Clone the repository
git clone https://github.com/badaramoni/StableDiffusionText2Img.git
cd StableDiffusionText2Img