Art generation using VQGAN + CLIP using docker containers. A simplified, updated, and expanded upon version of Kevin Costa's work. This project tries to make generating art as easy as possible for anyone with a GPU by providing a simple web UI.
For samples check out the AI Generated section on the art page on website.
This works best if you have an NVIDIA GPU however there is a fallback CPU mode included. I've found the CPU mode to take significantly longer than even the most basic of GPUs though.
For running just deploy this SDL and wait for the models to finish loading, watching the process in the logs.
The larger the image the more VRAM your graphics card needs:
- 6 GB of VRAM is required to generate 256x256 images.
- 12 GB of VRAM is required to generate 512x512 images.
- 24 GB of VRAM is required to generate 1024x1024 images.
If you don't know how much VRAM your graphics card has you probably have 6 GB or less so stick with smaller images.
That being said you can do non-square images if you want as long as you don't go above the number of pixels your GPU's VRAM supports, for example you could do ultrawide images with 6 GB of ram at "384x128" or do tall images at "128x384" and so on. You do not have to use numbers with a power of 2, "300x100" is also perfectly valid.
https://github.com/alfset/awesome-akash/blob/master/AI-Image-App/ai-image-app_screen-capture.webm