Generating visual arts from text prompt and input guiding image.
On‑device, high‑resolution image synthesis from text and image prompts. ControlNet guides Stable‑diffusion with provided input image to generate accurate images from given input prompt.
Input:Text prompt and input image as a reference
Conditioning Input:Canny-Edge
Text Encoder Number of parameters:340M
UNet Number of parameters:865M
VAE Decoder Number of parameters:83M
ControlNet Number of parameters:361M
Model size:1.4GB
Image Generation
Image Editing
Content Creation
Source Model: APACHE-2.0
Deployable Model: AI-HUB-MODELS-LICENSE