import torch
from diffusers import StableDiffusionPipeline
model_id = "CompVis/stable-diffusion-v1-4"
device = "cuda"
pipe = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.float16)
pipe = pipe.to(device)
# Optional: slice the attention computation to reduce peak VRAM usage (slightly slower)
# pipe.enable_attention_slicing()
prompt = "a photo of an astronaut riding a horse on mars"
image = pipe(prompt).images[0]
image.save("astronaut_rides_horse.png")
If GPU memory is limited, with less than 4 GB of VRAM available, make sure to load the StableDiffusionPipeline in float16 precision.
from diffusers import AutoencoderKL

vae = AutoencoderKL.from_pretrained(
    "CompVis/stable-diffusion-v1-4",
    subfolder="vae",
    revision="ebb811dd71cdc38a204ecbdd6ac5d580f529fd8c")
# encode() returns a distribution over latents; sample from it to get a latent tensor
latents = vae.encode(images).latent_dist.sample()
# decode() returns a DecoderOutput; .sample holds the reconstructed images
images = vae.decode(latents).sample
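The VAE round-trip above assumes `images` is a batch of float tensors normalized to the [-1, 1] range. A minimal sketch of that normalization and its inverse for the decoded output, written in NumPy for illustration; the `preprocess`/`postprocess` names are hypothetical helpers, not part of the diffusers API:

```python
import numpy as np

def preprocess(pixels: np.ndarray) -> np.ndarray:
    """Map uint8 pixel values in [0, 255] to the [-1, 1] range the VAE expects."""
    return pixels.astype(np.float32) / 127.5 - 1.0

def postprocess(sample: np.ndarray) -> np.ndarray:
    """Map decoded samples from [-1, 1] back to uint8 pixels in [0, 255]."""
    return np.clip((sample + 1.0) * 127.5, 0, 255).round().astype(np.uint8)

# Round trip: normalizing and then denormalizing recovers the original pixels
pixels = np.array([[0, 128, 255]], dtype=np.uint8)
restored = postprocess(preprocess(pixels))
```

In practice the same scaling is applied to PyTorch tensors of shape (batch, channels, height, width) before calling `vae.encode`, and the inverse is applied to `vae.decode(...).sample` before converting back to an image.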