25. Latent diffusion

Latest recommended article published on 2024-07-19 22:46:50

Klein&Macmillan


Category: Practical Deep Learning for Coders. Tags: artificial intelligence, machine learning, deep learning

Article link: https://blog.csdn.net/qq18218628646/article/details/139339877


In this final lesson of the series, Johno begins by showing how to convert sounds into pictures (spectrograms), and then uses what we've learned in this course to generate audio. He builds and demonstrates a very effective bird-song generator using this approach.
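To make the "sounds into pictures" idea concrete, here is a minimal sketch of turning a waveform into a 2D array that can be treated like an image. It is not the lesson's code: the `spectrogram` helper, the FFT size, the hop length, and the synthetic 1 kHz test tone are all assumptions chosen for illustration, and only NumPy is used.

```python
import numpy as np

def spectrogram(signal, n_fft=256, hop=128):
    """Log-magnitude spectrogram, shape (n_fft // 2 + 1, n_frames).

    Illustrative helper (not from the lesson): a plain short-time
    Fourier transform with a Hann window.
    """
    window = np.hanning(n_fft)
    n_frames = 1 + (len(signal) - n_fft) // hop
    # Slice the signal into overlapping, windowed frames.
    frames = np.stack(
        [signal[i * hop : i * hop + n_fft] * window for i in range(n_frames)]
    )
    # One FFT per frame: rows are time steps, columns are frequency bins.
    magnitude = np.abs(np.fft.rfft(frames, axis=1))
    # Log-compress and transpose so frequency runs along the vertical axis.
    return np.log1p(magnitude).T

# Assumed test input: one second of a 1 kHz sine sampled at 8 kHz.
sample_rate = 8000
t = np.arange(sample_rate) / sample_rate
tone = np.sin(2 * np.pi * 1000 * t)

spec = spectrogram(tone)
peak_bin = int(spec.mean(axis=1).argmax())
peak_hz = peak_bin * sample_rate / 256  # bin index -> frequency in Hz
```

The resulting `spec` array has one row per frequency bin and one column per time frame, so it can be fed to an image model like any single-channel picture. Generating audio then amounts to generating such an array and inverting it back to a waveform, a step this sketch leaves out.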

Then Jeremy wraps up "Stable diffusion from scratch" by showing how to use the latents of a variational autoencoder (VAE) as the "pixels" in a regular diffusion model. He also describes an intriguing new idea for students to follow up: what if you use latents for other purposes, such as a classification model? Perhaps this would open up a whole world of possibilities, such as latents-FID, latents-perceptual-loss, and new approaches to diffusion guidance!
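The idea of treating latents as "pixels" can be sketched as follows: encode each image to a smaller latent, then run the usual diffusion noising step on the latent instead of the image. This is only a toy illustration under stated assumptions. `toy_encode` is a hypothetical stand-in (simple average pooling) for a trained VAE encoder, and the noising formula is the standard DDPM-style forward process rather than anything confirmed to match the lesson's notebook.

```python
import numpy as np

rng = np.random.default_rng(0)

def toy_encode(images, factor=8):
    """Hypothetical stand-in for a VAE encoder: average-pool by `factor`.

    A real VAE encoder is a trained network; pooling is used here only
    so the example runs without any model weights.
    """
    b, c, h, w = images.shape
    pooled = images.reshape(b, c, h // factor, factor, w // factor, factor)
    return pooled.mean(axis=(3, 5))

def add_noise(latents, alpha_bar, rng):
    """Forward noising: sqrt(alpha_bar) * z + sqrt(1 - alpha_bar) * eps."""
    noise = rng.standard_normal(latents.shape)
    ab = alpha_bar.reshape(-1, 1, 1, 1)  # one noise level per batch item
    noisy = np.sqrt(ab) * latents + np.sqrt(1.0 - ab) * noise
    return noisy, noise

# Assumed toy batch: 4 random "images" of shape 3 x 64 x 64.
images = rng.standard_normal((4, 3, 64, 64))
latents = toy_encode(images)                  # shape (4, 3, 8, 8)
alpha_bar = rng.uniform(0.01, 0.99, size=4)   # random noise level per item
noisy_latents, target_noise = add_noise(latents, alpha_bar, rng)
# A denoising model would be trained to predict `target_noise` from
# `noisy_latents` and the noise level, exactly as it would for pixels.
```

Because the latent is 64 times smaller than the image in this toy setup, the diffusion model has far fewer values to process per sample, which is the main practical appeal of working in latent space. At sampling time the generated latent would be passed through the VAE decoder to get an image back.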