Master the essential models, algorithms, tools, and techniques for interpreting and generating images using AI.
From digital special effects to medical image augmentation and analysis, generative AI is revolutionizing the way we create and interpret visual information. Innovations including diffusion models, text-to-image generators, GANs, and more help you create photorealistic graphics, empower creative expression with text-to-image tools, and accurately recognize and describe visual elements for applications like image search.
Generative AI in Computer Vision will teach you the foundations of modern computer vision and equip you with the practical techniques you need to bring your ideas to life.
In
Generative AI in Computer Vision you’ll learn about:
- Variational autoencoders (VAEs) and generative adversarial networks (GANs)
- Diffusion models for high-quality image generation
- Evaluating models with metrics such as inception score and Fréchet inception distance
- Conditional and guided generation techniques
- Bridging language and vision using transformers and models like CLIP
- Implementing text-to-image models
Generative AI in Computer Vision guides you from core concepts of digital image creation to the cutting edge of AI-powered visual computing. You’ll unpack tools like DALL-E and Stable Diffusion and learn to build your own by following the detailed code samples and practical tutorials.