Manning Early Access Program (MEAP)
Read chapters as they are written, get the finished eBook as soon as it’s ready, and receive the pBook long before it's in bookstores.
AI applications need much more than a connection to a model. To work well in the real world, they need memory, access to company knowledge, safe ways to take action, clear rules, and reliable operations behind the scenes. Rather than rebuilding those pieces for every project, organizations can create a shared AI platform that supports all their AI applications.
Designing AI Systems shows you how to build that platform. Starting from scratch, you’ll create the core services that real AI applications depend on: a way to connect to different AI providers, a way to remember earlier interactions, a way to bring in your organization’s own data, and a way to let AI systems use tools and act safely within company rules. By the end, you’ll understand not just how to call an AI model but how to build the full system around it.
This book focuses on the part of AI development that usually gets overlooked: everything required to make an AI system reliable, manageable, and ready for production. With complete Python code in every chapter, you’ll build a working reference implementation you can adapt to your own needs. You’ll also learn the practical operational skills that matter in production, including managing costs, routing requests, enforcing safety controls, and evaluating system quality.
The result is a solid foundation for AI applications that can use company knowledge, carry out multi-step work, connect to outside tools, and remain trustworthy and governable as they scale.
what's inside
How to design and build an AI platform from the ground up
The core services that AI applications need in order to work reliably
A flexible way to work with different AI providers
Memory across conversations and strategies for handling long context
Methods for connecting AI to company data and documents
Safe, controlled ways for AI systems to use tools and take action
Production-ready communication patterns and request routing
Operational practices for monitoring, cost control, and quality evaluation
about the reader
For software engineers, platform engineers, and technical architects building and scaling AI systems. You should be comfortable with Python and basic software architecture.
about the authors
Suhas Suresha is a Senior Machine Learning Engineer at Adobe, where he builds large-scale generative AI platforms across the full machine learning lifecycle. He previously co-founded QALY, where he helped deploy real-time ECG analysis models to more than 100,000 users. He holds a master’s degree in computational and applied mathematics from Stanford University.
Dewang Sultania is a Senior Machine Learning Engineer at Netflix, where he designs scalable systems for multimodal generative AI, diffusion models, and video processing. Previously at Adobe, he built production systems for large language models, including data pipelines, fine-tuning, retrieval-based systems, and prompt engineering. He also helped design the platform infrastructure used to deploy large language models across Adobe’s product suite.
Introductory offer Save 50% for a limited time!
eBook
pdf, ePub, online
$47.99
$23.99
you save $24.00 (50%)
Introductory offer Save 50% for a limited time!
print
includes eBook
$59.99
$29.99
you save $30.00 (50%)
with subscription
free or 50% off
$24.99
pro $24.99 per month
access to all Manning books, MEAPs, liveVideos, liveProjects, and audiobooks!