Take a deep dive into Transformers and Large Language Models—the foundations of generative AI!
Generative AI has set up shop in almost every aspect of business and society. Transformers and Large Language Models (LLMs) now power everything from code creation tools like Copilot and Cursor to AI agents, live language translators, smart chatbots, text generators, and much more.
In Transformers and LLMs in Action you’ll discover:
- How transformers and LLMs work under the hood
- Adapting AI models to new tasks
- Optimizing LLM performance
- Text generation with reinforcement learning
- Multi-modal AI models
- Encoder-only, decoder-only, encoder-decoder, and small language models
This practical book gives you the background, mental models, and hands-on skills you need to put Gen AI to work.
What is a transformer?
A “transformer” is a neural network model that finds relationships in sequences of words or other data using a mathematical technique called attention. Because the attention mechanism allows transformers to focus on the most relevant parts of a sequence, transformers can learn context and meaning from even large bodies of text. LLMs like GPT, Gemini, and Claude are transformer-based models that have been trained on massive data sets, which gives them the uncanny ability to generate natural, coherent responses across a wide range of knowledge domains.
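To make the idea concrete, here is a minimal sketch of the scaled dot-product attention at the heart of a transformer, written in plain NumPy. The function name, shapes, and toy inputs are illustrative, not taken from any particular library: each query vector is compared against every key, the similarity scores are turned into weights with a softmax, and the output is a weighted sum of the value vectors.

```python
import numpy as np

def scaled_dot_product_attention(Q, K, V):
    """Each query attends to all keys; output is a weighted sum of values."""
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)                    # query-key similarities
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)     # softmax -> attention weights
    return weights @ V                                 # weighted sum of values

# Toy example: 3 tokens with 4-dimensional embeddings (random, for illustration)
rng = np.random.default_rng(0)
x = rng.normal(size=(3, 4))
out = scaled_dot_product_attention(x, x, x)            # self-attention: Q = K = V
print(out.shape)                                       # one output vector per token
```

In self-attention, as used inside a transformer layer, the queries, keys, and values all come from the same input sequence, so every token can draw context from every other token.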