Build a DeepSeek Model from Scratch addresses a hard truth many AI engineers face today: most resources explain what large language models are, but very few show how to actually build one that scales, stays stable, and performs competitively under real-world constraints. If you’ve tried to move beyond toy models—only to hit walls around memory limits, training instability, slow attention, or runaway costs—this book is written for you.
This book delivers a complete, production-minded blueprint for designing and training DeepSeek-class large language models from the ground up. It walks through the full lifecycle of modern LLM engineering: defining an efficient decoder-only architecture, integrating Mixture of Experts for scale, enabling long-context reasoning with efficient attention, and deploying models that can be served reliably and cost-effectively. Every design choice is explained from an engineering perspective, grounded in practices that work at billion-parameter scale.
You’ll learn how to move from architectural intent to operational reality—without hand-waving, fragile shortcuts, or purely academic abstractions.
By the end of this book, you’ll be able to:
Design a DeepSeek-style LLM architecture optimized for throughput, memory, and cost
Implement and scale Mixture of Experts layers without load collapse or routing instability
Train long-context models using efficient attention and KV cache strategies
Build streaming data pipelines that scale cleanly and remain reproducible
Stabilize billion-parameter training with the right optimizers, precision, and recovery workflows
Evaluate reasoning, language, and code performance without benchmark overfitting
Deploy and serve large models using quantization and modern inference patterns
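To make one of these topics concrete: the Mixture of Experts routing mentioned above typically works by scoring each token against every expert, keeping only the top-k experts, and renormalizing their gate weights. The sketch below is a minimal, framework-free illustration of that idea in plain Python (the function names and the four-expert example are hypothetical, not taken from the book), not the book's actual implementation:

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of gate logits.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def top_k_route(gate_logits, k=2):
    """Pick the top-k experts for one token and renormalize their weights.

    Returns a list of (expert_index, gate_weight) pairs whose weights sum to 1,
    so only k expert FFNs need to run for this token.
    """
    probs = softmax(gate_logits)
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    chosen = ranked[:k]
    kept = sum(probs[i] for i in chosen)
    return [(i, probs[i] / kept) for i in chosen]

# One token's gate logits over four hypothetical experts:
routes = top_k_route([2.0, 0.5, 1.0, -1.0], k=2)
```

A production router adds the pieces the book's bullet points allude to, such as an auxiliary load-balancing loss and capacity limits, which are exactly the safeguards against the "load collapse or routing instability" the synopsis mentions.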
Written for AI engineers, ML researchers, and systems builders, this book emphasizes practical execution over theory and replaces guesswork with tested engineering patterns. It assumes you want to build, not just experiment—and that reliability, performance, and scalability matter as much as raw capability.
Seller: Grand Eagle Retail, Bensenville, IL, U.S.A.
Paperback. Condition: New. This item is printed on demand. Shipping may be from multiple locations in the US or from the UK, depending on stock availability. Seller Inventory # 9798278967132
Seller: PBShop.store UK, Fairford, GLOS, United Kingdom
PAP. Condition: New. New Book. Delivered from our UK warehouse in 4 to 14 business days. THIS BOOK IS PRINTED ON DEMAND. Established seller since 2000. Seller Inventory # L0-9798278967132
Quantity: Over 20 available
Seller: CitiRetail, Stevenage, United Kingdom
Paperback. Condition: New. This item is printed on demand. Shipping may be from our UK warehouse or from our Australian or US warehouses, depending on stock availability. Seller Inventory # 9798278967132
Quantity: 1 available