Guide
How to Use This Book
Choose the path that matches your goal
Silicon-Up Reader
Start with Ch01–Ch05 to build first-principles performance intuition from hardware to TCO.
Inference Fast-Track
Read Ch01, Ch06, Ch08, Ch09, Ch11, Ch13, and Ch17 for LLM serving readiness.
Training Specialist
Prioritize Ch01, Ch04, Ch10, Ch12, Ch14, Ch15, and Ch17 for distributed training systems.
Interview Prep
Use Ch01, Ch02, Ch06, Ch10, Ch11, Ch14, and Ch18 for principal-level system design.
989.4T · H100 Dense BF16
3.35 · H100 HBM TB/s
295 · Dense Ridge F/B
16B · Bytes/param AdamW
TP≤8 · NVLink Domain Rule
82% · KV Cache Alert
6N · Training FLOPs/Token
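Several of the headline figures above are linked by simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, the constants are taken from the tiles, and the 7B-parameter model is an assumed example that does not come from the book. It reads "Dense Ridge F/B" as peak FLOP/s divided by memory bandwidth in bytes/s, "16B" as bytes of training state per parameter, and "6N" as training FLOPs per token for a model with N parameters.

```python
# Constants copied from the stat tiles (H100, dense BF16, HBM bandwidth).
H100_DENSE_BF16_FLOPS = 989.4e12   # FLOP/s
H100_HBM_BYTES_PER_S = 3.35e12     # bytes/s


def ridge_point(peak_flops: float, mem_bandwidth: float) -> float:
    """Roofline ridge point: FLOPs per byte at which compute and memory balance."""
    return peak_flops / mem_bandwidth


def adamw_state_bytes(n_params: int, bytes_per_param: int = 16) -> int:
    """Training-state memory, assuming the 16 bytes/param figure from the tile."""
    return n_params * bytes_per_param


def training_flops(n_params: int, n_tokens: int) -> int:
    """Approximate training compute using the 6N FLOPs-per-token rule."""
    return 6 * n_params * n_tokens


if __name__ == "__main__":
    n = 7_000_000_000  # assumed example model size, not from the source
    print(f"ridge point: {ridge_point(H100_DENSE_BF16_FLOPS, H100_HBM_BYTES_PER_S):.1f} FLOP/byte")
    print(f"AdamW state: {adamw_state_bytes(n) / 1e9:.0f} GB")
    print(f"FLOPs per token: {training_flops(n, 1):.2e}")
```

A kernel whose arithmetic intensity falls below the ridge point is limited by memory bandwidth on this roofline model; above it, compute is the limit. The memory figure covers weights, gradients, and optimizer state only and leaves out activations.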
Chapters 00–18
HTML where available · PDF always available
Architecture Diagrams
Open in browser · print to PDF if needed
Appendices
Reference tables, tools, and glossary