One sync, end to end (frontier-scale model)

Llama-3.1-405B in bf16 is 810 GB on disk (405B parameters at 2 bytes each). At RL learning rates, PULSE (Mihai & Belilovsky) reports ~99% per-step bf16 sparsity, so the actual delta is on the order of 6 GB. Watch the two modes side by side.

[Interactive comparison of the two modes. Readouts: payload sent (MB), inference paused (s), reduction vs full.]
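As a rough back-of-the-envelope check on the numbers quoted above, the sketch below takes the 810 GB full checkpoint and the roughly 6 GB delta and derives the three quantities the comparison reports. The link bandwidth (`LINK_GB_PER_S`) is a hypothetical value chosen only for illustration, and treating the inference pause as equal to the raw transfer time is a simplifying assumption; neither comes from the source.

```python
from dataclasses import dataclass

BYTES_PER_BF16 = 2  # bf16 stores each parameter in 2 bytes


@dataclass
class SyncEstimate:
    payload_gb: float   # data sent for one sync
    transfer_s: float   # time on the wire at the assumed bandwidth
    reduction: float    # full checkpoint size divided by payload size


def full_checkpoint_gb(n_params: float) -> float:
    """Size of a dense bf16 checkpoint in GB (1 GB = 1e9 bytes)."""
    return n_params * BYTES_PER_BF16 / 1e9


def estimate_sync(payload_gb: float, full_gb: float,
                  bandwidth_gb_per_s: float) -> SyncEstimate:
    """Estimate transfer time and reduction factor for one sync payload."""
    return SyncEstimate(
        payload_gb=payload_gb,
        transfer_s=payload_gb / bandwidth_gb_per_s,
        reduction=full_gb / payload_gb,
    )


# 405B parameters, as stated in the text.
full_gb = full_checkpoint_gb(405e9)

# ASSUMPTION: hypothetical link speed, not a figure from the source.
LINK_GB_PER_S = 10.0

# Mode 1: ship the whole checkpoint. Mode 2: ship only the ~6 GB delta.
full_sync = estimate_sync(full_gb, full_gb, LINK_GB_PER_S)
delta_sync = estimate_sync(6.0, full_gb, LINK_GB_PER_S)

for name, est in (("full", full_sync), ("delta", delta_sync)):
    print(f"{name:>5}: {est.payload_gb * 1000:.0f} MB sent, "
          f"{est.transfer_s:.1f} s on the wire, {est.reduction:.0f}x vs full")
```

With the source's figures, the reduction factor is 810 / 6 = 135 regardless of bandwidth. The absolute times scale inversely with whatever link speed applies in practice.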