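Llama-3.1-405B in bf16 is 810 GB on disk (405B parameters at 2 bytes each). At RL learning rates, PULSE (Mihai & Belilovsky) reports ~99% per-step bf16 sparsity, meaning roughly 99% of the bf16 weights do not change from one step to the next, so the actual per-step delta is on the order of 6 GB. Watch the two modes side by side.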
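The sizes above are plain arithmetic, sketched below. The function names, the decimal-GB convention (1 GB = 10^9 bytes), and the option to count index bytes are illustrative assumptions, not taken from PULSE. The sketch counts only the changed bf16 values, so it gives a rough bound rather than the size of any real delta format.

```python
# Back-of-the-envelope sizes for a full bf16 checkpoint vs. a sparse delta.
# All names here are hypothetical; this is not PULSE's encoding.

BYTES_PER_BF16 = 2
GB = 10**9  # decimal gigabytes, matching the "810 GB" figure


def checkpoint_bytes(n_params: int, bytes_per_param: int = BYTES_PER_BF16) -> int:
    """Size of a dense checkpoint: every parameter is stored."""
    return n_params * bytes_per_param


def sparse_delta_bytes(
    n_params: int,
    sparsity: float,
    bytes_per_value: int = BYTES_PER_BF16,
    bytes_per_index: int = 0,  # assumption: index overhead ignored by default
) -> int:
    """Size of a delta that stores only the parameters that changed."""
    changed = round(n_params * (1.0 - sparsity))
    return changed * (bytes_per_value + bytes_per_index)


N_PARAMS = 405_000_000_000  # Llama-3.1-405B

full = checkpoint_bytes(N_PARAMS)
print(f"full checkpoint: {full / GB:.1f} GB")

for sparsity in (0.99, 0.995):
    delta = sparse_delta_bytes(N_PARAMS, sparsity)
    print(f"sparsity {sparsity:.1%}: delta values {delta / GB:.2f} GB")
```

At exactly 99% sparsity the changed values alone come to 8.1 GB, and at 99.5% they come to 4.05 GB, so a delta of about 6 GB is consistent with sparsity slightly above 99%. That reading is an inference from the arithmetic, not a figure stated in the source.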