580 California St., Suite 400
San Francisco, CA, 94104
Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
Figure 7 Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency Figure 7: Measured speedup versus theoretical speedup at varying sparsity levels for a GPT-3 layer 12k x 12k matrix multiplication (MatMul) (Lie, 2021).
Discover breakthrough research and expand your academic network
Join for free