Academia.eduAcademia.edu

Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency   Figure 5: Benchmarking unstructured sparsity during (left) inference on Neural Magic’s DeepSparse runtime and (right) training acceleration on the Cerebras CS-2. In both setups, we measure the relative increase in latency or training speed for Sparse-IFT variants against the dense model.

Figure 5 Sparse-IFT: Sparse Iso-FLOP Transformations for Maximizing Training Efficiency Figure 5: Benchmarking unstructured sparsity during (left) inference on Neural Magic’s DeepSparse runtime and (right) training acceleration on the Cerebras CS-2. In both setups, we measure the relative increase in latency or training speed for Sparse-IFT variants against the dense model.