Programming Tools For Scientific Computing On Personal Desktop Systems

Figure 7 Nvidia GF100 (Fermi) processor with parallel kernel execution Single­precision performance of GF100 is about 1.7 Tflops but double­precision performance is only half at 800 Gflops, significantly better than the Radeon 5870. Previous architectures required that all SMs in the chip worked ...