The NVIDIA Pascal architecture enables the Tesla P100 to deliver the highest absolute performance for HPC and hyperscale workloads. With more than 21 TeraFLOPS of FP16 performance, Pascal is optimized to drive exciting possibilities in deep learning applications. Pascal also delivers over 5 and 10 TeraFLOPS of double and single precision performance for HPC workloads.
Performance is often throttled by the interconnect. The revolutionary NVIDIA NVLink high-speed bidirectional interconnect is designed to scale applications across multiple GPUs by delivering 5X higher performance compared to today's best-in-class technology.