GPU Comparison
GEFORCE
NVIDIA Tesla M40
CORE STATE
GM200
VRAM
12 GB
CLOCK SPEED
1112 MHz
TDP
250 W
BUS WIDTH
384 bit
ARCHITECTURE
Maxwell 2.0
PROCESS
28 nm
LAUNCH DATE
2015
VS
GEFORCE
Tesla T4
CORE STATE
TU104
VRAM
16 GB
CLOCK SPEED
1590 MHz
TDP
70 W
BUS WIDTH
256 bit
ARCHITECTURE
Turing
PROCESS
12 nm
LAUNCH DATE
2018
PERFORMANCE BENCHMARKS
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
Tesla M40
Tesla T4
Core Specs
Shading Units
3,072
2,560
-16.7%
Shaders
3,072
2,560
-16.7%
TMUs
192
160
-16.7%
ROPs
96
64
-33.3%
SM Count
—
40
Clocks
Base Clock
948 MHz
585 MHz
Boost Clock
1112 MHz
1590 MHz
Memory Clock
1502 MHz
6 Gbps effective
1250 MHz
10 Gbps effective
Memory
Memory Size
12 GB
16 GB
VRAM (MB)
12,288
16,384
+33.3%
Memory Type
GDDR5
GDDR6
Memory Bus
384 bit
256 bit
Bandwidth
288.4 GB/s
320.0 GB/s
Cache
L1 Cache
48 KB (per SMM)
64 KB (per SM)
L2 Cache
3 MB
4 MB
Performance
Pixel Rate
106.8 GPixel/s
101.8 GPixel/s
Texture Rate
213.5 GTexel/s
254.4 GTexel/s
FP32 (TFLOPS)
6.832 TFLOPS
8.141 TFLOPS
FP64 (TFLOPS)
213.5 GFLOPS (1:32)
254.4 GFLOPS (1:32)
FP16 (TFLOPS)
—
65.13 TFLOPS (8:1)
AI/RT
RT Cores
—
40
Tensor Cores
—
320
Power
TDP
250 W
70 W
TDP (W)
250
70
-72.0%
Suggested PSU
600 W
250 W
Power Connectors
8-pin EPS
None
Architecture
Architecture
Maxwell 2.0
Turing
GPU Name
GM200
TU104
Generation
Tesla Maxwell
(Mxx)
Tesla Turing
(Txx)
Process Size
28 nm
12 nm
Transistors
8,000 million
13,600 million
Die Size
601 mm²
545 mm²
Foundry
TSMC
TSMC
Density
13.3M / mm²
25.0M / mm²
API Support
DirectX
12 (12_1)
12 Ultimate (12_2)
OpenGL
4.6
4.6
Vulkan
1.4
1.4
OpenCL
3.0
3.0
CUDA
5.2
7.5
Shader Model
6.8
6.8
Physical
Slot Width
Dual-slot
Single-slot
Length
267 mm
10.5 inches
168 mm
6.6 inches
Outputs
No outputs
No outputs
Bus Interface
PCIe 3.0 x16
PCIe 3.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
Tesla Kepler
Tesla Volta
Successor
Tesla Pascal
Server Ampere