GPU Comparison
GEFORCE
NVIDIA A10M
CORE STATE: GA102
VRAM: 20 GB
CLOCK SPEED: 1635 MHz
TDP: 150 W
BUS WIDTH: 320 bit
ARCHITECTURE: Ampere
PROCESS: 8 nm
LAUNCH DATE: —
VS
GEFORCE
Tesla T4
CORE STATE: TU104
VRAM: 16 GB
CLOCK SPEED: 1590 MHz
TDP: 70 W
BUS WIDTH: 256 bit
ARCHITECTURE: Turing
PROCESS: 12 nm
LAUNCH DATE: 2018
PERFORMANCE BENCHMARKS
[Charts: Geekbench OpenCL, Geekbench Vulkan]
DETAILED SPECIFICATIONS
SPECIFICATION | A10M | Tesla T4 | CHANGE
Core Specs
Shading Units | 7,168 | 2,560 | -64.3%
TMUs | 224 | 160 | -28.6%
ROPs | 80 | 64 | -20.0%
SM Count | 56 | 40 | -28.6%
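The percentage figures in this listing appear to be the relative change from the A10M value to the Tesla T4 value. The following is a minimal sketch of that arithmetic under that assumption; the helper name `percent_change` and the `core_specs` dictionary are illustrative and not part of the original page.

```python
def percent_change(a10m: float, t4: float) -> float:
    """Relative change from the A10M value to the Tesla T4 value, in percent."""
    return (t4 - a10m) / a10m * 100.0


# (A10M, Tesla T4) pairs copied from the Core Specs rows.
core_specs = {
    "Shading Units": (7168, 2560),
    "TMUs": (224, 160),
    "ROPs": (80, 64),
    "SM Count": (56, 40),
}

# Round to one decimal place, matching the precision used in the listing.
deltas = {name: round(percent_change(a, b), 1) for name, (a, b) in core_specs.items()}

for name, delta in deltas.items():
    print(f"{name}: {delta:+.1f}%")
```

A negative value means the Tesla T4 has fewer units than the A10M; the same helper applied to the Tensor Core counts (224 and 320) gives a positive value.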
Clocks
Base Clock | 975 MHz | 585 MHz
Boost Clock | 1635 MHz | 1590 MHz
Memory Clock | 1563 MHz (12.5 Gbps effective) | 1250 MHz (10 Gbps effective)
Memory
Memory Size | 20 GB (20,480 MB) | 16 GB (16,384 MB) | -20.0%
Memory Type | GDDR6 | GDDR6
Memory Bus | 320 bit | 256 bit
Bandwidth | 500.2 GB/s | 320.0 GB/s
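The bandwidth figures follow from the bus width and the effective memory data rate. The sketch below assumes the effective rate is eight times the listed memory clock, which is inferred from the listing itself (1563 MHz is shown as 12.5 Gbps and 1250 MHz as 10 Gbps); the function names are hypothetical.

```python
def effective_gbps(memory_clock_mhz: float, transfers_per_clock: int = 8) -> float:
    """Per-pin data rate in Gbps. The factor of 8 is inferred from the listing."""
    return memory_clock_mhz * transfers_per_clock / 1000


def bandwidth_gb_s(bus_width_bits: int, memory_clock_mhz: float) -> float:
    """Peak bandwidth in GB/s: bus width in bytes times the per-pin data rate."""
    return bus_width_bits / 8 * effective_gbps(memory_clock_mhz)


a10m_bw = bandwidth_gb_s(320, 1563)  # about 500.16, listed as 500.2 GB/s
t4_bw = bandwidth_gb_s(256, 1250)    # 320.0, listed as 320.0 GB/s

print(f"A10M: {a10m_bw:.1f} GB/s, Tesla T4: {t4_bw:.1f} GB/s")
```

Using the rounded 12.5 Gbps figure instead of the 1563 MHz clock would give exactly 500.0 GB/s, which is why the listed value is slightly higher.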
Cache
L1 Cache | 128 KB (per SM) | 64 KB (per SM)
L2 Cache | 6 MB | 4 MB
Performance
Pixel Rate | 130.8 GPixel/s | 101.8 GPixel/s
Texture Rate | 366.2 GTexel/s | 254.4 GTexel/s
FP32 | 23.44 TFLOPS | 8.141 TFLOPS
FP64 | 732.5 GFLOPS (1:32) | 254.4 GFLOPS (1:32)
FP16 | 23.44 TFLOPS (1:1) | 65.13 TFLOPS (8:1)
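These throughput figures are theoretical peaks that can be reproduced from the unit counts and boost clocks listed earlier. The sketch below uses the conventional formulas (one pixel per ROP per clock, one texel per TMU per clock, one fused multiply-add counted as two FLOPs per shader per clock) and the FP64 and FP16 ratios given in parentheses in the listing. The `Gpu` class is a hypothetical container for illustration only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gpu:
    name: str
    shaders: int
    tmus: int
    rops: int
    boost_mhz: float

    def pixel_rate(self) -> float:
        """Peak fill rate in GPixel/s: one pixel per ROP per boost clock."""
        return self.rops * self.boost_mhz / 1000

    def texture_rate(self) -> float:
        """Peak texture rate in GTexel/s: one texel per TMU per boost clock."""
        return self.tmus * self.boost_mhz / 1000

    def fp32_tflops(self) -> float:
        """Peak FP32 in TFLOPS: one FMA (2 FLOPs) per shader per boost clock."""
        return 2 * self.shaders * self.boost_mhz / 1e6


a10m = Gpu("A10M", shaders=7168, tmus=224, rops=80, boost_mhz=1635)
t4 = Gpu("Tesla T4", shaders=2560, tmus=160, rops=64, boost_mhz=1590)

# FP64 and FP16 rates relative to FP32, taken from the ratios in the listing.
for gpu, fp64_ratio, fp16_ratio in ((a10m, 1 / 32, 1), (t4, 1 / 32, 8)):
    fp32 = gpu.fp32_tflops()
    print(
        f"{gpu.name}: {gpu.pixel_rate():.1f} GPixel/s, "
        f"{gpu.texture_rate():.1f} GTexel/s, "
        f"FP32 {fp32:.3f} TFLOPS, "
        f"FP64 {fp32 * fp64_ratio * 1000:.1f} GFLOPS, "
        f"FP16 {fp32 * fp16_ratio:.2f} TFLOPS"
    )
```

The Tesla T4's FP16 figure exceeds its FP32 figure only because of the 8:1 ratio shown in the listing; real workloads will generally fall below all of these peak values.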
AI/RT
RT Cores | 56 | 40 | -28.6%
Tensor Cores | 224 | 320 | +42.9%
Power
TDP | 150 W | 70 W | -53.3%
Suggested PSU | 450 W | 250 W
Power Connectors | 8-pin EPS | None
Architecture
Architecture | Ampere | Turing
GPU Name | GA102 | TU104
Generation | Server Ampere (Axx) | Tesla Turing (Txx)
Process Size | 8 nm | 12 nm
Transistors | 28,300 million | 13,600 million
Die Size | 628 mm² | 545 mm²
Foundry | Samsung | TSMC
Density | 45.1M / mm² | 25.0M / mm²
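The density row is the transistor count divided by the die area. A minimal sketch of that division, using the figures from the listing (the function name `density_m_per_mm2` is illustrative):

```python
def density_m_per_mm2(transistors_millions: float, die_area_mm2: float) -> float:
    """Transistor density in millions of transistors per square millimetre."""
    return transistors_millions / die_area_mm2


ga102_density = density_m_per_mm2(28_300, 628)  # listed as 45.1M / mm²
tu104_density = density_m_per_mm2(13_600, 545)  # listed as 25.0M / mm²

print(f"GA102: {ga102_density:.1f}M / mm², TU104: {tu104_density:.1f}M / mm²")
```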
API Support
DirectX | 12 Ultimate (12_2) | 12 Ultimate (12_2)
OpenGL | 4.6 | 4.6
Vulkan | 1.4 | 1.4
OpenCL | 3.0 | 3.0
CUDA (compute capability) | 8.6 | 7.5
Shader Model | 6.8 | 6.8
Physical
Slot Width | Single-slot | Single-slot
Length | 267 mm (10.5 inches) | 168 mm (6.6 inches)
Height | 112 mm (4.4 inches) | —
Outputs | No outputs | No outputs
Bus Interface | PCIe 4.0 x16 | PCIe 3.0 x16
Other
Production | End-of-life | End-of-life
Predecessor | Tesla Turing | Tesla Volta
Successor | Server Ada | Server Ampere