GPU Comparison
GEFORCE
NVIDIA A10M
CORE STATE
GA102
VRAM
20 GB
CLOCK SPEED
1635 MHz
TDP
150 W
BUS WIDTH
320 bit
ARCHITECTURE
Ampere
PROCESS
8 nm
LAUNCH DATE
—
VS
GEFORCE
Tesla M2090
CORE STATE
GF110
VRAM
6 GB
CLOCK SPEED
—
TDP
250 W
BUS WIDTH
384 bit
ARCHITECTURE
Fermi 2.0
PROCESS
40 nm
LAUNCH DATE
2011
PERFORMANCE BENCHMARKS
geekbench_opencl
DETAILED SPECIFICATIONS
SPECIFICATION
A10M
Tesla M2090
Core Specs
Shading Units
7,168
512
-92.9%
Shaders
7,168
512
-92.9%
TMUs
224
64
-71.4%
ROPs
80
48
-40.0%
SM Count
56
16
-71.4%
Clocks
Base Clock
975 MHz
—
Boost Clock
1635 MHz
—
GPU Clock
—
651 MHz
Shader Clock
—
1301 MHz
Memory Clock
1563 MHz
12.5 Gbps effective
924 MHz
3.7 Gbps effective
Memory
Memory Size
20 GB
6 GB
VRAM (MB)
20,480
6,144
-70.0%
Memory Type
GDDR6
GDDR5
Memory Bus
320 bit
384 bit
Bandwidth
500.2 GB/s
177.4 GB/s
Cache
L1 Cache
128 KB (per SM)
64 KB (per SM)
L2 Cache
6 MB
768 KB
Performance
Pixel Rate
130.8 GPixel/s
20.83 GPixel/s
Texture Rate
366.2 GTexel/s
41.66 GTexel/s
FP32 (TFLOPS)
23.44 TFLOPS
1,332.2 GFLOPS
FP64 (TFLOPS)
732.5 GFLOPS (1:32)
666.1 GFLOPS (1:2)
FP16 (TFLOPS)
23.44 TFLOPS (1:1)
—
AI/RT
RT Cores
56
—
Tensor Cores
224
—
Power
TDP
150 W
250 W
TDP (W)
150
250
+66.7%
Suggested PSU
450 W
600 W
Power Connectors
8-pin EPS
1x 6-pin + 1x 8-pin
Architecture
Architecture
Ampere
Fermi 2.0
GPU Name
GA102
GF110
Generation
Server Ampere
(Axx)
Tesla Fermi
(x20xx)
Process Size
8 nm
40 nm
Transistors
28,300 million
3,000 million
Die Size
628 mm²
520 mm²
Foundry
Samsung
TSMC
Density
45.1M / mm²
5.8M / mm²
API Support
DirectX
12 Ultimate (12_2)
12 (11_0)
OpenGL
4.6
4.6
Vulkan
1.4
—
OpenCL
3.0
1.1
CUDA
8.6
2.0
Shader Model
6.8
5.1
Physical
Slot Width
Single-slot
Dual-slot
Length
267 mm
10.5 inches
248 mm
9.8 inches
Height
112 mm
4.4 inches
—
Outputs
No outputs
No outputs
Bus Interface
PCIe 4.0 x16
PCIe 2.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
Tesla Turing
Tesla
Successor
Server Ada
Tesla Kepler