GPU Comparison

NVIDIA
GEFORCE

NVIDIA A10M

CORE STATE GA102
VRAM 20 GB
CLOCK SPEED 1635 MHz
TDP 150 W
BUS WIDTH 320 bit
ARCHITECTURE Ampere
nm
PROCESS 8 nm
LAUNCH DATE
VS
NVIDIA
GEFORCE

Tesla C2070

CORE STATE GF100
VRAM 6 GB
CLOCK SPEED
TDP 238 W
BUS WIDTH 384 bit
ARCHITECTURE Fermi
nm
PROCESS 40 nm
LAUNCH DATE 2011

PERFORMANCE BENCHMARKS

geekbench_opencl
135,230
9,716

DETAILED SPECIFICATIONS

SPECIFICATION
A10M
Tesla C2070
Core Specs
Shading Units
7,168
448 -93.8%
Shaders
7,168
448 -93.8%
TMUs
224
56 -75.0%
ROPs
80
48 -40.0%
SM Count
56
14 -75.0%
Clocks
Base Clock
975 MHz
Boost Clock
1635 MHz
GPU Clock
574 MHz
Shader Clock
1147 MHz
Memory Clock
1563 MHz 12.5 Gbps effective
747 MHz 3 Gbps effective
Memory
Memory Size
20 GB
6 GB
VRAM (MB)
20,480
6,144 -70.0%
Memory Type
GDDR6
GDDR5
Memory Bus
320 bit
384 bit
Bandwidth
500.2 GB/s
143.4 GB/s
Cache
L1 Cache
128 KB (per SM)
64 KB (per SM)
L2 Cache
6 MB
768 KB
Performance
Pixel Rate
130.8 GPixel/s
16.07 GPixel/s
Texture Rate
366.2 GTexel/s
32.14 GTexel/s
FP32 (TFLOPS)
23.44 TFLOPS
1,027.7 GFLOPS
FP64 (TFLOPS)
732.5 GFLOPS (1:32)
513.9 GFLOPS (1:2)
FP16 (TFLOPS)
23.44 TFLOPS (1:1)
AI/RT
RT Cores
56
Tensor Cores
224
Power
TDP
150 W
238 W
TDP (W)
150
238 +58.7%
Suggested PSU
450 W
550 W
Power Connectors
8-pin EPS
1x 6-pin + 1x 8-pin
Architecture
Architecture
Ampere
Fermi
GPU Name
GA102
GF100
Generation
Server Ampere (Axx)
Tesla Fermi (x20xx)
Process Size
8 nm
40 nm
Transistors
28,300 million
3,100 million
Die Size
628 mm²
529 mm²
Foundry
Samsung
TSMC
Density
45.1M / mm²
5.9M / mm²
API Support
DirectX
12 Ultimate (12_2)
12 (11_0)
OpenGL
4.6
4.6
Vulkan
1.4
OpenCL
3.0
1.1
CUDA
8.6
2.0
Shader Model
6.8
5.1
Physical
Slot Width
Single-slot
Dual-slot
Length
267 mm 10.5 inches
248 mm 9.8 inches
Height
112 mm 4.4 inches
Outputs
No outputs
1x DVI
Bus Interface
PCIe 4.0 x16
PCIe 2.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
Tesla Turing
Tesla
Successor
Server Ada
Tesla Kepler
View A10M Details View Tesla C2070 Details