GPU Comparison

NVIDIA RTX A4000

GPU NAME GA104
VRAM 16 GB
BOOST CLOCK 1560 MHz
TDP 140 W
BUS WIDTH 256 bit
ARCHITECTURE Ampere
PROCESS 8 nm
LAUNCH DATE 2021
VS

Tesla M40

GPU NAME GM200
VRAM 12 GB
BOOST CLOCK 1112 MHz
TDP 250 W
BUS WIDTH 384 bit
ARCHITECTURE Maxwell 2.0
PROCESS 28 nm
LAUNCH DATE 2015

PERFORMANCE BENCHMARKS

BENCHMARK                     RTX A4000     Tesla M40
3DMark Steel Nomad (DX12)     2,604         N/A
Geekbench OpenCL              121,988       39,192
Geekbench Vulkan              111,712       44,602
PassMark DirectX 9            240           N/A
PassMark DirectX 10           126           N/A
PassMark DirectX 11           158           N/A
PassMark DirectX 12           72            N/A
PassMark G2D                  1,024         N/A
PassMark G3D                  19,459        N/A
PassMark GPU Compute          9,760         N/A
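
Only the two Geekbench tests have a score for both cards, so those are the only rows that support a direct comparison. The sketch below shows one way to compute the score ratio while skipping N/A entries. The dictionary layout and the `speedup` helper are illustrative names, not part of any benchmark tool.

```python
# Scores copied from the benchmark list above as (RTX A4000, Tesla M40).
# None stands for an N/A entry.
scores = {
    "geekbench_opencl": (121_988, 39_192),
    "geekbench_vulkan": (111_712, 44_602),
    "passmark_g3d": (19_459, None),
}


def speedup(a4000, m40):
    """Return the A4000/M40 score ratio, or None if either score is missing."""
    if a4000 is None or m40 is None:
        return None
    return a4000 / m40


ratios = {name: speedup(*pair) for name, pair in scores.items()}

for name, ratio in ratios.items():
    print(name, "N/A" if ratio is None else f"{ratio:.2f}x")
```

On these numbers the RTX A4000 scores roughly 3.1 times the Tesla M40 in Geekbench OpenCL and 2.5 times in Geekbench Vulkan. The PassMark rows cannot be compared because the Tesla M40 has no listed result.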

DETAILED SPECIFICATIONS

SPECIFICATION               RTX A4000                       Tesla M40

Core Specs
Shading Units               6,144                           3,072 (-50.0%)
TMUs                        192                             192 (0.0%)
ROPs                        96                              96 (0.0%)
SM Count                    48                              not listed

Clocks
Base Clock                  735 MHz                         948 MHz
Boost Clock                 1560 MHz                        1112 MHz
Memory Clock                1750 MHz (14 Gbps effective)    1502 MHz (6 Gbps effective)

Memory
Memory Size                 16 GB (16,384 MB)               12 GB (12,288 MB, -25.0%)
Memory Type                 GDDR6                           GDDR5
Memory Bus                  256 bit                         384 bit
Bandwidth                   448.0 GB/s                      288.4 GB/s

Cache
L1 Cache                    128 KB (per SM)                 48 KB (per SMM)
L2 Cache                    4 MB                            3 MB

Performance
Pixel Rate                  149.8 GPixel/s                  106.8 GPixel/s
Texture Rate                299.5 GTexel/s                  213.5 GTexel/s
FP32                        19.17 TFLOPS                    6.832 TFLOPS
FP64                        299.5 GFLOPS (1:64)             213.5 GFLOPS (1:32)
FP16                        19.17 TFLOPS (1:1)              not listed

AI/RT
RT Cores                    48                              not listed
Tensor Cores                192                             not listed

Power
TDP                         140 W                           250 W (+78.6%)
Suggested PSU               300 W                           600 W
Power Connectors            1x 6-pin                        8-pin EPS

Architecture
Architecture                Ampere                          Maxwell 2.0
GPU Name                    GA104                           GM200
Generation                  Workstation Ampere (Ax000)      Tesla Maxwell (Mxx)
Process Size                8 nm                            28 nm
Transistors                 17,400 million                  8,000 million
Die Size                    392 mm²                         601 mm²
Foundry                     Samsung                         TSMC
Density                     44.4M / mm²                     13.3M / mm²

API Support
DirectX                     12 Ultimate (12_2)              12 (12_1)
OpenGL                      4.6                             4.6
Vulkan                      1.4                             1.4
OpenCL                      3.0                             3.0
CUDA (compute capability)   8.6                             5.2
Shader Model                6.8                             6.8

Physical
Slot Width                  Single-slot                     Dual-slot
Length                      241 mm (9.5 in)                 267 mm (10.5 in)
Height                      112 mm (4.4 in)                 not listed
Outputs                     4x DisplayPort 1.4a             No outputs
Bus Interface               PCIe 4.0 x16                    PCIe 3.0 x16

Other
Production                  End-of-life                     End-of-life
Predecessor                 Quadro Turing                   Tesla Kepler
Successor                   Workstation Ada                 Tesla Pascal
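
Several rows in the specification list (FP32, pixel rate, texture rate, bandwidth, density) appear to be derived from the unit counts and clocks rather than measured. The sketch below reproduces them under stated assumptions: 2 FLOPs per shader per clock at boost, and a per-pin data rate of 8x the listed memory clock for GDDR6 and 4x for GDDR5, which is what the "effective" figures in the list imply. The `Gpu` class and its method names are illustrative only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gpu:
    name: str
    shaders: int
    tmus: int
    rops: int
    boost_mhz: float
    mem_clock_mhz: float
    transfers_per_clock: int  # assumption: 8 for GDDR6, 4 for GDDR5
    bus_bits: int
    transistors_million: float
    die_mm2: float

    def fp32_tflops(self) -> float:
        # Assumes one fused multiply-add (2 FLOPs) per shader per boost clock.
        return self.shaders * 2 * self.boost_mhz / 1e6

    def pixel_rate_gpixels(self) -> float:
        # One pixel per ROP per boost clock.
        return self.rops * self.boost_mhz / 1e3

    def texture_rate_gtexels(self) -> float:
        # One texel per TMU per boost clock.
        return self.tmus * self.boost_mhz / 1e3

    def bandwidth_gbs(self) -> float:
        # Per-pin rate in Gbps multiplied by the bus width in bytes.
        gbps_per_pin = self.mem_clock_mhz * self.transfers_per_clock / 1e3
        return gbps_per_pin * self.bus_bits / 8

    def density_m_per_mm2(self) -> float:
        return self.transistors_million / self.die_mm2


# Values copied from the specification list above.
a4000 = Gpu("RTX A4000", 6144, 192, 96, 1560, 1750, 8, 256, 17_400, 392)
m40 = Gpu("Tesla M40", 3072, 192, 96, 1112, 1502, 4, 384, 8_000, 601)

for gpu in (a4000, m40):
    print(
        f"{gpu.name}: {gpu.fp32_tflops():.3f} TFLOPS FP32, "
        f"{gpu.pixel_rate_gpixels():.1f} GPixel/s, "
        f"{gpu.texture_rate_gtexels():.1f} GTexel/s, "
        f"{gpu.bandwidth_gbs():.1f} GB/s, "
        f"{gpu.density_m_per_mm2():.1f}M transistors/mm²"
    )
```

After rounding, the computed values agree with the listed ones, which suggests the page's throughput figures are theoretical peaks at boost clock and not sustained measurements.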