GPU Comparison
GEFORCE
NVIDIA L40
CORE STATE
AD102
VRAM
48 GB
CLOCK SPEED
2490 MHz
TDP
300 W
BUS WIDTH
384 bit
ARCHITECTURE
Ada Lovelace
PROCESS
5 nm
LAUNCH DATE
2022
VS
GEFORCE
Tesla C2070
CORE STATE
GF100
VRAM
6 GB
CLOCK SPEED
—
TDP
238 W
BUS WIDTH
384 bit
ARCHITECTURE
Fermi
PROCESS
40 nm
LAUNCH DATE
2011
PERFORMANCE BENCHMARKS
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
L40
Tesla C2070
Core Specs
Shading Units
18,176
448
-97.5%
Shaders
18,176
448
-97.5%
TMUs
568
56
-90.1%
ROPs
192
48
-75.0%
SM Count
142
14
-90.1%
Clocks
Base Clock
735 MHz
—
Boost Clock
2490 MHz
—
GPU Clock
—
574 MHz
Shader Clock
—
1147 MHz
Memory Clock
2250 MHz
18 Gbps effective
747 MHz
3 Gbps effective
Memory
Memory Size
48 GB
6 GB
VRAM (MB)
49,152
6,144
-87.5%
Memory Type
GDDR6
GDDR5
Memory Bus
384 bit
384 bit
Bandwidth
864.0 GB/s
143.4 GB/s
Cache
L1 Cache
128 KB (per SM)
64 KB (per SM)
L2 Cache
96 MB
768 KB
Performance
Pixel Rate
478.1 GPixel/s
16.07 GPixel/s
Texture Rate
1,414.3 GTexel/s
32.14 GTexel/s
FP32 (TFLOPS)
90.52 TFLOPS
1,027.7 GFLOPS
FP64 (TFLOPS)
1,414.3 GFLOPS (1:64)
513.9 GFLOPS (1:2)
FP16 (TFLOPS)
90.52 TFLOPS (1:1)
—
AI/RT
RT Cores
142
—
Tensor Cores
568
—
Power
TDP
300 W
238 W
TDP (W)
300
238
-20.7%
Suggested PSU
700 W
550 W
Power Connectors
1x 16-pin
1x 6-pin + 1x 8-pin
Architecture
Architecture
Ada Lovelace
Fermi
GPU Name
AD102
GF100
Generation
Server Ada
(Lxx)
Tesla Fermi
(x20xx)
Process Size
5 nm
40 nm
Transistors
76,300 million
3,100 million
Die Size
609 mm²
529 mm²
Foundry
TSMC
TSMC
Density
125.3M / mm²
5.9M / mm²
API Support
DirectX
12 Ultimate (12_2)
12 (11_0)
OpenGL
4.6
4.6
Vulkan
1.4
—
OpenCL
3.0
1.1
CUDA
8.9
2.0
Shader Model
6.8
5.1
Physical
Slot Width
Dual-slot
Dual-slot
Length
267 mm
10.5 inches
248 mm
9.8 inches
Height
111 mm
4.4 inches
—
Outputs
4x DisplayPort 1.4a
1x DVI
Bus Interface
PCIe 4.0 x16
PCIe 2.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
Server Ampere
Tesla
Successor
Server Hopper
Tesla Kepler