GPU Comparison
RADEON
AMD Radeon R9 M295X
CORE STATE
Amethyst
VRAM
4 GB
CLOCK SPEED
—
TDP
250 W
BUS WIDTH
256 bit
ARCHITECTURE
GCN 3.0
PROCESS
28 nm
LAUNCH DATE
2014
VS
GEFORCE
L40S
CORE STATE
AD102
VRAM
48 GB
CLOCK SPEED
2520 MHz
TDP
300 W
BUS WIDTH
384 bit
ARCHITECTURE
Ada Lovelace
PROCESS
5 nm
LAUNCH DATE
2022
PERFORMANCE BENCHMARKS
geekbench_metal
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
R9 M295X
L40S
Core Specs
Shading Units
2,048
18,176
+787.5%
Shaders
2,048
18,176
+787.5%
TMUs
128
568
+343.8%
ROPs
32
192
+500.0%
Compute Units
32
—
SM Count
—
142
Clocks
Base Clock
—
1110 MHz
Boost Clock
—
2520 MHz
GPU Clock
723 MHz
—
Memory Clock
1250 MHz
5 Gbps effective
2250 MHz
18 Gbps effective
Memory
Memory Size
4 GB
48 GB
VRAM (MB)
4,096
49,152
+1100.0%
Memory Type
GDDR5
GDDR6
Memory Bus
256 bit
384 bit
Bandwidth
160.0 GB/s
864.0 GB/s
Cache
L1 Cache
16 KB (per CU)
128 KB (per SM)
L2 Cache
512 KB
48 MB
Performance
Pixel Rate
23.14 GPixel/s
483.8 GPixel/s
Texture Rate
92.54 GTexel/s
1,431.4 GTexel/s
FP32 (TFLOPS)
2.961 TFLOPS
91.61 TFLOPS
FP64 (TFLOPS)
185.1 GFLOPS (1:16)
1,431.4 GFLOPS (1:64)
FP16 (TFLOPS)
2.961 TFLOPS (1:1)
91.61 TFLOPS (1:1)
AI/RT
RT Cores
—
142
Tensor Cores
—
568
Power
TDP
250 W
300 W
TDP (W)
250
300
+20.0%
Suggested PSU
—
700 W
Power Connectors
None
1x 16-pin
Architecture
Architecture
GCN 3.0
Ada Lovelace
GPU Name
Amethyst
AD102
Generation
Gem System
(R9 M200)
Server Ada
(Lxx)
Process Size
28 nm
5 nm
Transistors
5,000 million
76,300 million
Die Size
366 mm²
609 mm²
Foundry
TSMC
TSMC
Density
13.7M / mm²
125.3M / mm²
API Support
DirectX
12 (12_0)
12 Ultimate (12_2)
OpenGL
4.6
4.6
Vulkan
1.2.170
1.4
OpenCL
2.1
3.0
CUDA
—
8.9
Shader Model
6.5
6.8
Physical
Slot Width
MXM Module
Dual-slot
Length
—
267 mm
10.5 inches
Height
—
111 mm
4.4 inches
Outputs
Portable Device Dependent
1x HDMI 2.13x DisplayPort 1.4a
Bus Interface
MXM-B (3.0)
PCIe 4.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
Solar System
Server Ampere
Successor
Polaris Mobile
Server Hopper