GPU Comparison
RADEON
AMD Radeon R9 M295X
CORE STATE
Amethyst
VRAM
4 GB
CLOCK SPEED
—
TDP
250 W
BUS WIDTH
256 bit
ARCHITECTURE
GCN 3.0
PROCESS
28 nm
LAUNCH DATE
2014
VS
GEFORCE
Tesla T4
CORE STATE
TU104
VRAM
16 GB
CLOCK SPEED
1590 MHz
TDP
70 W
BUS WIDTH
256 bit
ARCHITECTURE
Turing
PROCESS
12 nm
LAUNCH DATE
2018
PERFORMANCE BENCHMARKS
geekbench_metal
geekbench_opencl
geekbench_vulkan
DETAILED SPECIFICATIONS
SPECIFICATION
R9 M295X
Tesla T4
Core Specs
Shading Units
2,048
2,560
+25.0%
Shaders
2,048
2,560
+25.0%
TMUs
128
160
+25.0%
ROPs
32
64
+100.0%
Compute Units
32
—
SM Count
—
40
Clocks
Base Clock
—
585 MHz
Boost Clock
—
1590 MHz
GPU Clock
723 MHz
—
Memory Clock
1250 MHz
5 Gbps effective
1250 MHz
10 Gbps effective
Memory
Memory Size
4 GB
16 GB
VRAM (MB)
4,096
16,384
+300.0%
Memory Type
GDDR5
GDDR6
Memory Bus
256 bit
256 bit
Bandwidth
160.0 GB/s
320.0 GB/s
Cache
L1 Cache
16 KB (per CU)
64 KB (per SM)
L2 Cache
512 KB
4 MB
Performance
Pixel Rate
23.14 GPixel/s
101.8 GPixel/s
Texture Rate
92.54 GTexel/s
254.4 GTexel/s
FP32 (TFLOPS)
2.961 TFLOPS
8.141 TFLOPS
FP64 (TFLOPS)
185.1 GFLOPS (1:16)
254.4 GFLOPS (1:32)
FP16 (TFLOPS)
2.961 TFLOPS (1:1)
65.13 TFLOPS (8:1)
AI/RT
RT Cores
—
40
Tensor Cores
—
320
Power
TDP
250 W
70 W
TDP (W)
250
70
-72.0%
Suggested PSU
—
250 W
Power Connectors
None
None
Architecture
Architecture
GCN 3.0
Turing
GPU Name
Amethyst
TU104
Generation
Gem System
(R9 M200)
Tesla Turing
(Txx)
Process Size
28 nm
12 nm
Transistors
5,000 million
13,600 million
Die Size
366 mm²
545 mm²
Foundry
TSMC
TSMC
Density
13.7M / mm²
25.0M / mm²
API Support
DirectX
12 (12_0)
12 Ultimate (12_2)
OpenGL
4.6
4.6
Vulkan
1.2.170
1.4
OpenCL
2.1
3.0
CUDA
—
7.5
Shader Model
6.5
6.8
Physical
Slot Width
MXM Module
Single-slot
Length
—
168 mm
6.6 inches
Outputs
Portable Device Dependent
No outputs
Bus Interface
MXM-B (3.0)
PCIe 3.0 x16
Other
Production
End-of-life
End-of-life
Predecessor
Solar System
Tesla Volta
Successor
Polaris Mobile
Server Ampere