Nvidia GeForce GTX 980 White Paper page 7

Hide thumbs Also See for GeForce GTX 980:
Table of Contents

Advertisement

GeForce GTX 980 Whitepaper
In GeForce GTX 980, each GPC ships with a dedicated raster engine and four SMMs. Each SMM has 128
CUDA cores, a PolyMorph Engine, and eight texture units. With 16 SMMs, the GeForce GTX 980 ships
with a total of 2048 CUDA cores and 128 texture units.
The GeForce GTX 980 features four 64-bit memory controllers (256-bit total). Tied to each memory
controller are 16 ROP units and 512KB of L2 cache. The full chip ships with a total of 64 ROPs and
2048KB of L2 cache (this compared to 32 ROPs and 512K L2 on GK104).
The following table provides a high-level comparison of Maxwell vs. our previous-generation GK104
GPU:
GPU
SMs
CUDA Cores
Base Clock
GPU Boost Clock
GFLOPs
Texture Units
Texel fill-rate
Memory Clock
Memory Bandwidth
ROPs
L2 Cache Size
TDP
Transistors
Die Size
Manufacturing Process
The GeForce GTX 980 has double the SMs compared to the GK104 GPU used in the GeForce GTX 680
released two years ago. Because of the changes implemented in GTX 980's new Maxwell SM, we were
able to integrate 2x more SMs without doubling the die size. With each SM also containing its own
dedicated PolyMorph Engine, GeForce GTX 980 also has twice the number of geometry units as its direct
predecessor. We'll be discussing more details on the new SM design in the next section of the
whitepaper.
Based on efficiency and workload analysis, and math vs. texture processing requirements of modern
games, NVIDIA engineers determined that eight texture units per SMM is the best architectural balance
for Maxwell; therefore, the total number of texture units is the same as Kepler, 128. However, thanks to
GeForce GTX 980's higher clocks, texture fill rate improves by 12% from one generation to the next. To
improve performance in high AA/high resolution gaming scenarios, we doubled the number of ROPs
1
The GFLOPS and texel fill rates in this chart are based on GPU Base Clock
GeForce GTX 680 (Kepler)
8
1536
1006 MHz
1058 MHz
3090
128
128.8 Gigatexels/sec
6000 MHz
192 GB/sec
32
512KB
195 Watts
3.54 billion
mm²
294
28-nm
GM204 HARDWARE ARCHITECTURE
IN-DEPTH
GeForce GTX 980 (Maxwell)
16
2048
1126 MHz
1216 MHz
1
4612
128
144.1 Gigatexels/sec
7000 MHz
224 GB/sec
64
2048KB
165 Watts
5.2 billion
mm²
398
28-nm
7

Advertisement

Table of Contents
loading

Table of Contents