Knowledge

Thursday, May 30, 2013

Nvidia GeForce GTX 580 Review

Nvdia had significant trouble releasing the first GF100 Fermi-based GPU back in March, with launch delays letting ATI rule the DX11 battleground for roughly six months. It didn’t help that when the GeForce GTX 480 1.5GB did find its way to our test rigs, it wasn’t fast enough for the price, and also ran hot, loud and consumed too much power. What a difference a few months make, as the new GeForce GTX 580 1.5GB solves all of these issues.

Nvidia GeForce GTX 580 Review
The new Nvidia GeForce GTX 580 1.5GB is a hefty card, but is still only 10.5in long

We’ve leave the GeForce GTX 580 1.5GB performance analysis till after the benchmarks, and instead talk about the upgrades that the GPU and the reference card have received on this and the next page.

GPU Upgrades

While Nvidia has made the obvious move of unlocking the 16th and final SM (Streaming Multiprocessor, or ‘stream processor cluster’ in neutral terminology) of the GF100 architecture for the GeForce GTX 580 1.5GB, the new GF110 codename reveals that this isn’t the full extent of the changes. The most significant difference between GF110 and GF100 is the use of different grades of transistor.

Typically a GPU will use the fastest switching transistors to attain the highest possible frequencies, but these transistors are also the most leaky, resulting in higher power consumption and more waste heat. With the GF110 design, Nvidia has used less leaky transistors for non-performance-critical areas of the GPU, thus lowering the overall power draw of the chip.

Nvidia GeForce GTX 580 Review
The new GTX 580 1.5GB requires plenty of power

However, the power-saving transistors – despite their slower switching tendencies – haven’t lead to a lower GPU frequency for the GeForce GTX 580 1.5GB. The GTX 580 1.5GB’s GPU core operates at 772MHz rather than the 700MHz of the GeForce GTX 480 1.5GB, with the 512 stream processors ripping along at 1,544MHz rather than 1,400MHz.

Nvidia has also added temperature and power draw monitoring to the GeForce GTX 580 1.5GB via two additional chips on the card. This means that if the GPU or the card’s VRMs get too hot or try to draw more power than is safe, the GPU will clock down to avoid damage to the hardware.

There are three things to note about this power management technology, the first being that as it’s enabled by two separate chips on the card - board partners can choose to leave them off to lower the cost of their card. Secondly, the GPU won’t increase in frequency if the power draw or temperature are lower than the maximums – the technology is more akin to Intel’s SpeedStep than Turbo Boost. Finally, the monitoring is software-based and at the moment only detects OCCT and the latest version of FurMark. This means that any thermal or power draw test using these applications are inaccurate, but as we use 3DMark06 to stress the GPUs, our numbers are perfect.

Nvidia GeForce GTX 580 Review
The new GPU, surrounded by memory chips and power phases

While a hardware-only implementation of power monitoring might be preferable – it would be automatic, based on the actual power draw or temperature of the components, and not be dependent on driver interaction – Nvdia said that actually a software-based implementation had more advantages. It allows Nvidia to be more flexible, letting Nvidia add extra applications that it finds to be particularly power hungry. However, the option to disable power monitoring isn't exposed in the driver, so extreme overclockers and anyone wanting to verify that their card is working properly and not overheating will have to be careful.

Apart from the unlocked SM, the GPU has seen its high-precision fp16 capabilities increased, which Nvidia claims improves performance by 4-12 per cent alone.

GeForce GTX 580 Specifications

The GeForce GTX 580 1.5GB uses all 16 SMsThe GeForce GTX 580 1.5GB uses all 16 SMs
The headline specification is undoubtedly the enabling of the 16th and final SM (Streaming Multiprocessor, or ‘stream processor cluster’ in neutral terminology) of the GF100 Fermi design.

However, the new GF110 design still uses the 32 stream processors per SM layout of the original GF100 rather than the 48 per SM of the GeForce GTX 460 GPU.

Even Nvidia sees the 32 layout as a less efficient design, but we suspect that it’s not possible to get four GPCs (Graphics Processing Clusters) with four SMs each onto a die small enough to actually make if each SM contained 48 stream processors rather than 32. In the end, brute force wins out.

Perhaps this layout will change with TSMC’s 28nm process, but that’s not due until halfway through 2011, with GPUs based on this process (from ATI and Nvidia) pencilled in for the autumn of that year.

As well as the extra resources, and the increased high-precision fp16 capabilities (which Nvidia claims is worth a 4-12 per cent performance increase), the GeForce GTX 580 1.5GB operates at higher frequencies than the GeForce GTX 480 1.5GB. While the GPU core of the latter runs at 700MHz (meaning that its 480 stream processors operate at 1.4GHz) the GPU core of the GeForce GTX 580 1.5GB runs at 772MHz, with its 512 stream processors clocked at 1.544GHz.

The 1.5GB of GDDR5 memory also runs faster, with an effective frequency of 4.08GHz rather than 3.7GHz. While this gives the GTX 580 1.5GB more memory bandwidth, the rest of the GPU is the same, with the same 384-bit memory interface and 48 ROPs. The reason for the rise in texture units (from 60 to 64) is because each SM of the GF100 design contains four texture units - unlocking the 16th SM unlocked four more textures.

Nvidia GeForce GTX 580 1.5GBNvidia GeForce GTX 480 1.5GBNvidia GeForce GTX 470 1,280MBATI Radeon HD 5870 1GBATI Radeon HD 6870 1GBATI Radeon HD 5970 2GB
GPU
CodenameGF110GF100GF100Cypress XTBarts XTHemlock XT
Frequency772MHz700MHz607MHz850MHz900MHz725MHz
Stream Processors512 (1,544MHz)480 (1.4GHz)448 (1,215MHz)1,600 (850MHz)1,120 (900MHz)2 x 1,600 (725MHz)
Layout16 SMs, 4 GPCs15 SMs, 4GPCs14 SMs, 4 GPCs20 SIMD engines14 SIMD engines2 x 20 SIMD engines
Rasterisers444222 x 2
Tesselation Units161514112 x 1
Texture Units64605680562 x 80
ROPs48484032322 x 32
Transistors3 billion3 billion3 billion2.15 billion1.7 billion2 x 2.15 billion
Size530mm2530mm2530mm2334mm2255mm22 x 334mm2
Process40nm40nm40nm40nm40nm40nm
Memory
Amount1.5GB GDDR51.5GB GDDR51,280MB GDDR51GB GDDR51GB GDDR52 x 1GB GDDR5
Frequency1.02GHz (4.08GHz effective)924MHz (3.7GHz effective)837MHz (3.2GHz effective)1,050MHz (4.2GHz effective)1,050MHz (4.2GHz effective)1GHz (4GHz effective)
Interface384-bit384-bit320-bit256-bit256-bit2 x 256-bit
Bandwidth192.4GB/sec177GB/sec134GB/sec134.4GB/sec134.4GB/sec2 x 128GB/sec
Card Specifications
Power Connectors1 x 6-pin, 1 x 8-pin PCI-E1 x 6-pin, 1 x 8-pin PCI-E2 x 6-pin PCI-E2 x 6-pin PCI-E2 x 6-pin PCI-E1 x 6-pin, 1 x 8-pin PCI-E
Maximum Power Draw244W250W215W188W151W294W
Idle Power DrawUnspecifiedUnspecifiedUnspecified27W19WUnspecified
Recommended PSU600W600W550W500WUnspecifiedUnspecified
Typical Street Price£400£330£200£320£220£490

How we tested

As always, we did our best to deliver a clean set of benchmarks, with each test repeated three times and an average of those results is what we’re reporting here. In the rare case where performance was inconsistent, we continued repeating the test until we got three results that were consistent.

The tests performed are a mixture of custom in-game timedemos and manually played sections, using FRAPS to record the average and minimum frame rates. We strive to not only record real-world performance you will actually see, but also present the results in a manner that is easy to digest.

Nvidia GeForce GTX 580 Review GeForce GTX 580 Test Setup

Intel Core i7 Test System

  • Intel Core i7-965 processor (3.2GHz: 133MHz x 24)
  • Asus P6T V2 motherboard (Intel X58 Express with three PCI-Express 2.0 x16 slots)
  • 3x 2GB Corsair TR3X6G1333C9 memory modules (operating in dual channel at DDR3 1,600MHz 9-9-9-24-1T)
  • Corsair X128 120GB SSD running v1 firmware
  • Corsair HX1000W PSU
  • Windows 7 Home Premium x64
  • Antec Twelve Hundred Chassis

ATI graphics cards

  • ATI Radeon HD 6870 1GB (900MHz GPU, 4.2GHz memory) using Catalyst 10.9 WHQL
  • ATI Radeon HD 6850 1GB (725MHz GPU, 4GHz memory) using Catalyst 10.9 WHQL
  • ATI Radeon HD 5970 2GB (2 x 725MHz GPU, 2 x 4GHz memory) using Catalyst 10.9 WHQL
  • ATI Radeon HD 5870 1GB (850MHz GPU, 4.8GHz memory) using Catalyst 10.9 WHQL
  • ATI Radeon HD 5850 1GB (725MHz GPU, 4.0GHz memory) using Catalyst 10.9 WHQL
  • ATI Radeon HD 5770 1GB (850MHz GPU, 4.8GHz memory) using Catalyst 10.9 WHQL

Nvidia graphics cards

  • Nvidia GeForce GTX 580 1.5GB (772MHz core, 1,544MHz stream processors, 4.08GHz memory) using release driver
  • Nvidia GeForce GTX 480 1.5GB (756MHz core, 1,512MHz stream processors, 3.8GHz Memory) using GeForce 260.89 WHQL
  • Nvidia GeForce GTX 470 1.2GB (607MHz core, 1,215MHz stream processors, 3.3GHz memory) using GeForce 260.89 WHQL
  • Nvidia GeForce GTX 460 1GB (675MHz core, 1,350MHz stream processors, 3.6GHz memory) using GeForce 260.89 WHQL
  • Nvidia GeForce GTX 460 768MB (675MHz core, 1,350MHz stream processors, 3.6GHz memory) using GeForce 260.89 WHQL

Games Tested

  • Colin McRae: Dirt 2 (DX11)
  • Arma II: Operation Arrowhead (DX9)
  • Just Cause 2 (DX11)
  • Battlefield: Bad Company 2 (DX11)

Colin McRae: Dirt 2

Publisher: Codemasters

From our Colin McRae: Dirt 2 review:
While Dirt ’s shift away from ‘pure’ rallying to a more contemporary styling will likely divide players, there’s no arguing that the game itself looks simply stunning, improving upon Race Driver: GRID's EGO engine to make Dirt 2 one of the best looking racers we’ve ever seen. Both cars and tracksides are lavishly detailed, and there are dozens of gorgeous touches, from the spattering mud in jungle stages, to the jaw dropping water effects from the driver’s cam when you hit a water hazard.

We drive a lap around the London, Battersea track, with a full eight-car grid, starting at the back. We use the maximum image quality settings in DX11 mode. We repeat each test three times, discarding anomalous results and averaging the consistent ones.

Nvidia GeForce GTX 580 Review GeForce GTX 580 Dirt 2 Performance Nvidia GeForce GTX 580 Review GeForce GTX 580 Dirt 2 Performance Nvidia GeForce GTX 580 Review GeForce GTX 580 Dirt 2 Performance

Colin McRae: Dirt 2

1,680 x 1,050 0xAA 16xAF, DirectX 11, Maximum Detail

  • Nvidia GeForce GTX 580 1.5GB
  • Nvidia GeForce GTX 480 1.5GB
  • ATI Radeon HD 5970 2GB
  • Nvidia GeForce GTX 470 1.2GB
  • ATI Radeon HD 5870 1GB
  • Nvidia Geforce GTX 460 1GB
  • ATI Radeon HD 6870 1GB
  • Nvidia GeForce GTX 460 768MB
  • ATI Radeon HD 5850 1GB
  • ATI Radeon HD 6850 1GB
  • ATI Radeon HD 5770 1GB
    • 122
    • 103
    • 107
    • 97
    • 108
    • 94
    • 87
    • 74
    • 79
    • 69
    • 77
    • 67
    • 74
    • 65
    • 71
    • 60
    • 68
    • 60
    • 63
    • 57
    • 48
    • 43
0
25
50
75
100
125
Frame Per Second
  • Average
  • Minimum

Colin McRae: Dirt 2

1,680 x 1,050 4x AA 16xAF, DirectX 11, Maximum Detail

  • Nvidia GeForce GTX 580 1.5GB
  • ATI Radeon HD 5970 2GB
  • Nvidia GeForce GTX 480 1.5GB
  • Nvidia GeForce GTX 470 1.2GB
  • ATI Radeon HD 5870 1GB
  • Nvidia Geforce GTX 460 1GB
  • ATI Radeon HD 6870 1GB
  • ATI Radeon HD 5850 1GB
  • Nvidia GeForce GTX 460 768MB
  • ATI Radeon HD 6850 1GB
  • ATI Radeon HD 5770 1GB
    • 113
    • 101
    • 98
    • 85
    • 96
    • 85
    • 77
    • 69
    • 72
    • 64
    • 68
    • 61
    • 68
    • 59
    • 61
    • 52
    • 63
    • 51
    • 58
    • 51
    • 45
    • 39
0
25
50
75
100
125
Frame Per Second
  • Average
  • Minimum

Colin McRae: Dirt 2

1,920 x 1,200 0x AA 16xAF, DirectX 11, Maximum Detail

  • Nvidia GeForce GTX 580 1.5GB
  • ATI Radeon HD 5970 2GB
  • Nvidia GeForce GTX 480 1.5GB
  • Nvidia GeForce GTX 470 1.2GB
  • ATI Radeon HD 5870 1GB
  • ATI Radeon HD 6870 1GB
  • Nvidia Geforce GTX 460 1GB
  • ATI Radeon HD 5850 1GB
  • Nvidia GeForce GTX 460 768MB
  • ATI Radeon HD 6850 1GB
  • ATI Radeon HD 5770 1GB
    • 111
    • 97
    • 99
    • 86
    • 90
    • 81
    • 74
    • 66
    • 71
    • 62
    • 66
    • 59
    • 66
    • 58
    • 61
    • 54
    • 61
    • 54
    • 57
    • 50
    • 46
    • 41
0
25
50
75
100
125
Frame Per Second
  • Average
  • Minimum

Colin McRae: Dirt 2

1,920 x 1,200 4x AA 16xAF, DirectX 11, Maximum Detail

  • Nvidia GeForce GTX 580 1.5GB
  • ATI Radeon HD 5970 2GB
  • Nvidia GeForce GTX 480 1.5GB
  • Nvidia GeForce GTX 470 1.2GB
  • ATI Radeon HD 5870 1GB
  • ATI Radeon HD 6870 1GB
  • Nvidia Geforce GTX 460 1GB
  • ATI Radeon HD 5850 1GB
  • Nvidia GeForce GTX 460 768MB
  • ATI Radeon HD 6850 1GB
  • ATI Radeon HD 5770 1GB
    • 100
    • 88
    • 90
    • 79
    • 82
    • 70
    • 66
    • 58
    • 63
    • 54
    • 61
    • 52
    • 58
    • 50
    • 56
    • 48
    • 54
    • 46
    • 52
    • 46
    • 41
    • 35
0
25
50
75
100
Frame Per Second
  • Average
  • Minimum

Colin McRae: Dirt 2

2,560 x 1,600 0x AA 16xAF, DirectX 11, Maximum Detail

  • Nvidia GeForce GTX 580 1.5GB
  • ATI Radeon HD 5970 2GB
  • Nvidia GeForce GTX 480 1.5GB
  • ATI Radeon HD 5870 1GB
  • ATI Radeon HD 6870 1GB
  • Nvidia GeForce GTX 470 1.2GB
  • ATI Radeon HD 5850 1GB
  • Nvidia Geforce GTX 460 1GB
  • ATI Radeon HD 6850 1GB
  • Nvidia GeForce GTX 460 768MB
  • ATI Radeon HD 5770 1GB
    • 76
    • 67
    • 73
    • 65
    • 59
    • 52
    • 51
    • 46
    • 47
    • 42
    • 47
    • 42
    • 43
    • 38
    • 42
    • 37
    • 40
    • 35
    • 39
    • 35
    • 31
    • 27
0
10
20
30
40
50
60
70
80
Frame Per Second
  • Average
  • Minimum

Colin McRae: Dirt 2

2,560 x 1,600 4x AA 16xAF, DirectX 11, Maximum Detail

  • Nvidia GeForce GTX 580 1.5GB
  • ATI Radeon HD 5970 2GB
  • Nvidia GeForce GTX 480 1.5GB
  • ATI Radeon HD 5870 1GB
  • Nvidia GeForce GTX 470 1.2GB
  • ATI Radeon HD 6870 1GB
  • ATI Radeon HD 5850 1GB
  • Nvidia Geforce GTX 460 1GB
  • ATI Radeon HD 6850 1GB
  • Nvidia GeForce GTX 460 768MB
  • ATI Radeon HD 5770 1GB
    • 69
    • 61
    • 67
    • 55
    • 54
    • 48
    • 47
    • 40
    • 42
    • 37
    • 43
    • 36
    • 39
    • 34
    • 37
    • 33
    • 36
    • 31
    • 34
    • 31
    • 28
    • 24
0
10
20
30
40
50
60
70
Frame Per Second
  • Average
  • Minimum

No comments:

Post a Comment