biodoc
Diamond Member
- Dec 29, 2005
- 6,269
- 2,238
- 136
At least with bigger/newer Nvidia GPUs, FahCore_21.exe is obviously almost always loading one CPU hardware thread to its fullest. Therefore I am wondering whether single-core performance of the CPU can bottleneck GPU PPDs.
To test that, I am still planning to find out how to set up FAHClient to process a given copy of always the same Work Unit.
But for now I turned to FAHBench to get somewhat closer to an answer. The current FAHBench v2 supposedly uses the very same code as FahCore_21. FAHBench v2 comes with three different built-in WUs, and it is also possible to add custom WUs to FAHBench which can be derived from "real" Folding@Home WUs. I don't know how well the three built-in WUs reflect typical Folding@Home WUs.
During benchmarking, FAHBench also loads one CPU hardware thread fully. So, there is at least a distinct chance that any CPU single-thread performance bottleneck would also be showing up in FAHBench.
Software used in the tests:
FAHBench v2.2.5, OpenMM version 6.2-core21-0.0.17
options: OpenCL, single precision, accuracy check enabled, NaN check disabled, 60 s run length
Nvidia driver version 272.06
Windows 7
Hardware:
Core i7-6950X, HT off, EIST off
reference GTX 1080
factory-overclocked GTX 1070 (Gainward Phoenix GS, presumedly 170 W TDP)
both cards in 16-lane PCIe 3.0 slots
In all tests, the GPU performance cap was shown to be Voltage (not power or temperature), according to GPU-Z. Temperatures remained moderate, and cards ran at about 1.9 GHz (1080) and 2.0 GHz (1070).
All FAHBench scores shown below in absolute numbers are averages from three consecutive runs. Variability between those triple runs was reasonably low. The percentages in the table are simply the score at the given CPU clock divided by the score at 4.0 GHz CPU clock.
Work Unit: dhfr
Code:CPU clock 4.0 GHz 3.5 GHz 3.0 GHz 2.5 GHz 2.0 GHz 1.5 GHz ---------------------------------------------------------------------------- GTX 1080 scores 110 106 103 98 92 82 (100 %) (97 %) (94 %) (89 %) (84 %) (75 %) ---------------------------------------------------------------------------- GTX 1070 scores 106 103 100 95 89 80 (100 %) (98 %) (94 %) (90 %) (84 %) (76 %)
Work Unit: dhfr-implicit
Code:CPU clock 4.0 GHz 3.5 GHz 3.0 GHz 2.5 GHz 2.0 GHz 1.5 GHz ---------------------------------------------------------------------------- GTX 1080 scores 519 517 516 514 515 520 (100 %) (100 %) (99 %) (99 %) (99 %) (100 %) ---------------------------------------------------------------------------- GTX 1070 scores 477 476 475 473 476 480 (100 %) (100 %) (100 %) (99 %) (100 %) (99 %)
Work Unit: nav
Code:CPU clock 4.0 GHz 3.5 GHz 3.0 GHz 2.5 GHz 2.0 GHz 1.5 GHz ---------------------------------------------------------------------------- GTX 1080 scores 14.4 14.4 14.3 14.3 14.2 14.1 (100 %) (100 %) (100 %) (99 %) (99 %) (98 %) ---------------------------------------------------------------------------- GTX 1070 scores 14.7 14.6 14.6 14.6 14.5 14.4 (100 %) (99 %) (99 %) (99 %) (99 %) (98 %)
So, there is practically no dro-poff in the dhfr-implicit and nav tests, while the dhfr test shows ~5 % loss of performance when going from 4.0 to 3.0 GHz CPU clock, and ~10 % loss at 2.5 GHz. Not as pronounced as I suspected.
It remains to be seen how this scales in FAHClient with typical WUs.
Another use for FAHBench is to test which Nvidia drivers give maximum ppd on FAH with the caveat that the results may not translate to all FAH WUs.
For most of the race, I was using the nvidia driver version 370.28 for my rig with 2 x 1080s on linux mint. This driver version was the first to allow the coolbits option to control fan speed and allow overclocking the GTX 1080 in linux.
Recently I decided to try to upgrade to the 384.111 driver so see if I could increase my ppd on the 1080s. For the FAHBench tests, I shut down folding.
370.28 driver results:
FAHBench: single precision/DHFR score was 144.85
9414 WU TPF: 43 seconds
384.111 driver results:
FAHBench: single precision/DHFR score was 135.781
9414 WU TPF: 47 seconds
As a not of caution for those with 10 series Ti cards, the 370 driver will not work. You'll need a more recent driver version.