[New release] BRP app v1.23/1.24 (OpenCL) feedback thread

Infusioned
Infusioned
Joined: 11 Feb 05
Posts: 38
Credit: 149000
RAC: 0

p2030.20110421.G41.29-00.40.S

Message 79198 in response to message 79197

p2030.20110421.G41.29-00.40.S.b0s0g0.00000_1728_0

For some reason this wu is showing 0% GPU load and 25% CPU load. My initial reaction was that this must be an error, however, you can see the GPU clock was down to 725 from 840.

http://img140.imageshack.us/img140/883/b0s0g00000017280.jpg

Infusioned
Infusioned
Joined: 11 Feb 05
Posts: 38
Credit: 149000
RAC: 0

p2030.20110421.G41.29-00.40.S

Message 79199 in response to message 79198

p2030.20110421.G41.29-00.40.S.b0s0g0.00000_1504_1 using einsteinbinary_BRP4 version 123 (atiOpenCL)

http://img15.imageshack.us/img15/3065/b0s0g00000015041.jpg

Bikeman (Heinz-Bernd Eggenstein)
Bikeman (Heinz-...
Joined: 28 Aug 06
Posts: 164
Credit: 1864017
RAC: 0

RE: In my AMD HD 6850 i'm

Message 79200 in response to message 79197

Quote:
In my AMD HD 6850 i'm running 2 boinc projects: albert@home and poem@home (3 gpu wu in 1 cpu). When i download an Albert@home gpu wu, the poem wus entered in "suspended" state and albert@home wu doesn't start - aka, no work on gpu. If i reboot boinc client, the poem wu remain suspended, but albert wu starts and runs ok...
Is this normal??

hmmm....theoretically it is possible that the Albert task *thought* it didn't have enough memory and waited for some to get available, which happened after the reboot...still, this looks suspicious. Thanks for reporting.

One question tho: is this reproducible, e.g. after each new WU download from Albert?

Cheers
HBE

Bikeman (Heinz-Bernd Eggenstein)
Bikeman (Heinz-...
Joined: 28 Aug 06
Posts: 164
Credit: 1864017
RAC: 0

RE: p2030.20110421.G41.29-0

Message 79201 in response to message 79198

Quote:

p2030.20110421.G41.29-00.40.S.b0s0g0.00000_1728_0

For some reason this wu is showing 0% GPU load and 25% CPU load. My initial reaction was that this must be an error, however, you can see the GPU clock was down to 725 from 840.

http://img140.imageshack.us/img140/883/b0s0g00000017280.jpg

Strange...this is this one I guess:

http://albertathome.org/task/197941 which has finished in abeout the same time as other tasks. Let's see if it validates.

But I would expect a lower GPU temperature if the load had really been 0% for a longer time, so actually I suspect that the readout is wrong. The app does have phases (at the beginning of each of the 8 subtasks) when there is exclusively CPU load, but this will last only a couple of seconds, not minutes.

THX

HBE

Infusioned
Infusioned
Joined: 11 Feb 05
Posts: 38
Credit: 149000
RAC: 0

p2030.20110421.G41.29-00.40.S

Message 79202 in response to message 79201

p2030.20110421.G41.29-00.40.S.b0s0g0.00000_1920_1 using einsteinbinary_BRP4 version 123 (atiOpenCL)

http://img96.imageshack.us/img96/6813/b0s0g00000019201.jpg

Infusioned
Infusioned
Joined: 11 Feb 05
Posts: 38
Credit: 149000
RAC: 0

Digging through some of the

Message 79203 in response to message 79202

Digging through some of the stderr outputs I notice the atiOpenCl app is doing an awful lot of checkpointing. Curious to see if the cuda app was the same, I looked into one of my wu's:

http://albertathome.org/workunit/68681

My (atiOpenCL) output (abbreviated):

[06:49:19][3424][INFO ] Starting data processing...
[06:49:19][3424][INFO ] Using OpenCL platform provided by: Advanced Micro Devices, Inc.
[06:49:19][3424][INFO ] Using OpenCL device "Cayman" by: Advanced Micro Devices, Inc.
[06:49:19][3424][INFO ] Checkpoint file unavailable: status.cpt (No such file or directory).
------> Starting from scratch...
[06:49:19][3424][INFO ] Header contents:
------> Original WAPP file: ./p2030.20110421.G41.29-00.40.S.b0s0g0.00000_DM126.40
...
[06:50:25][3424][INFO ] Checkpoint committed!
[06:51:30][3424][INFO ] Checkpoint committed!
[06:52:35][3424][INFO ] Checkpoint committed!
[06:53:41][3424][INFO ] Checkpoint committed!
[06:54:46][3424][INFO ] Checkpoint committed!
[06:55:52][3424][INFO ] Checkpoint committed!
[06:56:58][3424][INFO ] Checkpoint committed!
[06:58:03][3424][INFO ] Checkpoint committed!
[06:59:08][3424][INFO ] Checkpoint committed!
[07:00:15][3424][INFO ] Checkpoint committed!
[07:01:20][3424][INFO ] Checkpoint committed!
[07:02:25][3424][INFO ] Checkpoint committed!
[07:03:30][3424][INFO ] Checkpoint committed!
[07:04:36][3424][INFO ] Checkpoint committed!
[07:05:41][3424][INFO ] Checkpoint committed!
[07:06:47][3424][INFO ] Checkpoint committed!
[07:07:53][3424][INFO ] Checkpoint committed!
[07:08:58][3424][INFO ] Checkpoint committed!
[07:09:25][3424][INFO ] OpenCL shutdown complete!
[07:09:25][3424][INFO ] Data processing finished successfully!
...

And then repeats the process for:

Original WAPP file: ./p2030.20110421.G41.29-00.40.S.b0s0g0.00000_DM126.50
Original WAPP file: ./p2030.20110421.G41.29-00.40.S.b0s0g0.00000_DM126.60
Original WAPP file: ./p2030.20110421.G41.29-00.40.S.b0s0g0.00000_DM126.70
Original WAPP file: ./p2030.20110421.G41.29-00.40.S.b0s0g0.00000_DM126.80
Original WAPP file: ./p2030.20110421.G41.29-00.40.S.b0s0g0.00000_DM126.90
Original WAPP file: ./p2030.20110421.G41.29-00.40.S.b0s0g0.00000_DM127.00
Original WAPP file: ./p2030.20110421.G41.29-00.40.S.b0s0g0.00000_DM127.10

Checkpointing each WAPP file once per minute, 20 times.

Comparing to the BRP3cuda32 app (abbreviated):

[12:27:01][5004][INFO ] Starting data processing...
[12:27:01][5004][INFO ] CUDA global memory status (initial GPU state, including context):
------> Used in total: 218 MB (807 MB free / 1025 MB total) -> Used by this application (assuming a single GPU task): 0 MB
[12:27:01][5004][INFO ] Using CUDA device #0 "GeForce GTX 560" (336 CUDA cores / 1105.44 GFLOPS)
[12:27:01][5004][INFO ] Version of installed CUDA driver: 4020
[12:27:01][5004][INFO ] Version of CUDA driver API used: 3020
[12:27:01][5004][INFO ] Checkpoint file unavailable: status.cpt (No such file or directory).
------> Starting from scratch...
[12:27:01][5004][INFO ] Header contents:
------> Original WAPP file: ./p2030.20110421.G41.29-00.40.S.b0s0g0.00000_DM126.40
...
[12:27:31][5004][INFO ] Checkpoint committed!
[12:28:01][5004][INFO ] Checkpoint committed!
[12:28:31][5004][INFO ] Checkpoint committed!
[12:29:01][5004][INFO ] Checkpoint committed!
[12:29:31][5004][INFO ] Checkpoint committed!
[12:30:02][5004][INFO ] Checkpoint committed!
[12:30:32][5004][INFO ] Checkpoint committed!
[12:31:01][5004][INFO ] Data processing finished successfully!
...

which then also repeats for:

Original WAPP file: ./p2030.20110421.G41.29-00.40.S.b0s0g0.00000_DM126.50
Original WAPP file: ./p2030.20110421.G41.29-00.40.S.b0s0g0.00000_DM126.60
Original WAPP file: ./p2030.20110421.G41.29-00.40.S.b0s0g0.00000_DM126.70
Original WAPP file: ./p2030.20110421.G41.29-00.40.S.b0s0g0.00000_DM126.80
Original WAPP file: ./p2030.20110421.G41.29-00.40.S.b0s0g0.00000_DM126.90
Original WAPP file: ./p2030.20110421.G41.29-00.40.S.b0s0g0.00000_DM127.00
Original WAPP file: ./p2030.20110421.G41.29-00.40.S.b0s0g0.00000_DM127.10

Checkpointing each WAPP file once per minute, 5 times.

So, my questions are:
* What is checkpointing? An intermidiate state (variables) save in case calculations get interrupted and you don't have to start over?

* Is the aitOpenCl app checkpointing more? Or is it that the two apps are doing the same amount of work (calcs), and it's just that the CUDA app/GTX 560 is doing more work per unit time and therefore only needs to checkpoint 5 vs. my 20 times?

* Is the GTX 560/CUDA app really 4x (20/5=4) than the HD6950/AtiOpenCl? The 6950 shows 2253 SP GFLOPS vs. the GTX 560 SP GFLOPS of 1088.6.
http://en.wikipedia.org/wiki/Comparison_of_AMD_graphics_processing_units
http://en.wikipedia.org/wiki/Comparison_of_Nvidia_graphics_processing_units

To semi-answer that, GPU Time indicates a 2.503x increase for the GTX560/CUDA vs. the AtiOpenCl/HD6950. The CPU time for the CUDA app is ,however, 4.24x less than that of the OpenCl app. Anandtech Bench shows the 2500k vs. my AMD 975BE to be slightly better in single-threaded, multi-threaded, and total MIPS (7-Zip test), but nothing earth shattering.
http://www.anandtech.com/bench/Product/288?vs=435

I know you said before that the OpenCl app uses way more CPU than the CUDA app. Perhaps the OpenCl standard is still yet immature, AMD has crappy drivers, or a mix of both? Regardless, I really commend everyone's efforts. Having done a fair bit of coding myself, I know what a pain this can all be.

Infusioned
Infusioned
Joined: 11 Feb 05
Posts: 38
Credit: 149000
RAC: 0

p2030.20110421.G41.29-00.40.S

Message 79204 in response to message 79203

p2030.20110421.G41.29-00.40.S.b0s0g0.00000_1928_0 using einsteinbinary_BRP4 version 123 (atiOpenCL)

This one seems to have some weid GPU Load spottiness at ~ the 20% completion mark, but seems to have steadied out at 23% load.

http://img210.imageshack.us/img210/4024/b0s0g00000019280.jpg

Edit:
I take that back, I noticed spottiness again, so I ran the latest 3 versions of GPU-Z side-by-side just to see if there was a bug in one of the versions. There doesn't appear to be as they all report the same load %.

http://img196.imageshack.us/img196/7073/gpuzcomparison.jpg

Infusioned
Infusioned
Joined: 11 Feb 05
Posts: 38
Credit: 149000
RAC: 0

p2030.20110421.G41.29-00.40.S

Message 79205 in response to message 79204

p2030.20110421.G41.29-00.40.S.b0s0g0.00000_2504_0 using einsteinbinary_BRP4 version 123 (atiOpenCL)

http://img140.imageshack.us/img140/4502/b0s0g00000025040.jpg

Christoph
Christoph
Joined: 25 Aug 05
Posts: 30
Credit: 208211
RAC: 0

For me the new app takes a

For me the new app takes a full CPU core when it is running. Is that by intention?

Christoph

[VENETO] boboviz
[VENETO] boboviz
Joined: 6 Oct 06
Posts: 7
Credit: 344106
RAC: 0

RE: One question tho: is

Message 79207 in response to message 79200

Quote:


One question tho: is this reproducible, e.g. after each new WU download from Albert?

Cheers
HBE

My pc has 8gb DDR3 on Win7 64bit, it's enough?
If i continue to download and run A@H wus, the wus take precedence over Poem.
After the last A@H wu, Poem restarts correctly and if i download another A@H the situation occurs again... :-(
I forget: during the no-gpu-use state, 1 cpu core is in use (like A@h is running)

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.