I recently saw that my GTS 240 GPU was processing a "Gamma-ray pulsar search #2 1.12 (FGRPopencl-nvidia)" task, but the GPU load was 6%. After restarting BOINC, the GPU load wouldn't go above 0%. The task is slowly progressing, but I don't think it's getting any help from the GPU.
I've seen one report in the nVidia Forums, on the 331.58 driver feedback thread, where users were complaining about OpenCL performance, but my situation here seems different.
Is anyone else noticing OpenCL performance issues on the latest drivers, possibly on older GPUs?
Windows 8.1 x64
OpenCL tasks - Low GPU% on 331.58 drivers?
Time to test 331.65 drivers on that same task!
331.65 drivers are exhibiting the same behavior - only 0-12% GPU usage.
What I need to know is:
Is this an Albert application issue, an Albert task issue, or is this an nVidia driver issue?
The task that completed successfully had a bunch of error info in the Std error output portion.
See:
http://albertathome.org/task/1179842
There were a lot of these lines:
Error in OpenCL context: CL_MEM_OBJECT_ALLOCATION_FAILURE error executing CL_COMMAND_WRITE_BUFFER on GeForce GTS 240 (Device 0).
Error during OpenCL host->device transfer (error: -4)
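For reference, the two error lines agree with each other: in the Khronos cl.h header, status -4 is CL_MEM_OBJECT_ALLOCATION_FAILURE, i.e. the runtime could not allocate memory for the buffer on the device during the host-to-device transfer. A minimal Python lookup mirroring a few cl.h values (only a small subset shown, purely for illustration):

```python
# A few OpenCL status codes, mirroring the values defined in Khronos cl.h.
CL_ERRORS = {
    0: "CL_SUCCESS",
    -4: "CL_MEM_OBJECT_ALLOCATION_FAILURE",
    -5: "CL_OUT_OF_RESOURCES",
    -6: "CL_OUT_OF_HOST_MEMORY",
}

def cl_error_name(status):
    """Return the symbolic name for an OpenCL status code, if known."""
    return CL_ERRORS.get(status, f"unknown status {status}")

# The code reported in the task's stderr:
print(cl_error_name(-4))  # CL_MEM_OBJECT_ALLOCATION_FAILURE
```

On a 512 MB-class card like the GTS 240, repeated allocation failures like this usually point at device memory pressure rather than a compute problem.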
Any ideas?
I wanted to chime in to mention that, today, I spent about 4 hours testing every released beta/whql driver version back to 314.22, on my Windows 8.1 x64 machine. I was testing OpenCL performance of the Albert@Home "Gamma-ray pulsar search #2 1.12 (FGRPopencl-nvidia)" task, on my GTS 240, for each driver version.
The performance results are:
331.65: Bad
331.58: Bad
331.40: Bad
327.23: Very Bad
326.80: Very Bad
326.41: Very Bad
326.19: Very Bad
320.49: Very Bad
320.18: Very Bad
320.14: Very Bad
320.00: Very Bad
314.22: Very Bad
... where "Bad" means GPU Load % fluctuating between 0-25%, and "Very Bad" means GPU Load % fluctuating between 0-9%.
So, if this is a regression, it's not recent. It's entirely possible that my issue is with the task itself, and not within the drivers.
Thoughts?
Thanks,
Jacob
A GTS 240 GPU (G92a or G92b chip, depending on version) is a very old and slow GPU (Q4 2009).
Most of the available performance will have been squeezed out of those years ago. Newer drivers will still be trying to leverage more performance out of:
GFxxx (Fermi) chips - 2010 onwards
GKxxx (Kepler) chips - 2012 onwards
GK110 (Titan-class) chips - 2013 onwards
And remember the huge driver architecture changes between Windows XP and Vista/7/8 (WDDM model).
Your GTS 240 would probably be happiest with Windows XP and a legacy driver - but in that system (and depending on the application - wait for the CUDA 6 tests), even a baby Kepler should still be showing improvement as the drivers are refined.
I didn't want to spam the boards with my stats - just milestone threads - but apparently signatures are no longer optional. Follow the link if you're interested.
http://www.boincsynergy.com/images/stats/comb-3475.jpg
Let me put it another way:
What is the expected behavior of an "FGRPopencl-nvidia" Albert@Home task... on my GTX 660 Ti? What GPU Load % should I expect on that beefy GPU?
Here is what I'm currently seeing on the 331.65 drivers (monitoring with eVGA Precision-X at a 100ms polling interval):
GTX 660 Ti: Super-quick flickers, going from 0% to 33% back to 0%, about 4 times a second.
GTX 460: 23% most of the time, but brief surges downward to 15% for usually less than 2 seconds.
GTS 240: Bouncing around about once a second, between being at 13% and being at 26%.
Does this sound like correct behavior for those architectures, for that task type? :)
My CUDA tasks are usually 90% constant on the GPUs, which is why I thought the GPU Loads reported here looked suspicious. (And yes, I know CUDA is way different than OpenCL, but... are the low loads in this post really the expected loads for this task type?)
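As a cross-check against a vendor tool like Precision-X, NVIDIA's nvidia-smi utility can report the same utilization counter via `nvidia-smi --query-gpu=utilization.gpu --format=csv,noheader`. A small sketch that parses that CSV output into percentages; the sample string below is made up to resemble the three GPUs described above, not captured from a real host:

```python
def parse_gpu_utilization(csv_text):
    """Parse lines like '23 %' (one per GPU) from
    `nvidia-smi --query-gpu=utilization.gpu --format=csv,noheader`
    and return a list of integer percentages."""
    loads = []
    for line in csv_text.strip().splitlines():
        # Each line is e.g. "23 %"; drop the unit and surrounding whitespace.
        loads.append(int(line.replace("%", "").strip()))
    return loads

# Made-up sample output for a 3-GPU host:
sample = "33 %\n23 %\n13 %\n"
print(parse_gpu_utilization(sample))  # [33, 23, 13]
```

Note that nvidia-smi samples utilization over a comparatively coarse window, so the sub-second flickers visible at Precision-X's 100 ms polling interval may be smoothed out.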
I did some testing a little while back and determined that a 660 Ti on Win7 x64 needs to run 4 concurrent tasks, with 1 CPU core reserved for each GPU task, to keep properly fed. I forget how long they took, but IIRC they only garner 70 pts. each... call me a point whore, but I'll crunch other Einstein/Albert GPU tasks until we see something new here.
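The concurrency described above is typically set with an app_config.xml in the BOINC project directory. A sketch of that config, assuming 4 tasks per GPU and one CPU core each; the app name below is a placeholder, not confirmed for this project (the real short name is listed in client_state.xml):

```xml
<app_config>
  <app>
    <!-- Placeholder app name: check client_state.xml for the real one. -->
    <name>hsgamma_FGRP2</name>
    <gpu_versions>
      <!-- 0.25 GPUs per task => 4 concurrent tasks per GPU. -->
      <gpu_usage>0.25</gpu_usage>
      <!-- Reserve one full CPU core per GPU task. -->
      <cpu_usage>1.0</cpu_usage>
    </gpu_versions>
  </app>
</app_config>
```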
I have selected Perseus ARM only, but occasionally get an FGRP task.
Is this by design or an oddity of running beta?
TIA,
Steve
When this happens (getting the wrong kind of work), see if you can capture the server log for that contact. To do this, go to your list of computers, and in the rightmost column click on the datestamp of the last contact to display the log. Post it here, and maybe it can shed some light on why you are allocated work for an application you have not selected.
I apologize in advance for the wall of text about to ensue, as I'm not sure what precisely is relevant; there are references from beginning to end for 2 GAMMA WUs buried in this. I do move hardware between rigs, and I have aborted a couple of GAMMA WUs in the past, but I did let these 2 run, with mixed results. The first ran on a 7950 concurrently with 2 Milkyway WUs and experienced a computation error, which is likely more to do with my card's OC than anything else. The second ran to completion on a 7850 alongside a concurrent Perseus Arm task and is now pending validation. One last side note: the message board removes duplicate space characters, but in the original, the 31st line has only a single space before the [CRITICAL] tag, while all other lines have 4 spaces (likely a tab).
I'm not concerned about getting unselected gamma WU as I'll crunch what I get, I promise no more aborts, but maybe something in the scheduler needs a tweak?
- Steve