[OpenCL] app v1.20/v1.21 feedback thread

Oliver Behnke
Oliver Behnke
Moderator
Administrator
Joined: 4 Sep 07
Posts: 320
Credit: 8545955
RAC: 0
Topic 84820

Hi,

We just released v1.20 which should fix an important bug which prevented proper validation, in particular on AMD 4xxx GPUs. Let's try and collect your feedback to this specific app version (and this one only) in this thread.

FYI, please use the latest Catalyst driver (>=12.1) and BOINC client (>=7.0.12).

Thanks,
Oliver

TRuEQ & TuVaLu
TRuEQ & TuVaLu
Joined: 11 Sep 06
Posts: 75
Credit: 615315
RAC: 0

[OpenCL] app v1.20/v1.21 feedback thread

Hi, I have some tasks that all gave errors.

http://albertathome.org/host/1353/tasks&offset=0&show_names=0&state=5&appid=

I run on an ATI 4850 with 512MB RAM and Catalyst 11.12
BM 7.0.12

Oliver Behnke
Oliver Behnke
Moderator
Administrator
Joined: 4 Sep 07
Posts: 320
Credit: 8545955
RAC: 0

RE: I run on an ATI 4850

Message 79011 in response to message 79010

Quote:

I run on an ATI 4850 with 512MB RAM and Catalyst 11.12

As discussed earlier, we (still) need ~490 MB free GPU memory. This doesn't seem to be the case here. We'd be able to reduce that by 120 MB but that would mean to exclude all OpenCL 1.0 GPUs, including your 4xxx series card.

FYI, we think about doing this anyway since the OpenCL 1.0 devices don't offer a significant performance gain over the CPU version so it seems worthwhile dropping their support in favor of more relaxed memory requirements.

Cheers,
Oliver

TRuEQ & TuVaLu
TRuEQ & TuVaLu
Joined: 11 Sep 06
Posts: 75
Credit: 615315
RAC: 0

RE: RE: I run on an ATI

Message 79012 in response to message 79011

Quote:
Quote:

I run on an ATI 4850 with 512MB RAM and Catalyst 11.12

As discussed earlier, we (still) need ~490 MB free GPU memory. This does't seem to be the case here. We'd be able to reduce that by 120 MB but that would mean to exclude all OpenCL 1.0 GPUs, including your 4xxx series card.

FYI, we think about doing this anyway since the OpenCL 1.0 devices don't offer a significant performance gain over the CPU version so it seems worthwhile dropping their support in favor of more relaxed memory requirements.

Cheers,
Oliver

Well, if the mem need was 450Mb you would have a few more 4xxx cards....
I think that my card has about 480Mb free so if the demand is 470MB one could run the app on 48xx cards.

Oliver Behnke
Oliver Behnke
Moderator
Administrator
Joined: 4 Sep 07
Posts: 320
Credit: 8545955
RAC: 0

As I said, we can reduce it

Message 79013 in response to message 79012

As I said, we can reduce it but only by dropping OpenCL 1.0 (i.e. AMD 4xxx) support.

Oliver

TRuEQ & TuVaLu
TRuEQ & TuVaLu
Joined: 11 Sep 06
Posts: 75
Credit: 615315
RAC: 0

RE: As I said, we can

Message 79014 in response to message 79013

Quote:

As I said, we can reduce it but only by dropping OpenCL 1.0 (i.e. AMD 4xxx) support.

Oliver

Ok

oz
oz
Joined: 28 Feb 05
Posts: 10
Credit: 1285478
RAC: 0

Have tried several Catalyst

Have tried several Catalyst driver/AMD-APP-SDK/Boinc combinations.
OS = Ubuntu/oneiric, 2 x AMD 5770 (Juniper) Cards

ATI GPU 0: ATI Radeon HD 5700 series (Juniper) (CAL version 1.4.1664, 1024MB, 817MB available, 2720 GFLOPS peak)
ATI GPU 1: ATI Radeon HD 5700 series (Juniper) (CAL version 1.4.1664, 1024MB, 983MB available, 2720 GFLOPS peak)
02-Feb-2012 16:45:01 [---] OpenCL: ATI GPU 0: Juniper (driver version CAL 1.4.1664, device version OpenCL 1.1 AMD-APP (851.4), 512MB, 817MB available)
02-Feb-2012 16:45:01 [---] OpenCL: ATI GPU 1: Juniper (driver version CAL 1.4.1664, device version OpenCL 1.1 AMD-APP (851.4), 512MB, 983MB available)

Last combination is Catalyst 12.1, AMD-APP-SDK-v2.6, (without OpenCL v1.2 support) boinc 7.0.12. Results are like=>

http://albertathome.org/task/114016

clinfo reports:clinfo

Number of platforms:                             1
  Platform Profile:                              FULL_PROFILE
  Platform Version:                              OpenCL 1.1 AMD-APP (851.4)
  Platform Name:                                 AMD Accelerated Parallel Processing
  Platform Vendor:                               Advanced Micro Devices, Inc.
  Platform Extensions:                           cl_khr_icd cl_amd_event_callback cl_amd_offline_devices

Platform Name: AMD Accelerated Parallel Processing
Number of devices: 3
Device Type: CL_DEVICE_TYPE_GPU
Device ID: 4098
Board name: ATI Radeon HD 5700 Series
Device Topology: PCI[ B#2, D#0, F#0 ]
Max compute units: 10
Max work items dimensions: 3
Max work items[0]: 256
Max work items[1]: 256
Max work items[2]: 256
Max work group size: 256
Preferred vector width char: 16
Preferred vector width short: 8
Preferred vector width int: 4
Preferred vector width long: 2
Preferred vector width float: 4
Preferred vector width double: 0
Native vector width char: 16
Native vector width short: 8
Native vector width int: 4
Native vector width long: 2
Native vector width float: 4
Native vector width double: 0
Max clock frequency: 0Mhz
Address bits: 32
Max memory allocation: 134217728
Image support: Yes
Max number of images read arguments: 128
Max number of images write arguments: 8
Max image 2D width: 8192
Max image 2D height: 8192
Max image 3D width: 2048
Max image 3D height: 2048
Max image 3D depth: 2048
Max samplers within kernel: 16
Max size of kernel argument: 1024
Alignment (bits) of base address: 2048
Minimum alignment (bytes) for any datatype: 128
Single precision floating point capability
Denorms: No
Quiet NaNs: Yes
Round to nearest even: Yes
Round to zero: Yes
Round to +ve and infinity: Yes
IEEE754-2008 fused multiply-add: Yes
Cache type: None
Cache line size: 0
Cache size: 0
Global memory size: 536870912
Constant buffer size: 65536
Max number of constant args: 8
Local memory type: Scratchpad
Local memory size: 32768
Kernel Preferred work group size multiple: 64
Error correction support: 0
Unified memory for Host and Device: 0
Profiling timer resolution: 1
Device endianess: Little
Available: Yes
Compiler available: Yes
Execution capabilities:
Execute OpenCL kernels: Yes
Execute native function: No
Queue properties:
Out-of-Order: No
Profiling : Yes
Platform ID: 0x7fedcd03c100
Name: Juniper
Vendor: Advanced Micro Devices, Inc.
Device OpenCL C version: OpenCL C 1.1
Driver version: CAL 1.4.1664
Profile: FULL_PROFILE
Version: OpenCL 1.1 AMD-APP (851.4)
Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_atomic_counters_32 cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_popcnt

pragmatic prancing periodic problem child, left
pragmatic pranc...
Joined: 26 Jan 05
Posts: 153
Credit: 70000
RAC: 0

Great, initial estimated time

Great, initial estimated time to completion, 286 hours. No wonder it went into HP. But that DCF of 11 is way out of whack.
i3-530, Win 7 - 64bit, 8GB RAM, ATI HD6850 2GB, Catalysts 11.12, BOINC 7.0.12

Oliver Behnke
Oliver Behnke
Moderator
Administrator
Joined: 4 Sep 07
Posts: 320
Credit: 8545955
RAC: 0

RE: Last combination is

Message 79017 in response to message 79015

Quote:

Last combination is Catalyst 12.1, AMD-APP-SDK-v2.6, (without OpenCL v1.2 support) boinc 7.0.12. Results are like=>

http://albertathome.org/task/114016

clinfo reports:clinfo

  Global memory size:                            536870912

Your card has 512 MB video memory. That's right on the edge of what our app requires (see above). In your case it's not sufficient - error 5 means "out of resources" (memory).

Sorry. You would be one who would benefit if we drop OpenCL 1.0 support in favor of lower memory requirements...

Cheers,
Oliver

PS: BOINC's memory information still seems to be off. I'll forward this...

Oliver Behnke
Oliver Behnke
Moderator
Administrator
Joined: 4 Sep 07
Posts: 320
Credit: 8545955
RAC: 0

RE: Great, initial

Message 79018 in response to message 79016

Quote:
Great, initial estimated time to completion, 286 hours. No wonder it went into HP. But that DCF of 11 is way out of whack.
i3-530, Win 7 - 64bit, 8GB RAM, ATI HD6850 2GB, Catalysts 11.12, BOINC 7.0.12

I'll forward this to Bernd.

Thanks,
Oliver

oz
oz
Joined: 28 Feb 05
Posts: 10
Credit: 1285478
RAC: 0

Oops you're right, video

Oops you're right, video memory is reported as 1024MB for both cards. With 817MB, 983MB available, but global memory for OpenCL is reported from AMD-APP-SDK as 512MB , but (strange) boinc says 817MB, 983MB available). Is there a tweak in OpenCL configuration? amdccle (Catalyst Control Center) says 1024MB Video Memory for both cards. Hmm...?

Comment viewing options

Select your preferred way to display the comments and click "Save settings" to activate your changes.