[OpenCL] app v1.20/v1.21 feedback thread

log in

Advanced search

Message boards : Problems and Bug Reports : [OpenCL] app v1.20/v1.21 feedback thread

Author Message
Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111790 - Posted: 2 Feb 2012 | 13:17:20 UTC
Last modified: 7 Feb 2012 | 16:49:03 UTC

Hi,

We just released v1.20 which should fix an important bug which prevented proper validation, in particular on AMD 4xxx GPUs. Let's try and collect your feedback to this specific app version (and this one only) in this thread.

FYI, please use the latest Catalyst driver (>=12.1) and BOINC client (>=7.0.12).

Thanks,
Oliver

TRuEQ & TuVaLu
Send message
Joined: 11 Sep 06
Posts: 75
Credit: 125,660
RAC: 865
Message 111792 - Posted: 2 Feb 2012 | 13:35:06 UTC
Last modified: 2 Feb 2012 | 13:50:40 UTC

Hi, I have some tasks that all gave errors.

http://albert.phys.uwm.edu/results.php?hostid=1353&offset=0&show_names=0&state=5&appid=

I run on an ATI 4850 with 512MB RAM and Catalyst 11.12
BM 7.0.12
____________

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111793 - Posted: 2 Feb 2012 | 13:41:25 UTC - in response to Message 111792.
Last modified: 2 Feb 2012 | 14:08:46 UTC


I run on an ATI 4850 with 512MB RAM and Catalyst 11.12


As discussed earlier, we (still) need ~490 MB free GPU memory. This doesn't seem to be the case here. We'd be able to reduce that by 120 MB but that would mean to exclude all OpenCL 1.0 GPUs, including your 4xxx series card.

FYI, we think about doing this anyway since the OpenCL 1.0 devices don't offer a significant performance gain over the CPU version so it seems worthwhile dropping their support in favor of more relaxed memory requirements.


Cheers,
Oliver

TRuEQ & TuVaLu
Send message
Joined: 11 Sep 06
Posts: 75
Credit: 125,660
RAC: 865
Message 111794 - Posted: 2 Feb 2012 | 13:53:57 UTC - in response to Message 111793.


I run on an ATI 4850 with 512MB RAM and Catalyst 11.12


As discussed earlier, we (still) need ~490 MB free GPU memory. This does't seem to be the case here. We'd be able to reduce that by 120 MB but that would mean to exclude all OpenCL 1.0 GPUs, including your 4xxx series card.

FYI, we think about doing this anyway since the OpenCL 1.0 devices don't offer a significant performance gain over the CPU version so it seems worthwhile dropping their support in favor of more relaxed memory requirements.


Cheers,
Oliver


Well, if the mem need was 450Mb you would have a few more 4xxx cards....
I think that my card has about 480Mb free so if the demand is 470MB one could run the app on 48xx cards.
____________

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111795 - Posted: 2 Feb 2012 | 14:09:56 UTC - in response to Message 111794.

As I said, we can reduce it but only by dropping OpenCL 1.0 (i.e. AMD 4xxx) support.

Oliver

TRuEQ & TuVaLu
Send message
Joined: 11 Sep 06
Posts: 75
Credit: 125,660
RAC: 865
Message 111796 - Posted: 2 Feb 2012 | 14:19:28 UTC - in response to Message 111795.

As I said, we can reduce it but only by dropping OpenCL 1.0 (i.e. AMD 4xxx) support.

Oliver


Ok
____________

oz
Send message
Joined: 28 Feb 05
Posts: 10
Credit: 1,060,681
RAC: 0
Message 111797 - Posted: 2 Feb 2012 | 17:10:37 UTC

Have tried several Catalyst driver/AMD-APP-SDK/Boinc combinations.
OS = Ubuntu/oneiric, 2 x AMD 5770 (Juniper) Cards

ATI GPU 0: ATI Radeon HD 5700 series (Juniper) (CAL version 1.4.1664, 1024MB, 817MB available, 2720 GFLOPS peak)
ATI GPU 1: ATI Radeon HD 5700 series (Juniper) (CAL version 1.4.1664, 1024MB, 983MB available, 2720 GFLOPS peak)
02-Feb-2012 16:45:01 [---] OpenCL: ATI GPU 0: Juniper (driver version CAL 1.4.1664, device version OpenCL 1.1 AMD-APP (851.4), 512MB, 817MB available)
02-Feb-2012 16:45:01 [---] OpenCL: ATI GPU 1: Juniper (driver version CAL 1.4.1664, device version OpenCL 1.1 AMD-APP (851.4), 512MB, 983MB available)


Last combination is Catalyst 12.1, AMD-APP-SDK-v2.6, (without OpenCL v1.2 support) boinc 7.0.12. Results are like=>

http://albert.phys.uwm.edu/result.php?resultid=114016

clinfo reports:clinfo

Number of platforms: 1
Platform Profile: FULL_PROFILE
Platform Version: OpenCL 1.1 AMD-APP (851.4)
Platform Name: AMD Accelerated Parallel Processing
Platform Vendor: Advanced Micro Devices, Inc.
Platform Extensions: cl_khr_icd cl_amd_event_callback cl_amd_offline_devices


Platform Name: AMD Accelerated Parallel Processing
Number of devices: 3
Device Type: CL_DEVICE_TYPE_GPU
Device ID: 4098
Board name: ATI Radeon HD 5700 Series
Device Topology: PCI[ B#2, D#0, F#0 ]
Max compute units: 10
Max work items dimensions: 3
Max work items[0]: 256
Max work items[1]: 256
Max work items[2]: 256
Max work group size: 256
Preferred vector width char: 16
Preferred vector width short: 8
Preferred vector width int: 4
Preferred vector width long: 2
Preferred vector width float: 4
Preferred vector width double: 0
Native vector width char: 16
Native vector width short: 8
Native vector width int: 4
Native vector width long: 2
Native vector width float: 4
Native vector width double: 0
Max clock frequency: 0Mhz
Address bits: 32
Max memory allocation: 134217728
Image support: Yes
Max number of images read arguments: 128
Max number of images write arguments: 8
Max image 2D width: 8192
Max image 2D height: 8192
Max image 3D width: 2048
Max image 3D height: 2048
Max image 3D depth: 2048
Max samplers within kernel: 16
Max size of kernel argument: 1024
Alignment (bits) of base address: 2048
Minimum alignment (bytes) for any datatype: 128
Single precision floating point capability
Denorms: No
Quiet NaNs: Yes
Round to nearest even: Yes
Round to zero: Yes
Round to +ve and infinity: Yes
IEEE754-2008 fused multiply-add: Yes
Cache type: None
Cache line size: 0
Cache size: 0
Global memory size: 536870912
Constant buffer size: 65536
Max number of constant args: 8
Local memory type: Scratchpad
Local memory size: 32768
Kernel Preferred work group size multiple: 64
Error correction support: 0
Unified memory for Host and Device: 0
Profiling timer resolution: 1
Device endianess: Little
Available: Yes
Compiler available: Yes
Execution capabilities:
Execute OpenCL kernels: Yes
Execute native function: No
Queue properties:
Out-of-Order: No
Profiling : Yes
Platform ID: 0x7fedcd03c100
Name: Juniper
Vendor: Advanced Micro Devices, Inc.
Device OpenCL C version: OpenCL C 1.1
Driver version: CAL 1.4.1664
Profile: FULL_PROFILE
Version: OpenCL 1.1 AMD-APP (851.4)
Extensions: cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_atomic_counters_32 cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_popcnt

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111798 - Posted: 2 Feb 2012 | 21:10:41 UTC
Last modified: 2 Feb 2012 | 21:12:13 UTC

Great, initial estimated time to completion, 286 hours. No wonder it went into HP. But that DCF of 11 is way out of whack.
i3-530, Win 7 - 64bit, 8GB RAM, ATI HD6850 2GB, Catalysts 11.12, BOINC 7.0.12
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111799 - Posted: 3 Feb 2012 | 8:39:40 UTC - in response to Message 111797.
Last modified: 3 Feb 2012 | 8:40:27 UTC

Last combination is Catalyst 12.1, AMD-APP-SDK-v2.6, (without OpenCL v1.2 support) boinc 7.0.12. Results are like=>

http://albert.phys.uwm.edu/result.php?resultid=114016

clinfo reports:clinfo

Global memory size: 536870912


Your card has 512 MB video memory. That's right on the edge of what our app requires (see above). In your case it's not sufficient - error 5 means "out of resources" (memory).

Sorry. You would be one who would benefit if we drop OpenCL 1.0 support in favor of lower memory requirements...

Cheers,
Oliver

PS: BOINC's memory information still seems to be off. I'll forward this...

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111800 - Posted: 3 Feb 2012 | 8:45:28 UTC - in response to Message 111798.

Great, initial estimated time to completion, 286 hours. No wonder it went into HP. But that DCF of 11 is way out of whack.
i3-530, Win 7 - 64bit, 8GB RAM, ATI HD6850 2GB, Catalysts 11.12, BOINC 7.0.12


I'll forward this to Bernd.


Thanks,
Oliver

oz
Send message
Joined: 28 Feb 05
Posts: 10
Credit: 1,060,681
RAC: 0
Message 111801 - Posted: 3 Feb 2012 | 9:27:40 UTC
Last modified: 3 Feb 2012 | 9:39:02 UTC

Oops you're right, video memory is reported as 1024MB for both cards. With 817MB, 983MB available, but global memory for OpenCL is reported from AMD-APP-SDK as 512MB , but (strange) boinc says 817MB, 983MB available). Is there a tweak in OpenCL configuration? amdccle (Catalyst Control Center) says 1024MB Video Memory for both cards. Hmm...?

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111802 - Posted: 3 Feb 2012 | 10:42:07 UTC - in response to Message 111801.
Last modified: 3 Feb 2012 | 10:42:50 UTC

Oops you're right, video memory is reported as 1024MB for both cards. With 817MB, 983MB available, but global memory for OpenCL is reported from AMD-APP-SDK as 512MB , but (strange) boinc says 817MB, 983MB available). Is there a tweak in OpenCL configuration? amdccle (Catalyst Control Center) says 1024MB Video Memory for both cards. Hmm...?


This is a (sort of) known issue with AMD GPUs/drivers. Do you happen to know the actual/correct value of your card? Look at the retail box if it helps :-)


Oliver

oz
Send message
Joined: 28 Feb 05
Posts: 10
Credit: 1,060,681
RAC: 0
Message 111803 - Posted: 3 Feb 2012 | 11:06:21 UTC - in response to Message 111802.
Last modified: 3 Feb 2012 | 11:10:27 UTC

According to apple specs it's a MacPro with the standard graphics card ATI Radeon 5770 1GB video mem (one auxilary power cable). Maybe AMD-APP-SDK limits to 50%?

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111804 - Posted: 3 Feb 2012 | 15:28:36 UTC - in response to Message 111803.
Last modified: 3 Feb 2012 | 15:32:14 UTC

According to apple specs it's a MacPro with the standard graphics card ATI Radeon 5770 1GB video mem (one auxilary power cable). Maybe AMD-APP-SDK limits to 50%?


Hang on, do you run Linux on your MacPro?

Anyway, you're witnessing the mess we've to deal with here, AMD's driver and CAL/OpenCL runtimes don't reliably report RAM sizes on some GPUs. They don't even agree with each other. I wouldn't wonder if the driver/runtime tells our app that there's no more memory available (hence the error you see) even when there is... This is valuable hint indeed!

Oliver

oz
Send message
Joined: 28 Feb 05
Posts: 10
Credit: 1,060,681
RAC: 0
Message 111805 - Posted: 3 Feb 2012 | 15:48:04 UTC - in response to Message 111804.
Last modified: 3 Feb 2012 | 16:03:27 UTC

Yes, i' am running Linux on my MacPro. (3.1)
So here we are:
BOINC 1GB
FGLRX 1GB
AMD-SDK 512MB
Card label, Apple 1GB

I installed the driver AFTER the AMD-SDK., so the openCL runtime libs maybe replaced by the driver installation.

And now linux kernel:

id:
display
description: VGA compatible controller
product: Juniper [Radeon HD 5700 Series]
vendor: ATI Technologies Inc
physical id:
0
bus info:
pci@0000:01:00.0
version: 00
width: 64 bits
clock: 33MHz
capabilities: pm pciexpress msi vga_controller bus_master cap_list rom
configuration:
driver = fglrx_pci
latency = 0
resources:
irq : 60
memory : 80000000-9fffffff
memory : c0b00000-c0b1ffff
ioport : 3000(size=256)
memory : c0b20000-c0b3ffff

Someone with power of 2 capability may calculate ram from here

HV Margus*
Send message
Joined: 10 Jan 12
Posts: 17
Credit: 45,000
RAC: 0
Message 111807 - Posted: 4 Feb 2012 | 13:16:37 UTC
Last modified: 4 Feb 2012 | 13:20:43 UTC

I dont know what is problem in linux clean install ubuntu 11.10 x64 latest 12.1 catalyst. all in clean install. Still <core_client_version>7.0.12</core_client_version>
<![CDATA[
<message>
process exited with code 255 (0xff, -1)
</message>
<stderr_txt>
[12:12:23][2765][INFO ] Application startup - thank you for supporting Einstein@Home!
[12:12:23][2765][INFO ] Starting data processing...
[12:12:23][2765][ERROR] Failed to get OpenCL platform/device info from BOINC (error: -1)!

http://albert.phys.uwm.edu/result.php?resultid=109647

http://albert.phys.uwm.edu/result.php?resultid=109647
[12:12:23][2765][ERROR] Demodulation failed (error: -1)!
12:12:23 (2765): called boinc_finish

</stderr_txt>
]]>

Profile Trog Dog
Avatar
Send message
Joined: 25 Nov 05
Posts: 204
Credit: 64,008
RAC: 0
Message 111809 - Posted: 5 Feb 2012 | 1:07:13 UTC

3 gentoo 64bit boxes all with the same problem and they are crunching openCL wus for poem and primegrid - exit with code 255 - unable to get openCL info - error -1
____________

HV Margus*
Send message
Joined: 10 Jan 12
Posts: 17
Credit: 45,000
RAC: 0
Message 111810 - Posted: 5 Feb 2012 | 4:45:41 UTC
Last modified: 5 Feb 2012 | 4:46:03 UTC

Idont know but other openCL pojects run fine but not this. Poem@home fine, moowrapper CAL fine. I not tested MW.

Alex
Send message
Joined: 1 Mar 05
Posts: 58
Credit: 313,531
RAC: 237
Message 111811 - Posted: 5 Feb 2012 | 6:37:03 UTC

Looks like it's more on linux pc, my win7 runs fine
http://albert.phys.uwm.edu/show_host_detail.php?hostid=1894
the last known failing wu is from 28.Jan.


____________

Profile Trog Dog
Avatar
Send message
Joined: 25 Nov 05
Posts: 204
Credit: 64,008
RAC: 0
Message 111812 - Posted: 5 Feb 2012 | 8:44:07 UTC - in response to Message 111811.

I think its restricted to linux 64 bit machines
____________

HV Margus*
Send message
Joined: 10 Jan 12
Posts: 17
Credit: 45,000
RAC: 0
Message 111814 - Posted: 5 Feb 2012 | 11:13:39 UTC - in response to Message 111812.
Last modified: 5 Feb 2012 | 11:23:24 UTC

I think its restricted to linux 64 bit machines


Whay it download app if it is restriced. It not find x86 opencl file because folder locations different in x64

App itself not get opencl file info because it look it another place but it not look it on both folders.
Boinc itself detect x64 file but we needed 32bit one. Drivers supply both files.

Profile Trog Dog
Avatar
Send message
Joined: 25 Nov 05
Posts: 204
Credit: 64,008
RAC: 0
Message 111816 - Posted: 5 Feb 2012 | 22:31:57 UTC - in response to Message 111814.

I think its restricted to linux 64 bit machines


Whay it download app if it is restriced. It not find x86 opencl file because folder locations different in x64

App itself not get opencl file info because it look it another place but it not look it on both folders.
Boinc itself detect x64 file but we needed 32bit one. Drivers supply both files.

I meant the problem is restricted to 64bit machines.
____________

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111817 - Posted: 6 Feb 2012 | 9:18:29 UTC - in response to Message 111809.

3 gentoo 64bit boxes all with the same problem and they are crunching openCL wus for poem and primegrid - exit with code 255 - unable to get openCL info - error -1


FYI, those projects don't use BOINC's OpenCL features at all. As far as I know we are the first project trying to get a fully BOINC-compliant OpenCL implementation going, including OpenCL device management controlled by BOINC, not the app. The error you experience is related to that and we're already aware of it and are investigating it as we speak.

HV Margus* and Trog Dog: do you have multiple GPUs in your systems or just one?


Cheers,
Oliver

Profile Trog Dog
Avatar
Send message
Joined: 25 Nov 05
Posts: 204
Credit: 64,008
RAC: 0
Message 111818 - Posted: 6 Feb 2012 | 10:43:14 UTC - in response to Message 111817.

I have no idea whether poems app is a fully boinc compliant app, but they certainly use the boinc opencl detection features - as stated in this thread - to be able to get and run the opencl app you need a 7.0.x client.

As to the number of gpus each of my 3 boxes only has a single gpu.

May I suggest that you release a 64bit opencl app, if that works (at least detects the opencl features) then the problem lies in the detection of 32bit capabilities on a 64bit machine.
____________

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111820 - Posted: 6 Feb 2012 | 14:32:48 UTC - in response to Message 111818.

I have no idea whether poems app is a fully boinc compliant app, but they certainly use the boinc opencl detection features

Ok, fair enough.


As to the number of gpus each of my 3 boxes only has a single gpu.

Ok, thanks.


May I suggest that you release a 64bit opencl app, if that works (at least detects the opencl features) then the problem lies in the detection of 32bit capabilities on a 64bit machine.

Not anytime soon due to our current time constraints. However, there are successful results from 64-bit Linux systems, so it's not a general issue.

Oliver

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111821 - Posted: 6 Feb 2012 | 14:43:57 UTC - in response to Message 111820.
Last modified: 6 Feb 2012 | 14:44:36 UTC

For those of you on Linux who have the AMD APP SDK installed alongside the AMD Catalyst drivers, please make sure you follow the order as details in the AMD developer release notes (chapter 2).

HTH,
Oliver

HV Margus*
Send message
Joined: 10 Jan 12
Posts: 17
Credit: 45,000
RAC: 0
Message 111822 - Posted: 6 Feb 2012 | 16:08:15 UTC - in response to Message 111821.
Last modified: 6 Feb 2012 | 16:59:10 UTC

There is no needed anymore APP sdk Because 11.12 and 12.1 catalyst supply all sdk OpenCL files. Whay the poem@home work it is opencl too. Installing top 12.1 catalyst the APP SDK broke the system. I test that stupid thing installing SDK then 12.1 as they say that.


Still no go i dont know what dodo. Installed app sdk 2.6 and then a catalyst 12.1
<core_client_version>7.0.14</core_client_version>
<![CDATA[
<message>
process exited with code 255 (0xff, -1)
</message>
<stderr_txt>
[18:51:04][2510][INFO ] Application startup - thank you for supporting Einstein@Home!
[18:51:04][2510][INFO ] Starting data processing...
[18:51:04][2510][ERROR] Failed to get OpenCL platform/device info from BOINC (error: -1)!
[18:51:04][2510][ERROR] Demodulation failed (error: -1)!
18:51:04 (2510): called boinc_finish

</stderr_txt>
]]>

HV Margus*
Send message
Joined: 10 Jan 12
Posts: 17
Credit: 45,000
RAC: 0
Message 111823 - Posted: 6 Feb 2012 | 17:19:04 UTC - in response to Message 111822.

I have one gpu HD6970 AS FAR Boinc tell that.
E 06 veebr 2012 18:48:47 EET | | No config file found - using defaults
E 06 veebr 2012 18:48:47 EET | | Starting BOINC client version 7.0.14 for x86_64-pc-linux-gnu
E 06 veebr 2012 18:48:47 EET | | log flags: file_xfer, sched_ops, task
E 06 veebr 2012 18:48:47 EET | | Libraries: libcurl/7.21.6 OpenSSL/1.0.0e zlib/1.2.3.4 libidn/1.22 librtmp/2.3
E 06 veebr 2012 18:48:47 EET | | Data directory: /home/m/BOINC
E 06 veebr 2012 18:48:47 EET | | Processor: 4 GenuineIntel Intel(R) Core(TM)2 Quad CPU Q6600 @ 2.40GHz [Family 6 Model 15 Stepping 11]
E 06 veebr 2012 18:48:47 EET | | Processor: 4.00 MB cache
E 06 veebr 2012 18:48:47 EET | | Processor features: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx lm constant_tsc arch_perfmon pebs bts rep_good nopl aperfmperf pni dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm lahf_lm dts tpr_shadow vnmi flexpriority
E 06 veebr 2012 18:48:47 EET | | OS: Linux: 3.0.0-15-generic
E 06 veebr 2012 18:48:47 EET | | Memory: 3.87 GB physical, 3.99 GB virtual
E 06 veebr 2012 18:48:47 EET | | Disk: 472.75 GB total, 443.73 GB free
E 06 veebr 2012 18:48:47 EET | | Local time is UTC +2 hours
E 06 veebr 2012 18:48:47 EET | | ATI GPU 0: AMD Radeon HD 6900 series (Cayman) (CAL version 1.4.1664, 2048MB, 1933MB available, 6758 GFLOPS peak)
E 06 veebr 2012 18:48:47 EET | | OpenCL: ATI GPU 0: Cayman (driver version CAL 1.4.1664, device version OpenCL 1.1 AMD-APP (851.4), 1024MB, 1933MB available)
E 06 veebr 2012 18:48:47 EET | | ATI GPU 0 is OpenCL-capable
E 06 veebr 2012 18:48:47 EET | Milkyway@Home | URL http://milkyway.cs.rpi.edu/milkyway/; Computer ID 283213; resource share 300
E 06 veebr 2012 18:48:47 EET | Poem@Home | URL http://boinc.fzk.de/poem/; Computer ID 107783; resource share 100
E 06 veebr 2012 18:48:47 EET | Albert@Home | URL http://albert.phys.uwm.edu/; Computer ID 1734; resource share 100
E 06 veebr 2012 18:48:47 EET | Moo! Wrapper | URL http://moowrap.net/; Computer ID 8810; resource share 100

Alexone
Send message
Joined: 21 Nov 11
Posts: 6
Credit: 1,000
RAC: 0
Message 111824 - Posted: 6 Feb 2012 | 19:10:52 UTC

BM 7.0.14
AMD Driver 11.8
2 4850 1GB (750/1100) MGz 11 hours - Completed and validated

Unit 1
Unit 2

Profile Trog Dog
Avatar
Send message
Joined: 25 Nov 05
Posts: 204
Credit: 64,008
RAC: 0
Message 111825 - Posted: 6 Feb 2012 | 22:23:24 UTC

I don't have the SDK installed, one box has Catalyst 11.12 and the other two 11.11 which provides the opencl drivers. I also have 32bit version of libGLU.so installed as mentioned in the developer release notes.

Maybe we can get some feedback from those successfully returning results from 64bit linux machines with ATI/AMD gpus.
____________

Profile Trog Dog
Avatar
Send message
Joined: 25 Nov 05
Posts: 204
Credit: 64,008
RAC: 0
Message 111826 - Posted: 7 Feb 2012 | 10:23:53 UTC - in response to Message 111825.

Just found out that 12.1 was released as an unstable package a few days ago so I've installed that on all my boxes now and 1 is happily crunching away on an opencl wu
____________

HV Margus*
Send message
Joined: 10 Jan 12
Posts: 17
Credit: 45,000
RAC: 0
Message 111827 - Posted: 7 Feb 2012 | 11:27:55 UTC - in response to Message 111826.

Just found out that 12.1 was released as an unstable package a few days ago so I've installed that on all my boxes now and 1 is happily crunching away on an opencl wu


How do you installed it. I use this. http://wiki.cchtml.com/index.php/Ubuntu_Oneiric_Installation_Guide I make debs and install then.

Profile Trog Dog
Avatar
Send message
Joined: 25 Nov 05
Posts: 204
Credit: 64,008
RAC: 0
Message 111828 - Posted: 7 Feb 2012 | 12:09:50 UTC - in response to Message 111827.

Just found out that 12.1 was released as an unstable package a few days ago so I've installed that on all my boxes now and 1 is happily crunching away on an opencl wu


How do you installed it. I use this. http://wiki.cchtml.com/index.php/Ubuntu_Oneiric_Installation_Guide I make debs and install then.


My boxes are gentoo, all I had to do was unmask the unstable package and compile it.
____________

HV Margus*
Send message
Joined: 10 Jan 12
Posts: 17
Credit: 45,000
RAC: 0
Message 111829 - Posted: 7 Feb 2012 | 13:52:41 UTC - in response to Message 111828.

Just found out that 12.1 was released as an unstable package a few days ago so I've installed that on all my boxes now and 1 is happily crunching away on an opencl wu


How do you installed it. I use this. http://wiki.cchtml.com/index.php/Ubuntu_Oneiric_Installation_Guide I make debs and install then.


My boxes are gentoo, all I had to do was unmask the unstable package and compile it.


It is pain to do this but maybe this help http://en.gentoo-wiki.com/wiki/Fglrx

Can you try ubuntu on these boxes or it look a pain. Ubuntu is simple then others.

Profile Trog Dog
Avatar
Send message
Joined: 25 Nov 05
Posts: 204
Credit: 64,008
RAC: 0
Message 111830 - Posted: 7 Feb 2012 | 14:32:35 UTC

First successfully completed opencl wu http://albert.phys.uwm.edu/result.php?resultid=109919

@HVMargus - thankyou for your help. I don't use ubuntu, but this may help you - remove the SDK then follow the instructions in the wiki for removing older versions of catalyst, then reinstall, configure and reboot (gentoo always recommends rebooting after upgrading catalyst)
____________

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111831 - Posted: 7 Feb 2012 | 16:38:46 UTC - in response to Message 111825.
Last modified: 7 Feb 2012 | 16:40:11 UTC

Maybe we can get some feedback from those successfully returning results from 64bit linux machines with ATI/AMD gpus.


Guys, we just released v1.21 which adds some useful debugging info to the task/result error log. Please reset the albert project or wait until the next task is fetched. The log output of those tasks will help us to understand the problem.

Thanks,
Oliver

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111832 - Posted: 7 Feb 2012 | 16:43:03 UTC - in response to Message 111830.
Last modified: 7 Feb 2012 | 16:45:27 UTC

First successfully completed opencl wu http://albert.phys.uwm.edu/result.php?resultid=109919


Great, so it turned out that the installation was flawed, most likely because of missing 32-bit compatibility libs.

Anyway, I hope we see at least some of those errors again on other systems as we'd really like to understand them and make BOINC more robust for these types of situations.

@HV Margus*: Please run at least one or two tasks before you fix your driver setup. This would really help us! Thank you!


Best,
Oliver

HV Margus*
Send message
Joined: 10 Jan 12
Posts: 17
Credit: 45,000
RAC: 0
Message 111836 - Posted: 8 Feb 2012 | 7:01:09 UTC - in response to Message 111832.
Last modified: 8 Feb 2012 | 7:24:36 UTC

Oliver i installed ubuntu supplied ati catalyst it is 12.1 but its missing opencl
Uninstalling ubuntu supplied catalyst lead missing gl.conf files and unable to go to previous drives as i used before. I try to install sdk because system is missning opencl and boinc not detect opencl in system i have not time to test but if i back home i try things even reinstall os and go to previous state.

Installed SDK AND NOW IT WORK I LET A DAY WORK with old opencl files.

HV Margus*
Send message
Joined: 10 Jan 12
Posts: 17
Credit: 45,000
RAC: 0
Message 111838 - Posted: 8 Feb 2012 | 9:44:35 UTC - in response to Message 111836.
Last modified: 8 Feb 2012 | 9:48:00 UTC

I successfully completed WUs with v1.21 app now under linux i look if they validated too.
http://albert.phys.uwm.edu/result.php?resultid=110194

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111839 - Posted: 8 Feb 2012 | 10:44:12 UTC - in response to Message 111836.
Last modified: 8 Feb 2012 | 10:44:34 UTC

Oliver i installed ubuntu supplied ati catalyst it is 12.1 but its missing opencl


Are you sure you installed all relevant packages? Linux distributions might often split the original/upstream driver installation package into separate packages like the kernel module, the user space driver, the OpenCL runtime and the OpenCL ICD. Debian does this and Ubuntu could do it in a similar way since it's based on Debian.

Good to see that it now works for you!

Cheers,
Oliver

HV Margus*
Send message
Joined: 10 Jan 12
Posts: 17
Credit: 45,000
RAC: 0
Message 111840 - Posted: 8 Feb 2012 | 12:18:29 UTC - in response to Message 111839.

The http://albert.phys.uwm.edu/result.php?resultid=110194 really validated and valid.

HV Margus*
Send message
Joined: 10 Jan 12
Posts: 17
Credit: 45,000
RAC: 0
Message 111841 - Posted: 8 Feb 2012 | 15:54:37 UTC - in response to Message 111840.
Last modified: 8 Feb 2012 | 16:00:58 UTC

Oliver i tested debs again somehow it not work but i got this i dont know if that help. Supplied catalyst is older then now.


08-Feb-2012 17:37:56 [---] OS: Linux: 3.0.0-15-generic
08-Feb-2012 17:37:56 [---] Memory: 3.87 GB physical, 3.99 GB virtual
08-Feb-2012 17:37:56 [---] Disk: 472.75 GB total, 443.57 GB free
08-Feb-2012 17:37:56 [---] Local time is UTC +2 hours
08-Feb-2012 17:37:56 [---] ATI GPU 0: AMD Radeon HD 6900 series (Cayman) (CAL version 1.4.1664, 2048MB, 1932MB available, 6758 GFLOPS peak)
08-Feb-2012 17:37:56 [---] OpenCL: ATI GPU 0: Cayman (driver version CAL 1.4.1664, device version OpenCL 1.1 AMD-APP (851.4), 1024MB, 1932MB available)
08-Feb-2012 17:37:56 [---] ATI GPU 0 is OpenCL-capable


Stderr output

<core_client_version>7.0.14</core_client_version>
<![CDATA[
<message>
process exited with code 255 (0xff, -1)
</message>
<stderr_txt>
[17:49:25][2413][INFO ] Application startup - thank you for supporting Einstein@Home!
[17:49:25][2413][INFO ] Starting data processing...
[debug] BOINC: Trying to find requested ATI OpenCL device #0
BOINC: No OpenCL platform found (-1001)!
[17:49:25][2413][ERROR] Failed to get OpenCL platform/device info from BOINC (error: -1)!
[17:49:25][2413][ERROR] Demodulation failed (error: -1)!
17:49:25 (2413): called boinc_finish

</stderr_txt>
]]>

Alex
Send message
Joined: 1 Mar 05
Posts: 58
Credit: 313,531
RAC: 237
Message 111842 - Posted: 8 Feb 2012 | 22:39:39 UTC

I've updated my main system to be able to run albert wu's
first finished (HD6950) is wuid=37677
resultid=110257
on this pc hostid=1767

just if someone wants to compare performance

____________

Alex
Send message
Joined: 1 Mar 05
Posts: 58
Credit: 313,531
RAC: 237
Message 111843 - Posted: 8 Feb 2012 | 23:13:11 UTC

next wu finished.
HD5850
resultid=116606
wuid=35510
hostid=1767
____________

choks
Send message
Joined: 24 Feb 05
Posts: 5
Credit: 345,604
RAC: 188
Message 111844 - Posted: 9 Feb 2012 | 6:33:13 UTC

Hi

I had this error with the 1.2 version:

[debug] BOINC: Trying to find requested ATI OpenCL device #0
[07:09:27][29377][ERROR] Application caught signal 11.
......
Frame 2:
Binary file: /usr/lib/libamdocl32.so (0xf5fcad78)
Offset info: +0xead78
------> End of backtrace


It turned out on debian64, the AMD driver is installed on /usr/lib32/libamdocl32.so and not /usr/lib/libamdocl32.so.
I don't know why but I had an old version in /usr/lib.
Just copied the new lib over, and it's now working.

Christophe
____________

HV Margus*
Send message
Joined: 10 Jan 12
Posts: 17
Credit: 45,000
RAC: 0
Message 111845 - Posted: 9 Feb 2012 | 7:20:05 UTC - in response to Message 111844.

Hi

I had this error with the 1.2 version:

[debug] BOINC: Trying to find requested ATI OpenCL device #0
[07:09:27][29377][ERROR] Application caught signal 11.
......
Frame 2:
Binary file: /usr/lib/libamdocl32.so (0xf5fcad78)
Offset info: +0xead78
------> End of backtrace


It turned out on debian64, the AMD driver is installed on /usr/lib32/libamdocl32.so and not /usr/lib/libamdocl32.so.
I don't know why but I had an old version in /usr/lib.
Just copied the new lib over, and it's now working.

Christophe

Thanks for hint,

Oliver i tested not debs or supplied catalyst but its 12.1 installer and found that opencl missing on system nothing helped even installing sdk after 12.1 install. Only what is work is supplied in ubuntu and installing sdk after but this combination is terrible old.

HV Margus*
Send message
Joined: 10 Jan 12
Posts: 17
Credit: 45,000
RAC: 0
Message 111846 - Posted: 9 Feb 2012 | 7:55:19 UTC - in response to Message 111845.
Last modified: 9 Feb 2012 | 8:01:29 UTC

Now i got improvement but its far from working
Stderr output

<core_client_version>7.0.14</core_client_version>
<![CDATA[
<message>
process exited with code 11 (0xb, -245)
</message>
<stderr_txt>
[08:07:21][2740][INFO ] Application startup - thank you for supporting Einstein@Home!
[08:07:21][2740][INFO ] Starting data processing...
[debug] BOINC: Trying to find requested ATI OpenCL device #0

[08:07:22][2740][ERROR] Application caught signal 11.

------> Obtained 5 stack frames for this thread.
------> Backtrace:
Frame 5:
Binary file: ../../projects/albert.phys.uwm.edu/einsteinbinary_BRP4_1.21_i686-pc-linux-gnu__atiOpenCL (0x80b6d32)
Offset info: pthread_mutex_lock+0x672
Source file: erp_boinc_wrapper.cpp (Function: sighandler / Line: 166)
Frame 4:
Binary file: ../../projects/albert.phys.uwm.edu/einsteinbinary_BRP4_1.21_i686-pc-linux-gnu__atiOpenCL (0x842eff6)
Offset info: _ZNSsD1Ev+0x26
Frame 3:
Binary file: ../../projects/albert.phys.uwm.edu/einsteinbinary_BRP4_1.21_i686-pc-linux-gnu__atiOpenCL (0x842eff6)
Offset info: _ZNSsD1Ev+0x26
Frame 2:
Binary file: /usr/lib32/libamdocl32.so (0xf600ad78)
Offset info: +0xead78
------> End of backtrace

08:07:22 (2740): called boinc_finish
Frame 1:
Binary file: [ ((nil))
Offset info: nil
'[': No such file

</stderr_txt>
]]>

choks, can you upload the silly catalyst opencl files somehow i not have right ones.

choks
Send message
Joined: 24 Feb 05
Posts: 5
Credit: 345,604
RAC: 188
Message 111847 - Posted: 9 Feb 2012 | 8:25:17 UTC - in response to Message 111846.

Hi,

I updated Catalyst to version 11.12.
The file I have is:

-rw-r--r-- 1 root root 25369464 4 févr. 07:10 /usr/lib32/libamdocl32.so

I think it is written when you run amd-driver-installer-12-1-x86.x86_64.run.

Christophe
____________

HV Margus*
Send message
Joined: 10 Jan 12
Posts: 17
Credit: 45,000
RAC: 0
Message 111848 - Posted: 9 Feb 2012 | 9:12:08 UTC - in response to Message 111847.
Last modified: 9 Feb 2012 | 9:16:58 UTC

You guys use linux for years i use maybe in total one month i not guru yet.
Is needed install catalyst in root or not, whay it is broke always something or not work.

terencewee*
Send message
Joined: 2 Feb 12
Posts: 5
Credit: 4,500
RAC: 0
Message 111849 - Posted: 10 Feb 2012 | 7:06:32 UTC


http://albert.phys.uwm.edu/results.php?userid=333612

All WUs validated.
v1.19 and v1.20 - running on 5850 (Cypress) and M7820 (Juniper).
Will continue test run again when finished with POEM++ OpenCL WUs.

Good job!

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111850 - Posted: 10 Feb 2012 | 9:20:59 UTC - in response to Message 111844.
Last modified: 10 Feb 2012 | 9:29:57 UTC

It turned out on debian64, the AMD driver is installed on /usr/lib32/libamdocl32.so and not /usr/lib/libamdocl32.so.
I don't know why but I had an old version in /usr/lib.
Just copied the new lib over, and it's now working.


I experienced the same issue on my machine. I turns out that this is an inconsistency between the APP SDK and Catalyst. Both install this 32-bit compatibility library, but in different locations, hence the mix-up that leads to the segfault. I already filed a bug report with AMD.

I recommend to not install the APP SDK, you don't need it to run OpenCL GPU applications as long as you installed Catalyst >= 11.3 (Windows, 11.11 for Linux). The SDK is for developers only.


Cheers,
Oliver

HV Margus*
Send message
Joined: 10 Jan 12
Posts: 17
Credit: 45,000
RAC: 0
Message 111853 - Posted: 14 Feb 2012 | 5:11:37 UTC - in response to Message 111850.

I test old combination drivers without opencl from PPA
Installed SDK and it work.

With generating deb is broken on catalyst 12.1 i wait next one.
Because catalyst 12.1 not install some files and something is broken.

oz
Send message
Joined: 28 Feb 05
Posts: 10
Credit: 1,060,681
RAC: 0
Message 111857 - Posted: 16 Feb 2012 | 15:03:15 UTC

After getting it running at all, I finished some atiOpenCL Tasks on two ATI cards (GPU0=Screen0,GPU1=Screen1(SingleDesktop) are both ATI 5700) simultaniously. The runtime of the task on GPU0 is using 3 times of the task on GPU1. After starting ubuntu with "unity2d" without 3D WindowManager both task are running fast. So I blamed compiz for the slow performance on GPU0.

I had no success in a non X setup. (I do not need X here)
Stopping lighdm so that all modules are loaded and DISPLAY (COMPUTE) variable set to ":0" atiOpenCL apps do not start on the commandline in a console.

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111860 - Posted: 17 Feb 2012 | 16:11:17 UTC - in response to Message 111857.
Last modified: 17 Feb 2012 | 16:11:50 UTC


I had no success in a non X setup. (I do not need X here)
Stopping lighdm so that all modules are loaded and DISPLAY (COMPUTE) variable set to ":0" atiOpenCL apps do not start on the commandline in a console.


Known limitation of the AMD drivers/runtime. If your machine is isolated you may also just allow all users to connect to your (running) X display, works just fine for me:

export DISPLAY=:0 && xhost +


Best,
Oliver

Jon Fox
Send message
Joined: 9 Feb 05
Posts: 7
Credit: 111,308
RAC: 7
Message 111861 - Posted: 19 Feb 2012 | 2:11:55 UTC

Results from OS X OpenCL app:

Sat Feb 18 10:28:13 2012 | | Starting BOINC client version 7.0.15 for x86_64-apple-darwin
Sat Feb 18 10:28:13 2012 | | Libraries: libcurl/7.21.7 OpenSSL/0.9.7l zlib/1.2.5 c-ares/1.7.4
Sat Feb 18 10:28:13 2012 | | Processor: 4 GenuineIntel Intel(R) Core(TM) i5-2500S CPU @ 2.70GHz [x86 Family 6 Model 42 Stepping 7]
Sat Feb 18 10:28:13 2012 | | Processor features: FPU VME DE PSE TSC MSR PAE MCE CX8 APIC SEP MTRR PGE MCA CMOV PAT PSE36 CLFSH DS ACPI MMX FXSR SSE SSE2 SS HTT TM PBE SSE3 PCLMULQDQ DTES64 MON DSCPL VMX SMX EST TM2 SSSE3 CX16 TPR PDCM SSE4.1 SSE4.2 xAPIC POPCNT AES PCID XSAVE OSXSAVE TSCTMR AVX1.0
Sat Feb 18 10:28:13 2012 | | OS: Mac OS X 10.7.3 (Darwin 11.3.0)
Sat Feb 18 10:28:13 2012 | | Memory: 12.00 GB physical, 891.06 GB virtual
Sat Feb 18 10:28:13 2012 | | OpenCL: ATI GPU 0: ATI Radeon HD 6770M (driver version 1.0, device version OpenCL 1.1, 512MB, 512MB available)
.
.
.
Name p2030.20111110.G39.19-00.79.N.b3s0g0.00100_3288_0
Workunit 39033
Created 3 Feb 2012 | 18:48:47 UTC
Sent 18 Feb 2012 | 22:05:38 UTC
Received 18 Feb 2012 | 23:12:55 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status 229 (0xe5)
Computer ID 1301
Report deadline 3 Mar 2012 | 22:05:38 UTC
Run time 9.64
CPU time 3.12
Validate state Invalid
Credit 0.00
Application version Binary Radio Pulsar Search v1.21 (atiOpenCLLion)
Stderr output

<core_client_version>7.0.15</core_client_version>
<![CDATA[
<message>
process exited with code 229 (0xe5, -27)
</message>
<stderr_txt>
[17:06:59][5382][INFO ] Application startup - thank you for supporting Einstein@Home!
[17:06:59][5382][INFO ] Starting data processing...
[debug] BOINC: Trying to find requested ATI OpenCL device #0
[debug] BOINC: Found 1 (1) devices on platform #0
[debug] BOINC: vendor of device #0 on platform #0: AMD
[debug] BOINC: Found requested ATI OpenCL device #0 on platform #0! Returning pointers...
[17:06:59][5382][INFO ] Using OpenCL platform provided by: Apple
[17:06:59][5382][INFO ] Using OpenCL device "ATI Radeon HD 6770M" by: AMD
[17:07:00][5382][INFO ] Checkpoint file unavailable: status.cpt (No such file or directory).
------> Starting from scratch...
[17:07:00][5382][INFO ] Header contents:
------> Original WAPP file: ./p2030.20111110.G39.19-00.79.N.b3s0g0.00100_DM637.60
------> Sample time in microseconds: 65.4762
------> Observation time in seconds: 274.62705
------> Time stamp (MJD): 55875.841882757326
------> Number of samples/record: 0
------> Center freq in MHz: 1214.289551
------> Channel band in MHz: 0.33605957
------> Number of channels/record: 960
------> Nifs: 1
------> RA (J2000): 190527.4746
------> DEC (J2000): 51146.9529991
------> Galactic l: 0
------> Galactic b: 0
------> Name: G39.19-00.79.N
------> Lagformat: 0
------> Sum: 1
------> Level: 3
------> AZ at start: 0
------> ZA at start: 0
------> AST at start: 0
------> LST at start: 0
------> Project ID: --
------> Observers: --
------> File size (bytes): 0
------> Data size (bytes): 0
------> Number of samples: 4194304
------> Trial dispersion measure: 637.6 cm^-3 pc
------> Scale factor: 0.100816
[17:07:02][5382][INFO ] Seed for random number generator is -1006570859.
[17:07:06][5382][ERROR] Error in OpenCL context: OpenCL Build Warning : Compiler build log:
<program source>:365:1: warning: unused variable 'c'
fftKernel8(a + 0, dir);
^
<program source>:61:9: note: instantiated from:
float2 c; \
^
<program source>:366:1: warning: unused variable 'c'
fftKernel8(a + 8, dir);
^
<program source>:61:9: note: instantiated from:
float2 c; \
^
<program source>:193:15: warning: unused variable 'r'
int i, j, r, indexIn, indexOut, index, tid, bNum, xNum, k, l;
^
<program source>:193:37: warning: unused variable 'index'
int i, j, r, indexIn, indexOut, index, tid, bNum, xNum, k, l;
^
<program source>:194:9: warning: unused variable 's'
int s, ii, jj, offset;
^
<program source>:194:12: warning: unused variable 'ii'
int s, ii, jj, offset;
^
<program source>:194:16: warning: unused variable 'jj'
int s, ii, jj, offset;
^
<program source>:194:20: warning: unused variable 'offset'
int s, ii, jj, offset;
^
<program source>:196:16: warning: unused variable 'angf'
float ang, angf, ang1;
^
<program source>:699:1: warning: unused variable 'c'
fftKernel8(a + 0, dir);
^
<program source>:61:9: note: instantiated from:
float2 c; \
^
<program source>:700:1: warning: unused variable 'c'
fftKernel8(a + 8, dir);
^
<program source>:61:9: note: instantiated from:
float2 c; \
^
<program source>:527:15: warning: unused variable 'r'
int i, j, r, indexIn, indexOut, index, tid, bNum, xNum, k, l;
^
<program source>:527:37: warning: unused variable 'index'
int i, j, r, indexIn, indexOut, index, tid, bNum, xNum, k, l;
^
<program source>:528:9: warning: unused variable 's'
int s, ii, jj, offset;
^
<program source>:528:12: warning: unused variable 'ii'
int s, ii, jj, offset;
^
<program source>:528:16: warning: unused variable 'jj'
int s, ii, jj, offset;
^
<program source>:528:20: warning: unused variable 'offset'
int s, ii, jj, offset;
^
<program source>:530:16: warning: unused variable 'angf'
float ang, angf, ang1;
^
<program source>:963:1: warning: unused variable 'c'
fftKernel8(a + 0, dir);
^
<program source>:61:9: note: instantiated from:
float2 c; \
^
<program source>:964:1: warning: unused variable 'c'
fftKernel8(a + 8, dir);
^
<program source>:61:9: note: instantiated from:
float2 c; \
^
<program source>:791:15: warning: unused variable 'r'
int i, j, r, indexIn, indexOut, index, tid, bNum, xNum, k, l;
^
<program source>:791:37: warning: unused variable 'index'
int i, j, r, indexIn, indexOut, index, tid, bNum, xNum, k, l;
^
<program source>:792:9: warning: unused variable 's'
int s, ii, jj, offset;
^
<program source>:792:12: warning: unused variable 'ii'
int s, ii, jj, offset;
^
<program source>:792:16: warning: unused variable 'jj'
int s, ii, jj, offset;
^
<program source>:792:20: warning: unused variable 'offset'
int s, ii, jj, offset;
^
<program source>:794:16: warning: unused variable 'angf'
float ang, angf, ang1;
^
<program source>:1093:1: warning: unused variable 'c'
fftKernel8(a, dir);
^
<program source>:61:9: note: instantiated from:
float2 c; \
^
<program source>:1054:15: warning: unused variable 'r'
int i, j, r, indexIn, indexOut, index, tid, bNum, xNum, k, l;
^
<program source>:1054:37: warning: unused variable 'index'
int i, j, r, indexIn, indexOut, index, tid, bNum, xNum, k, l;
^
<program source>:1054:61: warning: unused variable 'k'
int i, j, r, indexIn, indexOut, index, tid, bNum, xNum, k, l;
^
<program source>:1054:64: warning: unused variable 'l'
int i, j, r, indexIn, indexOut, index, tid, bNum, xNum, k, l;
^
<program source>:1055:9: warning: unused variable 's'
int s, ii, jj, offset;
^
<program source>:1055:12: warning: unused variable 'ii'
int s, ii, jj, offset;
^
<program source>:1055:16: warning: unused variable 'jj'
int s, ii, jj, offset;
^
<program source>:1055:20: warning: unused variable 'offset'
int s, ii, jj, offset;
^
<program source>:1056:12: warning: unused variable 'w'
float2 w;
^
<program source>:1057:11: warning: unused variable 'ang'
float ang, angf, ang1;
^
<program source>:1057:16: warning: unused variable 'angf'
float ang, angf, ang1;
^
<program source>:1057:22: warning: unused variable 'ang1'
float ang, angf, ang1;
^
<program source>:1058:20: warning: unused variable 'lMemStore'
__local float *lMemStore, *lMemLoad;
^
<program source>:1058:32: warning: unused variable 'lMemLoad'
__local float *lMemStore, *lMemLoad;
^


[17:07:06][5382][INFO ] Derived global search parameters:
------> f_A probability = 0.08
------> single bin prob(P_noise > P_thr) = 9.93986e-09
------> thr1 = 18.4267
------> thr2 = 21.5421
------> thr4 = 26.5915
------> thr8 = 35.0049
------> thr16 = 49.3672
[17:07:06][5382][ERROR] Error in OpenCL context: [CL_MEM_OBJECT_ALLOCATION_FAILURE] : OpenCL Error : clEnqueueNDRangeKernel failed: memory usage (268435456 bytes) is more than the device can support (201326592 bytes)
[17:07:06][5382][ERROR] Error during OpenCL FFT setup (error: -4)
[17:07:06][5382][ERROR] Demodulation failed (error: 2021)!
17:07:06 (5382): called boinc_finish

</stderr_txt>
]]>


--
jon
____________

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111863 - Posted: 20 Feb 2012 | 10:05:10 UTC - in response to Message 111861.
Last modified: 22 Feb 2012 | 9:00:48 UTC

As it says, not enough memory...

Update: we'll improve the error handling and messaging for such cases. The next release will state it more explicitly and the app will ask BOINC to retry again after 5 minutes, as it does for other GPU memory issues...

Oliver

Christoph
Send message
Joined: 25 Aug 05
Posts: 48
Credit: 148,613
RAC: 15
Message 111869 - Posted: 26 Feb 2012 | 20:34:43 UTC
Last modified: 26 Feb 2012 | 21:01:39 UTC

I have one task still running which is stuck with 100%. I did stop BOINC and now after restarting it is still there and 'running'.
I did save the slot files so tell me what you need.

The last line of stderr is 'clled boinc_finish' but it still runs as I type.

This one http://albert.phys.uwm.edu/result.php?resultid=117800.

When I did quit BOINC because of this WU it showed a runtime of more than 12 hours. It did restart with 11 something. Last checkpoint on 9h23m processor time.

Christoph

[Edit] here the stderr.txt

I did remove plenty checkpoint entries to reduce the length of the post. It did checkpoint once per minute in between.

Activated exception handling...
[19:39:29][6036][INFO ] Starting data processing...
[debug] BOINC: Trying to find requested ATI OpenCL device #0
[debug] BOINC: Found 1 (1) devices on platform #0
[debug] BOINC: vendor of device #0 on platform #0: Advanced Micro Devices, Inc.
[debug] BOINC: Found requested ATI OpenCL device #0 on platform #0! Returning pointers...
[19:39:31][6036][INFO ] Using OpenCL platform provided by: Advanced Micro Devices, Inc.
[19:39:31][6036][INFO ] Using OpenCL device "Cedar" by: Advanced Micro Devices, Inc.
[19:39:32][6036][WARN ] Kernel "kernelTimeSeriesModulation" exceeds device-specific maximum work group size (requested: 256)!
------> Reducing kernel's work group size to allowed maximum of: 128 work items
[19:39:32][6036][WARN ] Kernel "kernelTimeSeriesResampling" exceeds device-specific maximum work group size (requested: 256)!
------> Reducing kernel's work group size to allowed maximum of: 128 work items
[19:39:32][6036][WARN ] Kernel "kernelTimeSeriesMeanReduction" exceeds device-specific maximum work group size (requested: 256)!
------> Reducing kernel's work group size to allowed maximum of: 128 work items
[19:39:32][6036][WARN ] Kernel "kernelTimeSeriesPadding" exceeds device-specific maximum work group size (requested: 256)!
------> Reducing kernel's work group size to allowed maximum of: 128 work items
[19:39:32][6036][WARN ] Kernel "kernelPowerSpectrum" exceeds device-specific maximum work group size (requested: 256)!
------> Reducing kernel's work group size to allowed maximum of: 128 work items
[19:39:32][6036][WARN ] Kernel "kernelHarmonicSumming" exceeds device-specific maximum work group size (requested: 256)!
------> Reducing kernel's work group size to allowed maximum of: 128 work items
[19:39:32][6036][WARN ] Kernel "kernelFillFloatBuffer" exceeds device-specific maximum work group size (requested: 256)!
------> Reducing kernel's work group size to allowed maximum of: 128 work items
[19:39:32][6036][INFO ] Checkpoint file unavailable: status.cpt (No such file or directory).
------> Starting from scratch...
[19:39:32][6036][INFO ] Header contents:
------> Original WAPP file: ./p2030.20111110.G39.19-00.79.N.b3s0g0.00000_DM589.60
------> Sample time in microseconds: 65.4762
------> Observation time in seconds: 274.62705
------> Time stamp (MJD): 55875.841883733527
------> Number of samples/record: 0
------> Center freq in MHz: 1214.289551
------> Channel band in MHz: 0.33605957
------> Number of channels/record: 960
------> Nifs: 1
------> RA (J2000): 190527.4746
------> DEC (J2000): 51146.9529991
------> Galactic l: 0
------> Galactic b: 0
------> Name: G39.19-00.79.N
------> Lagformat: 0
------> Sum: 1
------> Level: 3
------> AZ at start: 0
------> ZA at start: 0
------> AST at start: 0
------> LST at start: 0
------> Project ID: --
------> Observers: --
------> File size (bytes): 0
------> Data size (bytes): 0
------> Number of samples: 4194304
------> Trial dispersion measure: 589.6 cm^-3 pc
------> Scale factor: 0.113705
[19:39:36][6036][INFO ] Seed for random number generator is -1001188666.
[19:39:51][6036][INFO ] Derived global search parameters:
------> f_A probability = 0.08
------> single bin prob(P_noise > P_thr) = 9.93986e-009
------> thr1 = 18.4267
------> thr2 = 21.5421
------> thr4 = 26.5915
------> thr8 = 35.0049
------> thr16 = 49.3672
[19:40:30][6036][INFO ] Checkpoint committed!
-snip- Checkpoint info deleted
[20:49:30][6036][INFO ] Checkpoint committed!
[20:50:02][6036][INFO ] OpenCL shutdown complete!
[20:50:02][6036][INFO ] Data processing finished successfully!
[20:50:02][6036][INFO ] Starting data processing...
[debug] BOINC: Trying to find requested ATI OpenCL device #0
[debug] BOINC: Found 1 (1) devices on platform #0
[debug] BOINC: vendor of device #0 on platform #0: Advanced Micro Devices, Inc.
[debug] BOINC: Found requested ATI OpenCL device #0 on platform #0! Returning pointers...
[20:50:02][6036][INFO ] Using OpenCL platform provided by: Advanced Micro Devices, Inc.
[20:50:02][6036][INFO ] Using OpenCL device "Cedar" by: Advanced Micro Devices, Inc.
[20:50:03][6036][INFO ] Checkpoint file unavailable: status.cpt (No such file or directory).
------> Starting from scratch...
[20:50:03][6036][INFO ] Header contents:
------> Original WAPP file: ./p2030.20111110.G39.19-00.79.N.b3s0g0.00000_DM590.10
------> Sample time in microseconds: 65.4762
------> Observation time in seconds: 274.62705
------> Time stamp (MJD): 55875.841883723362
------> Number of samples/record: 0
------> Center freq in MHz: 1214.289551
------> Channel band in MHz: 0.33605957
------> Number of channels/record: 960
------> Nifs: 1
------> RA (J2000): 190527.4746
------> DEC (J2000): 51146.9529991
------> Galactic l: 0
------> Galactic b: 0
------> Name: G39.19-00.79.N
------> Lagformat: 0
------> Sum: 1
------> Level: 3
------> AZ at start: 0
------> ZA at start: 0
------> AST at start: 0
------> LST at start: 0
------> Project ID: --
------> Observers: --
------> File size (bytes): 0
------> Data size (bytes): 0
------> Number of samples: 4194304
------> Trial dispersion measure: 590.1 cm^-3 pc
------> Scale factor: 0.113806
[20:50:07][6036][INFO ] Seed for random number generator is -1007574167.
[20:50:19][6036][INFO ] Derived global search parameters:
------> f_A probability = 0.08
------> single bin prob(P_noise > P_thr) = 9.93986e-009
------> thr1 = 18.4267
------> thr2 = 21.5421
------> thr4 = 26.5915
------> thr8 = 35.0049
------> thr16 = 49.3672
[20:50:30][6036][INFO ] Checkpoint committed!
-snip- Checkpoint info deleted
[21:57:31][6036][INFO ] Checkpoint committed!
Activated exception handling...
[02:49:02][6532][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_0' already exists - skipping pass
[02:49:02][6532][INFO ] Starting data processing...
[debug] BOINC: Trying to find requested ATI OpenCL device #0
[debug] BOINC: Found 1 (1) devices on platform #0
[debug] BOINC: vendor of device #0 on platform #0: Advanced Micro Devices, Inc.
[debug] BOINC: Found requested ATI OpenCL device #0 on platform #0! Returning pointers...
[02:49:21][6532][INFO ] Using OpenCL platform provided by: Advanced Micro Devices, Inc.
[02:49:21][6532][INFO ] Using OpenCL device "Cedar" by: Advanced Micro Devices, Inc.
[02:49:22][6532][WARN ] Kernel "kernelTimeSeriesModulation" exceeds device-specific maximum work group size (requested: 256)!
------> Reducing kernel's work group size to allowed maximum of: 128 work items
[02:49:22][6532][WARN ] Kernel "kernelTimeSeriesResampling" exceeds device-specific maximum work group size (requested: 256)!
------> Reducing kernel's work group size to allowed maximum of: 128 work items
[02:49:22][6532][WARN ] Kernel "kernelTimeSeriesMeanReduction" exceeds device-specific maximum work group size (requested: 256)!
------> Reducing kernel's work group size to allowed maximum of: 128 work items
[02:49:22][6532][WARN ] Kernel "kernelTimeSeriesPadding" exceeds device-specific maximum work group size (requested: 256)!
------> Reducing kernel's work group size to allowed maximum of: 128 work items
[02:49:22][6532][WARN ] Kernel "kernelPowerSpectrum" exceeds device-specific maximum work group size (requested: 256)!
------> Reducing kernel's work group size to allowed maximum of: 128 work items
[02:49:22][6532][WARN ] Kernel "kernelHarmonicSumming" exceeds device-specific maximum work group size (requested: 256)!
------> Reducing kernel's work group size to allowed maximum of: 128 work items
[02:49:22][6532][WARN ] Kernel "kernelFillFloatBuffer" exceeds device-specific maximum work group size (requested: 256)!
------> Reducing kernel's work group size to allowed maximum of: 128 work items
[02:49:22][6532][INFO ] Continuing work on ../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3193.binary at template no. 5178
[02:49:22][6532][INFO ] Header contents:
------> Original WAPP file: ./p2030.20111110.G39.19-00.79.N.b3s0g0.00000_DM590.10
------> Sample time in microseconds: 65.4762
------> Observation time in seconds: 274.62705
------> Time stamp (MJD): 55875.841883723362
------> Number of samples/record: 0
------> Center freq in MHz: 1214.289551
------> Channel band in MHz: 0.33605957
------> Number of channels/record: 960
------> Nifs: 1
------> RA (J2000): 190527.4746
------> DEC (J2000): 51146.9529991
------> Galactic l: 0
------> Galactic b: 0
------> Name: G39.19-00.79.N
------> Lagformat: 0
------> Sum: 1
------> Level: 3
------> AZ at start: 0
------> ZA at start: 0
------> AST at start: 0
------> LST at start: 0
------> Project ID: --
------> Observers: --
------> File size (bytes): 0
------> Data size (bytes): 0
------> Number of samples: 4194304
------> Trial dispersion measure: 590.1 cm^-3 pc
------> Scale factor: 0.113806
[02:49:26][6532][INFO ] Seed for random number generator is -1007574167.
[02:49:40][6532][INFO ] Derived global search parameters:
------> f_A probability = 0.08
------> single bin prob(P_noise > P_thr) = 9.93986e-009
------> thr1 = 18.4267
------> thr2 = 21.5421
------> thr4 = 26.5915
------> thr8 = 35.0049
------> thr16 = 49.3672
[02:50:03][6532][INFO ] Checkpoint committed!
-snip- Checkpoint info deleted
[03:04:18][6532][INFO ] Checkpoint committed!
[03:05:17][6532][INFO ] OpenCL shutdown complete!
[03:05:18][6532][INFO ] Data processing finished successfully!
[03:05:18][6532][INFO ] Starting data processing...
[debug] BOINC: Trying to find requested ATI OpenCL device #0
[debug] BOINC: Found 1 (1) devices on platform #0
[debug] BOINC: vendor of device #0 on platform #0: Advanced Micro Devices, Inc.
[debug] BOINC: Found requested ATI OpenCL device #0 on platform #0! Returning pointers...
[03:05:18][6532][INFO ] Using OpenCL platform provided by: Advanced Micro Devices, Inc.
[03:05:18][6532][INFO ] Using OpenCL device "Cedar" by: Advanced Micro Devices, Inc.
[03:05:19][6532][INFO ] Checkpoint file unavailable: status.cpt (No such file or directory).
------> Starting from scratch...
[03:05:19][6532][INFO ] Header contents:
------> Original WAPP file: ./p2030.20111110.G39.19-00.79.N.b3s0g0.00000_DM590.60
------> Sample time in microseconds: 65.4762
------> Observation time in seconds: 274.62705
------> Time stamp (MJD): 55875.84188371319
------> Number of samples/record: 0
------> Center freq in MHz: 1214.289551
------> Channel band in MHz: 0.33605957
------> Number of channels/record: 960
------> Nifs: 1
------> RA (J2000): 190527.4746
------> DEC (J2000): 51146.9529991
------> Galactic l: 0
------> Galactic b: 0
------> Name: G39.19-00.79.N
------> Lagformat: 0
------> Sum: 1
------> Level: 3
------> AZ at start: 0
------> ZA at start: 0
------> AST at start: 0
------> LST at start: 0
------> Project ID: --
------> Observers: --
------> File size (bytes): 0
------> Data size (bytes): 0
------> Number of samples: 4194304
------> Trial dispersion measure: 590.6 cm^-3 pc
------> Scale factor: 0.113909
[03:05:23][6532][INFO ] Seed for random number generator is 1108113069.
[03:05:35][6532][INFO ] Derived global search parameters:
------> f_A probability = 0.08
------> single bin prob(P_noise > P_thr) = 9.93986e-009
------> thr1 = 18.4267
------> thr2 = 21.5421
------> thr4 = 26.5915
------> thr8 = 35.0049
------> thr16 = 49.3672
[03:05:37][6532][INFO ] Checkpoint committed!
-snip- Checkpoint info deleted
[04:41:06][6532][INFO ] Checkpoint committed!
[04:41:13][6532][INFO ] OpenCL shutdown complete!
[04:41:13][6532][INFO ] Data processing finished successfully!
[04:41:13][6532][INFO ] Starting data processing...
[debug] BOINC: Trying to find requested ATI OpenCL device #0
[debug] BOINC: Found 1 (1) devices on platform #0
[debug] BOINC: vendor of device #0 on platform #0: Advanced Micro Devices, Inc.
[debug] BOINC: Found requested ATI OpenCL device #0 on platform #0! Returning pointers...
[04:41:14][6532][INFO ] Using OpenCL platform provided by: Advanced Micro Devices, Inc.
[04:41:14][6532][INFO ] Using OpenCL device "Cedar" by: Advanced Micro Devices, Inc.
[04:41:14][6532][INFO ] Checkpoint file unavailable: status.cpt (No such file or directory).
------> Starting from scratch...
[04:41:14][6532][INFO ] Header contents:
------> Original WAPP file: ./p2030.20111110.G39.19-00.79.N.b3s0g0.00000_DM591.10
------> Sample time in microseconds: 65.4762
------> Observation time in seconds: 274.62705
------> Time stamp (MJD): 55875.841883703026
------> Number of samples/record: 0
------> Center freq in MHz: 1214.289551
------> Channel band in MHz: 0.33605957
------> Number of channels/record: 960
------> Nifs: 1
------> RA (J2000): 190527.4746
------> DEC (J2000): 51146.9529991
------> Galactic l: 0
------> Galactic b: 0
------> Name: G39.19-00.79.N
------> Lagformat: 0
------> Sum: 1
------> Level: 3
------> AZ at start: 0
------> ZA at start: 0
------> AST at start: 0
------> LST at start: 0
------> Project ID: --
------> Observers: --
------> File size (bytes): 0
------> Data size (bytes): 0
------> Number of samples: 4194304
------> Trial dispersion measure: 591.1 cm^-3 pc
------> Scale factor: 0.113705
[04:41:18][6532][INFO ] Seed for random number generator is -1014476427.
[04:41:31][6532][INFO ] Derived global search parameters:
------> f_A probability = 0.08
------> single bin prob(P_noise > P_thr) = 9.93986e-009
------> thr1 = 18.4267
------> thr2 = 21.5421
------> thr4 = 26.5915
------> thr8 = 35.0049
------> thr16 = 49.3672
[04:42:07][6532][INFO ] Checkpoint committed!
-snip- Checkpoint info deleted
[06:05:21][6532][INFO ] Checkpoint committed!
[06:05:54][6532][INFO ] OpenCL shutdown complete!
[06:05:54][6532][INFO ] Data processing finished successfully!
[06:05:54][6532][INFO ] Starting data processing...
[debug] BOINC: Trying to find requested ATI OpenCL device #0
[debug] BOINC: Found 1 (1) devices on platform #0
[debug] BOINC: vendor of device #0 on platform #0: Advanced Micro Devices, Inc.
[debug] BOINC: Found requested ATI OpenCL device #0 on platform #0! Returning pointers...
[06:05:54][6532][INFO ] Using OpenCL platform provided by: Advanced Micro Devices, Inc.
[06:05:54][6532][INFO ] Using OpenCL device "Cedar" by: Advanced Micro Devices, Inc.
[06:05:55][6532][INFO ] Checkpoint file unavailable: status.cpt (No such file or directory).
------> Starting from scratch...
[06:05:55][6532][INFO ] Header contents:
------> Original WAPP file: ./p2030.20111110.G39.19-00.79.N.b3s0g0.00000_DM591.60
------> Sample time in microseconds: 65.4762
------> Observation time in seconds: 274.62705
------> Time stamp (MJD): 55875.841883692854
------> Number of samples/record: 0
------> Center freq in MHz: 1214.289551
------> Channel band in MHz: 0.33605957
------> Number of channels/record: 960
------> Nifs: 1
------> RA (J2000): 190527.4746
------> DEC (J2000): 51146.9529991
------> Galactic l: 0
------> Galactic b: 0
------> Name: G39.19-00.79.N
------> Lagformat: 0
------> Sum: 1
------> Level: 3
------> AZ at start: 0
------> ZA at start: 0
------> AST at start: 0
------> LST at start: 0
------> Project ID: --
------> Observers: --
------> File size (bytes): 0
------> Data size (bytes): 0
------> Number of samples: 4194304
------> Trial dispersion measure: 591.6 cm^-3 pc
------> Scale factor: 0.113806
[06:05:59][6532][INFO ] Seed for random number generator is -1005088072.
[06:06:11][6532][INFO ] Derived global search parameters:
------> f_A probability = 0.08
------> single bin prob(P_noise > P_thr) = 9.93986e-009
------> thr1 = 18.4267
------> thr2 = 21.5421
------> thr4 = 26.5915
------> thr8 = 35.0049
------> thr16 = 49.3672
[06:06:21][6532][INFO ] Checkpoint committed!
-snip- Checkpoint info deleted
[07:28:22][6532][INFO ] Checkpoint committed!
[07:28:53][6532][INFO ] OpenCL shutdown complete!
[07:28:53][6532][INFO ] Data processing finished successfully!
[07:28:53][6532][INFO ] Starting data processing...
[debug] BOINC: Trying to find requested ATI OpenCL device #0
[debug] BOINC: Found 1 (1) devices on platform #0
[debug] BOINC: vendor of device #0 on platform #0: Advanced Micro Devices, Inc.
[debug] BOINC: Found requested ATI OpenCL device #0 on platform #0! Returning pointers...
[07:28:53][6532][INFO ] Using OpenCL platform provided by: Advanced Micro Devices, Inc.
[07:28:53][6532][INFO ] Using OpenCL device "Cedar" by: Advanced Micro Devices, Inc.
[07:28:54][6532][INFO ] Checkpoint file unavailable: status.cpt (No such file or directory).
------> Starting from scratch...
[07:28:54][6532][INFO ] Header contents:
------> Original WAPP file: ./p2030.20111110.G39.19-00.79.N.b3s0g0.00000_DM592.10
------> Sample time in microseconds: 65.4762
------> Observation time in seconds: 274.62705
------> Time stamp (MJD): 55875.841883682682
------> Number of samples/record: 0
------> Center freq in MHz: 1214.289551
------> Channel band in MHz: 0.33605957
------> Number of channels/record: 960
------> Nifs: 1
------> RA (J2000): 190527.4746
------> DEC (J2000): 51146.9529991
------> Galactic l: 0
------> Galactic b: 0
------> Name: G39.19-00.79.N
------> Lagformat: 0
------> Sum: 1
------> Level: 3
------> AZ at start: 0
------> ZA at start: 0
------> AST at start: 0
------> LST at start: 0
------> Project ID: --
------> Observers: --
------> File size (bytes): 0
------> Data size (bytes): 0
------> Number of samples: 4194304
------> Trial dispersion measure: 592.1 cm^-3 pc
------> Scale factor: 0.113909
[07:28:58][6532][INFO ] Seed for random number generator is -1002507948.
[07:29:10][6532][INFO ] Derived global search parameters:
------> f_A probability = 0.08
------> single bin prob(P_noise > P_thr) = 9.93986e-009
------> thr1 = 18.4267
------> thr2 = 21.5421
------> thr4 = 26.5915
------> thr8 = 35.0049
------> thr16 = 49.3672
[07:29:22][6532][INFO ] Checkpoint committed!
-snip- Checkpoint info deleted
[08:51:23][6532][INFO ] Checkpoint committed!
[08:52:05][6532][INFO ] OpenCL shutdown complete!
[08:52:05][6532][INFO ] Data processing finished successfully!
[08:52:05][6532][INFO ] Starting data processing...
[debug] BOINC: Trying to find requested ATI OpenCL device #0
[debug] BOINC: Found 1 (1) devices on platform #0
[debug] BOINC: vendor of device #0 on platform #0: Advanced Micro Devices, Inc.
[debug] BOINC: Found requested ATI OpenCL device #0 on platform #0! Returning pointers...
[08:52:05][6532][INFO ] Using OpenCL platform provided by: Advanced Micro Devices, Inc.
[08:52:05][6532][INFO ] Using OpenCL device "Cedar" by: Advanced Micro Devices, Inc.
[08:52:05][6532][INFO ] Checkpoint file unavailable: status.cpt (No such file or directory).
------> Starting from scratch...
[08:52:05][6532][INFO ] Header contents:
------> Original WAPP file: ./p2030.20111110.G39.19-00.79.N.b3s0g0.00000_DM592.60
------> Sample time in microseconds: 65.4762
------> Observation time in seconds: 274.62705
------> Time stamp (MJD): 55875.841883672518
------> Number of samples/record: 0
------> Center freq in MHz: 1214.289551
------> Channel band in MHz: 0.33605957
------> Number of channels/record: 960
------> Nifs: 1
------> RA (J2000): 190527.4746
------> DEC (J2000): 51146.9529991
------> Galactic l: 0
------> Galactic b: 0
------> Name: G39.19-00.79.N
------> Lagformat: 0
------> Sum: 1
------> Level: 3
------> AZ at start: 0
------> ZA at start: 0
------> AST at start: 0
------> LST at start: 0
------> Project ID: --
------> Observers: --
------> File size (bytes): 0
------> Data size (bytes): 0
------> Number of samples: 4194304
------> Trial dispersion measure: 592.6 cm^-3 pc
------> Scale factor: 0.113603
[08:52:09][6532][INFO ] Seed for random number generator is -1037038261.
[08:52:22][6532][INFO ] Derived global search parameters:
------> f_A probability = 0.08
------> single bin prob(P_noise > P_thr) = 9.93986e-009
------> thr1 = 18.4267
------> thr2 = 21.5421
------> thr4 = 26.5915
------> thr8 = 35.0049
------> thr16 = 49.3672
[08:52:23][6532][INFO ] Checkpoint committed!
-snip- Checkpoint info deleted
[10:16:24][6532][INFO ] Checkpoint committed!
[10:17:13][6532][INFO ] OpenCL shutdown complete!
[10:17:13][6532][INFO ] Data processing finished successfully!
[10:17:13][6532][INFO ] Starting data processing...
[debug] BOINC: Trying to find requested ATI OpenCL device #0
[debug] BOINC: Found 1 (1) devices on platform #0
[debug] BOINC: vendor of device #0 on platform #0: Advanced Micro Devices, Inc.
[debug] BOINC: Found requested ATI OpenCL device #0 on platform #0! Returning pointers...
[10:17:13][6532][INFO ] Using OpenCL platform provided by: Advanced Micro Devices, Inc.
[10:17:13][6532][INFO ] Using OpenCL device "Cedar" by: Advanced Micro Devices, Inc.
[10:17:14][6532][INFO ] Checkpoint file unavailable: status.cpt (No such file or directory).
------> Starting from scratch...
[10:17:14][6532][INFO ] Header contents:
------> Original WAPP file: ./p2030.20111110.G39.19-00.79.N.b3s0g0.00000_DM593.10
------> Sample time in microseconds: 65.4762
------> Observation time in seconds: 274.62705
------> Time stamp (MJD): 55875.841883662346
------> Number of samples/record: 0
------> Center freq in MHz: 1214.289551
------> Channel band in MHz: 0.33605957
------> Number of channels/record: 960
------> Nifs: 1
------> RA (J2000): 190527.4746
------> DEC (J2000): 51146.9529991
------> Galactic l: 0
------> Galactic b: 0
------> Name: G39.19-00.79.N
------> Lagformat: 0
------> Sum: 1
------> Level: 3
------> AZ at start: 0
------> ZA at start: 0
------> AST at start: 0
------> LST at start: 0
------> Project ID: --
------> Observers: --
------> File size (bytes): 0
------> Data size (bytes): 0
------> Number of samples: 4194304
------> Trial dispersion measure: 593.1 cm^-3 pc
------> Scale factor: 0.113705
[10:17:19][6532][INFO ] Seed for random number generator is -998883183.
[10:17:32][6532][INFO ] Derived global search parameters:
------> f_A probability = 0.08
------> single bin prob(P_noise > P_thr) = 9.93986e-009
------> thr1 = 18.4267
------> thr2 = 21.5421
------> thr4 = 26.5915
------> thr8 = 35.0049
------> thr16 = 49.3672
[10:17:33][6532][INFO ] Checkpoint committed!
-snip- Checkpoint info deleted
[11:42:44][6532][INFO ] Checkpoint committed!
[11:43:43][6532][INFO ] OpenCL shutdown complete!
[11:43:43][6532][INFO ] Data processing finished successfully!
11:43:43 (6532): called boinc_finish
Activated exception handling...
[14:08:39][6412][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_0' already exists - skipping pass
[14:08:39][6412][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_1' already exists - skipping pass
[14:08:39][6412][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_2' already exists - skipping pass
[14:08:39][6412][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_3' already exists - skipping pass
[14:08:39][6412][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_4' already exists - skipping pass
[14:08:39][6412][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_5' already exists - skipping pass
[14:08:39][6412][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_6' already exists - skipping pass
[14:08:39][6412][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_7' already exists - skipping pass
14:08:39 (6412): called boinc_finish
Activated exception handling...
[18:38:56][5628][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_0' already exists - skipping pass
[18:38:56][5628][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_1' already exists - skipping pass
[18:38:56][5628][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_2' already exists - skipping pass
[18:38:56][5628][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_3' already exists - skipping pass
[18:38:56][5628][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_4' already exists - skipping pass
[18:38:56][5628][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_5' already exists - skipping pass
[18:38:56][5628][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_6' already exists - skipping pass
[18:38:56][5628][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_7' already exists - skipping pass
18:38:56 (5628): called boinc_finish
Activated exception handling...
[21:05:51][5848][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_0' already exists - skipping pass
[21:05:51][5848][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_1' already exists - skipping pass
[21:05:51][5848][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_2' already exists - skipping pass
[21:05:52][5848][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_3' already exists - skipping pass
[21:05:52][5848][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_4' already exists - skipping pass
[21:05:52][5848][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_5' already exists - skipping pass
[21:05:52][5848][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_6' already exists - skipping pass
[21:05:52][5848][INFO ] Output file: '../../projects/albert.phys.uwm.edu/p2030.20111110.G39.19-00.79.N.b3s0g0.00000_3192_1_7' already exists - skipping pass
21:05:52 (5848): called boinc_finish


And still running.....
[/Edit]
____________
Christoph

Christoph
Send message
Joined: 25 Aug 05
Posts: 48
Credit: 148,613
RAC: 15
Message 111870 - Posted: 26 Feb 2012 | 21:11:10 UTC

Here is one of mine which is inconclusiv.

http://albert.phys.uwm.edu/workunit.php?wuid=39325

Mine did take the 40000+ seconds. I did check the following numbers:

Mine:
[05:58:36][4380][INFO ] Derived global search parameters:
------> f_A probability = 0.08
------> single bin prob(P_noise > P_thr) = 9.93986e-009
------> thr1 = 18.4267
------> thr2 = 21.5421
------> thr4 = 26.5915
------> thr8 = 35.0049
------> thr16 = 49.3672

Other host:
[18:24:08][20563][INFO ] Derived global search parameters:
------> f_A probability = 0.08
------> single bin prob(P_noise > P_thr) = 9.93986e-09
------> thr1 = 18.4267
------> thr2 = 21.5421
------> thr4 = 26.5915
------> thr8 = 35.0049
------> thr16 = 49.3672

So, probably the wrong numbers to look for, because they match.
____________
Christoph

Christoph
Send message
Joined: 25 Aug 05
Posts: 48
Credit: 148,613
RAC: 15
Message 111871 - Posted: 26 Feb 2012 | 22:00:48 UTC - in response to Message 111869.

I have one task still running which is stuck with 100%. I did stop BOINC and now after restarting it is still there and 'running'.
I did save the slot files so tell me what you need.

-snip-

And still running.....
[/Edit]


So in the meantime I wanted to play a game. It did crash during starting so I did a system reboot.
That kicked the hanging task. I did already update the project so you can see the result when you follow the previous posted link.

Christoph
____________
Christoph

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111872 - Posted: 27 Feb 2012 | 8:49:51 UTC - in response to Message 111871.

Looks like a BOINC issue. Thanks for reporting.

Oliver

Christoph
Send message
Joined: 25 Aug 05
Posts: 48
Credit: 148,613
RAC: 15
Message 111873 - Posted: 27 Feb 2012 | 10:46:56 UTC - in response to Message 111872.

You are welcome. So I will upgrade to 7.0.18. and observe. Still running 7.0.15.
____________
Christoph

Christoph
Send message
Joined: 25 Aug 05
Posts: 48
Credit: 148,613
RAC: 15
Message 111874 - Posted: 27 Feb 2012 | 13:08:54 UTC

Since there are not many posts about invalid, I will post the links to WUs which validated for me but not to somebody else.

http://albert.phys.uwm.edu/workunit.php?wuid=38770
http://albert.phys.uwm.edu/workunit.php?wuid=35682
http://albert.phys.uwm.edu/workunit.php?wuid=39490

If you get this info anyway I guess you will tell me. No need to do not necessary posts.

Christoph
____________
Christoph

Profile mickydl*
Send message
Joined: 8 Dec 11
Posts: 6
Credit: 6,000
RAC: 0
Message 111875 - Posted: 27 Feb 2012 | 13:17:46 UTC

I have run one WU (this one).
It's still waiting for validation but I didn't have any problems (except that the app uses a full CPU core). The host is this one.

mickydl*

Profile Viking69
Avatar
Send message
Joined: 2 Mar 05
Posts: 5
Credit: 25,597
RAC: 5
Message 111898 - Posted: 3 Mar 2012 | 6:21:51 UTC

12 hours, said 100% but had not completed. I stopped another task to let this one finish and about 30 seconds later "computation error"!

GRRR.http://albert.phys.uwm.edu/result.php?resultid=123404

WHY? I can't make out this stuff.
____________

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111899 - Posted: 3 Mar 2012 | 10:17:22 UTC

About run times.
One of my tasks finished on my ATI HD6850 2GB versus an Nvidia GTX570:
http://albert.phys.uwm.edu/workunit.php?wuid=40741

I'd say, continue sending work like this to CUDA only. What's the use, really, when OpenCL is so poorly slow? Or it's OpenCL on my GPU that's so poorly slow, seeing how the next task was a clincher to see who validated on an all OpenCL show. All I know is that that task restarted multiple times, and these OpenCLs don't like to be restarted.

Anyway, I've for now run my cache dry, in anticipation of my new motherboard and CPU. If all is well, I'll be changing later today, from my present Asrock H55DE3 with an i3-530, going to an Asrock Extreme3 Gen3 with an i5-2500K (Although I wished it could've been an i7-2600K).

Wonder what that does for Windows 7... ;-)
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Alex
Send message
Joined: 1 Mar 05
Posts: 58
Credit: 313,531
RAC: 237
Message 111954 - Posted: 11 Apr 2012 | 10:08:39 UTC
Last modified: 11 Apr 2012 | 10:12:19 UTC

I've tried to run the ati openCL app on my main system.
They all failed after few seconds, sometimes with a blue screen.

<core_client_version>7.0.25</core_client_version>
<![CDATA[
<message>
couldn't start Input file p2030.20111210.G37.27-00.02.S.b3s0g0.00000_170.bin4 missing or invalid: -119: -119
</message>
]]>

Name p2030.20111210.G37.27-00.02.S.b3s0g0.00000_168_0
Workunit 59467
Created 8 Apr 2012 | 13:41:00 UTC
Sent 10 Apr 2012 | 22:25:25 UTC
Received 11 Apr 2012 | 6:45:01 UTC
Server state Over
Outcome Computation error
Client state Compute error
Exit status -185 (0xffffffffffffff47)
Computer ID 1767
Report deadline 24 Apr 2012 | 22:25:25 UTC
Run time 0.00
CPU time 0.00
Validate state Invalid
Credit 0.00
Application version Binary Radio Pulsar Search v1.22 (atiOpenCL)

They also fail when running only one app per GPU.

Any ideas?

Alexander

Edit
Updated CCC to 12.3 and BM to 7.0.25 before.
____________

Alex
Send message
Joined: 1 Mar 05
Posts: 58
Credit: 313,531
RAC: 237
Message 111955 - Posted: 11 Apr 2012 | 19:29:22 UTC

I just made a complete reset of the project on my mainsys; looks like that solved the problem.
The first task is running now for > 9 min which did never happen before.
Can it be that some of the files in the albert-directory were outdated and not replaced by downloading wu's? I did run albert wu's some months before.
____________

Alex
Send message
Joined: 1 Mar 05
Posts: 58
Credit: 313,531
RAC: 237
Message 111956 - Posted: 11 Apr 2012 | 20:50:40 UTC

2 Bluescreens later:

That (censored) (censored)(censored) ATI-Driver!
I googled for Bluescreen atikmpag.sys windows 7
and found really a lot of postings pointing to that problem. Sorry, no solution found.

Looks like I need to detach from that project on my mainsys for now, waiting for ATI updates.
It's intresting, MW, Collatz, Seti and donation(gpugrid donation project) all work fine.

OK, always look at the bright side of life
____________

Post to thread

Message boards : Problems and Bug Reports : [OpenCL] app v1.20/v1.21 feedback thread


Home · Your account · Message boards

This material is based upon work supported by the National Science Foundation (NSF) under Grant PHY-0555655 and by the Max Planck Gesellschaft (MPG). Any opinions, findings, and conclusions or recommendations expressed in this material are those of the investigators and do not necessarily reflect the views of the NSF or the MPG.

Copyright © 2013 Bruce Allen for the LIGO Scientific Collaboration