Sending work

log in

Advanced search

Message boards : News : Sending work

Author Message
Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111188 - Posted: 31 Oct 2011 | 8:42:08 UTC

Albert@home is still an unofficial, non-public test project. Don't expect anything to work here. The only type of work Albert@home is currently sending out is for a highly experimental BRP4 OpenCL application.

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111189 - Posted: 1 Nov 2011 | 0:35:56 UTC - in response to Message 111188.
Last modified: 1 Nov 2011 | 0:36:34 UTC

Great, will expect doom and hell-fire. :D

Ok, so here as well, something breaks the signature. Enigma has the same problem.
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Conan
Avatar
Send message
Joined: 19 Jun 05
Posts: 129
Credit: 74,167
RAC: 1,032
Message 111190 - Posted: 1 Nov 2011 | 8:19:56 UTC - in response to Message 111188.

Albert@home is still an unofficial, non-public test project. Don't expect anything to work here. The only type of work Albert@home is currently sending out is for a highly experimental BRP4 OpenCL application.


There is no option in the preferences to select ATI graphic cards so the OpenCL will only be done with Nvidia at the moment.

Conan
____________

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111192 - Posted: 1 Nov 2011 | 10:47:34 UTC - in response to Message 111190.
Last modified: 1 Nov 2011 | 10:51:24 UTC

Thanks for reporting. Although I think due to the way this setting is handled internally you shouldn't be able to opt-out of ATI 'work' right now.

Anyway, we'll add this.

BM

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111193 - Posted: 1 Nov 2011 | 10:56:28 UTC - in response to Message 111189.
Last modified: 1 Nov 2011 | 11:01:28 UTC

Ok, so here as well, something breaks the signature. Enigma has the same problem.


Honestly I don't care much about sigs at all (I usually view posts w/o sig), and in particular on Albert. This is about the last thing I intend to test here.

The problem may occur because Albert doesn't export any stats, or something else.

Can you edit your preferences and set the signature again from scratch?

BM

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111194 - Posted: 1 Nov 2011 | 13:14:26 UTC - in response to Message 111193.
Last modified: 1 Nov 2011 | 13:16:57 UTC

Nah, that doesn't fix it but as you said, it's not a priority. The infinite work duration tasks that are sent... now that's another thing. ;-)

Although I think due to the way this setting is handled internally you shouldn't be able to opt-out of ATI 'work' right now.

Actually, that is possible with the <ignore_ati_dev/> option in the clients that can do that, and possibly with the <exclude_gpu/> option in the latest test clients.

Then it'll just do:
2011-11-01 05:13:40.7956 [PID=8972 ] [version] Checking plan class 'ATIOpenCL'
2011-11-01 05:13:40.7956 [PID=8972 ] [version] No ATI devices found
2011-11-01 05:13:40.7956 [PID=8972 ] [version] [AV#443] app_plan() returned false
2011-11-01 05:13:40.7956 [PID=8972 ] [version] Checking plan class 'NVOpenCL'
2011-11-01 05:13:40.7956 [PID=8972 ] [version] No NVidia devices found
2011-11-01 05:13:40.7956 [PID=8972 ] [version] [AV#440] app_plan() returned false

But really, the BOINC back-end code should self-detect whether an ATI application is available in http://albert.phys.uwm.edu/apps.php and then automatically enable the "Use ATI GPU" preference. Apparently it doesn't work yet with your used version of the back-end and OpenCL applications.
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111195 - Posted: 1 Nov 2011 | 14:48:16 UTC - in response to Message 111192.

Thanks for reporting. Although I think due to the way this setting is handled internally you shouldn't be able to opt-out of ATI 'work' right now.

Anyway, we'll add this.

BM


Done.

BM

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111196 - Posted: 1 Nov 2011 | 14:52:00 UTC - in response to Message 111194.

But really, the BOINC back-end code should self-detect whether an ATI application is available in http://albert.phys.uwm.edu/apps.php and then automatically enable the "Use ATI GPU" preference. Apparently it doesn't work yet with your used version of the back-end and OpenCL applications.


Yah, we didn't look into that clever piece of code when we set up the plan classes. It did only work if your plan class name contained the string "ati" (lowercase).

BM

STE\/E
Send message
Joined: 18 Jan 05
Posts: 142
Credit: 3,507,007
RAC: 181
Message 111201 - Posted: 2 Nov 2011 | 14:21:45 UTC
Last modified: 2 Nov 2011 | 14:22:08 UTC

Nothing but Computation errors so far on the OpenCL Wu's after just a few seconds. Tried 11.6 & 11.9 Drivers 5850 & 5870 Cards ...
____________
STE\/E

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111202 - Posted: 2 Nov 2011 | 14:28:47 UTC - in response to Message 111201.
Last modified: 2 Nov 2011 | 14:29:39 UTC

The Linux App appears to work, at least on one of our machines.

I guess this is on Windows, where we are observing the same errors. FWIW the App works when running standalone, it crashes only when running under the BOINC Client.

BM

Profile x3mEn
Send message
Joined: 21 Jun 11
Posts: 9
Credit: 10,000
RAC: 0
Message 111204 - Posted: 2 Nov 2011 | 20:45:27 UTC
Last modified: 2 Nov 2011 | 20:45:44 UTC

[20:05:56][5684][INFO ] Starting data processing...
[20:05:56][5684][ERROR] Failed to get OpenCL platform/device info from BOINC (error: -31)!
[20:05:56][5684][ERROR] Demodulation failed (error: -31)!

http://albert.phys.uwm.edu/result.php?resultid=35469

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111208 - Posted: 3 Nov 2011 | 21:59:45 UTC - in response to Message 111202.

FWIW the App works when running standalone, it crashes only when running under the BOINC Client.

BM

How do you run the app stand-alone? It's not an executable.
It's called einsteinbinary_BRP4_1.09_windows_intelx86__ATIOpenCL ... no .exe Could that be the problem?

D:\ProgramData\BOINC\projects\albert.phys.uwm.edu\einsteinbinary_BRP4_1.09_windows_intelx86__ATIOpenCL caused an Access Violation at location 007b481c in module D:\ProgramData\BOINC\projects\albert.phys.uwm.edu\einsteinbinary_BRP4_1.09_windows_intelx86__ATIOpenCL Writing to location 00000003.

____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111209 - Posted: 3 Nov 2011 | 22:39:14 UTC
Last modified: 3 Nov 2011 | 23:25:52 UTC

OK, I got the new AtiOpenCL app 1.11, got a bucket load of work for it as well, but it doesn't start. I even suspended all my other work and still the OpenCL tasks didn't start.

So I have reset my preferences, set them back to use both CPU & GPU. Doesn't matter. I took out the <exclude_gpu/> option from cc_config.xml as it clearly didn't work yet, but that doesn't matter either.

Now I am confused. :)

For the doubters:
03/11/2011 23:32:02 | | ATI GPU 0: ATI Radeon HD 4700/4800 (RV740/RV770) (CAL version 1.4.1417, 1024MB, 992MB available, 2000 GFLOPS peak)
03/11/2011 23:32:02 | | OpenCL: ATI GPU 0: ATI RV770 (driver version CAL 1.4.1417, device version OpenCL 1.0 AMD-APP-SDK-v2.5 (775.2), 1024MB)
03/11/2011 23:32:02 | | No NVIDIA library found
03/11/2011 23:32:02 | | ATI GPU is OpenCL-capable
Catalysts 11.6 installed.

Edit: tried running the app stand-alone.
Activated exception handling...
00:24:28 (4688): Can't open init data file - running in standalone mode
00:24:28 (4688): Can't open init data file - running in standalone mode
[00:24:28][4688][WARN ] User/host details unavailable...
00:24:28 (4688): called boinc_finish
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Jon Fox
Send message
Joined: 9 Feb 05
Posts: 7
Credit: 111,168
RAC: 0
Message 111210 - Posted: 4 Nov 2011 | 3:43:02 UTC

successfully received work for 2011 iMac/10.7.2 (Darwin 11.2)/ATI Radeon HD 6770M/

Start-up messages:

Wed Nov 2 23:05:55 2011 | | OpenCL: ATI GPU 0: ATI Radeon HD 6770M (driver version 1.0, device version OpenCL 1.1, 512MB)
Wed Nov 2 23:05:55 2011 | | ATI GPU is OpenCL-capable


Work Unit Start messages:

Thu Nov 3 20:27:03 2011 | Albert@Home | Starting task p2030.20100913.G48.73+01.03.S.b5s0g0.00000_1968_3 using einsteinbinary_BRP4 version 110 (ATIOpenCL)


WU started with an estimated remaining time of ~ 1:20:00. It has been processing for 2:10:00 and now estimates 3:46:00 remaining. Status reports: (0.20 CPUs and 1.00 ATI GPUs).

Jon

I'll post updates as processing continues/aborts/etc.

____________

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111211 - Posted: 4 Nov 2011 | 9:50:39 UTC - in response to Message 111209.

Hmm, OK, so all that work mysteriously ran and erred within seconds. Th error is still the same as with 1.09

D:\ProgramData\BOINC\projects\albert.phys.uwm.edu\einsteinbinary_BRP4_1.11_windows_intelx86__atiOpenCL.exe caused an Access Violation at location 007b481c in module D:\ProgramData\BOINC\projects\albert.phys.uwm.edu\einsteinbinary_BRP4_1.11_windows_intelx86__atiOpenCL.exe Writing to location 00000003.


My system has gone through several reboots yesterday, prior to it attacking the 1.11 work.
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Jon Fox
Send message
Joined: 9 Feb 05
Posts: 7
Credit: 111,168
RAC: 0
Message 111212 - Posted: 4 Nov 2011 | 10:49:07 UTC - in response to Message 111210.

Appear to have completed the first WU successfully:

Fri Nov 4 04:59:49 2011 | Albert@Home | Computation for task p2030.20100913.G48.73+01.03.S.b5s0g0.00000_1968_3 finished

~8:20:00 of elapsed time. A second work request was made for ATI work and received 47 WUs (more than I really wanted but I'll let it run out as long as there are no major issues).

Jon

____________

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111215 - Posted: 7 Nov 2011 | 10:29:17 UTC - in response to Message 111212.

Great!

Status update:

Currently the Mac OS ATI App appears to be the only OpenCL App that's working properly:
* Windows OpenCL Apps still crash very early when running under BOINC
* The Linux OpenCL App crashed after 6/8 (on our machine), probably due to a memory access problem
* NVidia OpenCL App run but give way wrong results, possibly originating from the same memory access problem

We've disabled the Apps we know to give unusable results, will push out new Apps when ready for testing.

CPU results are still helpful for comparison.

The "zero flops" / "negative FLOPs left -1.#IND00" issue that occurred for CPU plan-class Apps with recent Clients should be fixed now.

BM

Profile x3mEn
Send message
Joined: 21 Jun 11
Posts: 9
Credit: 10,000
RAC: 0
Message 111218 - Posted: 9 Nov 2011 | 19:30:54 UTC

[21:26:46][3292][INFO ] Starting data processing...
GPU type not found in init_data.xml
[21:26:46][3292][ERROR] Failed to get OpenCL platform/device info from BOINC (error: -161)!
[21:26:46][3292][ERROR] Demodulation failed (error: -161)!
[21:26:46][3292][DEBUG] Shutting down BOINC... Bye!
21:26:46 (3292): called boinc_finish

http://albert.phys.uwm.edu/result.php?resultid=37885

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111219 - Posted: 9 Nov 2011 | 20:02:28 UTC - in response to Message 111218.
Last modified: 9 Nov 2011 | 20:03:35 UTC

I wouldn't expect OpenCL to work with clients older than 6.13.

Reminds me: we should probably add a client version restriction for these apps.

BM

Senilix
Send message
Joined: 27 Oct 11
Posts: 1
Credit: 10,497
RAC: 0
Message 111220 - Posted: 9 Nov 2011 | 20:36:23 UTC - in response to Message 111219.

I got the infamous "GPU type not found in init_data.xml" errror with the recommended BOINC client version 6.12.34. Then I decided to update my BOINC client to version 6.13.10 (pre-release).

Now the "GPU type not found in init_data.xml" error is gone, but the computation ends with an access violation, see here.

Profile x3mEn
Send message
Joined: 21 Jun 11
Posts: 9
Credit: 10,000
RAC: 0
Message 111221 - Posted: 9 Nov 2011 | 21:22:03 UTC - in response to Message 111219.

I wouldn't expect OpenCL to work with clients older than 6.13.

Reminds me: we should probably add a client version restriction for these apps.

BM


FYI, look at this: http://setiathome.berkeley.edu/result.php?resultid=2146353363

OpenCL platform detected: NVIDIA Corporation
OpenCL platform detected: Advanced Micro Devices, Inc.

and all other helpful info about OpenCL detection with BOINC 6.10.60

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111223 - Posted: 9 Nov 2011 | 23:34:11 UTC - in response to Message 111221.

So? Just because Lunatics built a 3rd party application for Seti only that does the OpenCL detection, does not mean that the BOINC you're running does the OpenCL detection. This project uses the OpenCL detection built into the 6.13 clients to determine all OpenCL capable co-processors.

Added to that, as far as I know, the application here does not require an SSE3 capable CPU. The Lunatics app does.


10/11/2011 00:32:01 | | ATI GPU 0: ATI Radeon HD 4700/4800 (RV740/RV770) (CAL version 1.4.1417, 1024MB, 992MB available, 2000 GFLOPS peak)
10/11/2011 00:32:01 | | OpenCL: ATI GPU 0: ATI RV770 (driver version CAL 1.4.1417, device version OpenCL 1.0 AMD-APP-SDK-v2.5 (775.2), 1024MB)
10/11/2011 00:32:01 | | ATI GPU is OpenCL-capable

____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile x3mEn
Send message
Joined: 21 Jun 11
Posts: 9
Credit: 10,000
RAC: 0
Message 111224 - Posted: 10 Nov 2011 | 7:26:33 UTC - in response to Message 111223.

So? Just because Lunatics built a 3rd party application for Seti only that does the OpenCL detection, does not mean that the BOINC you're running does the OpenCL detection. This project uses the OpenCL detection built into the 6.13 clients to determine all OpenCL capable co-processors.

Ok. Fine. Clear.
I'll try 6.13 some later, despite the fact that I don't like > 6.10 interface without Messages tab.

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111225 - Posted: 10 Nov 2011 | 10:26:45 UTC - in response to Message 111224.
Last modified: 10 Nov 2011 | 10:29:17 UTC

Then keep the boincmgr.exe and boinc.dll from your present installation. You can run a newer client with an older BOINC Manager.

But the messages aren't gone, they are just in a window of their own, reachable by clicking advanced->Event Log, or pressing CTRL+SHIFT+E.

As for 6.13, do know that even the latest client, 6.13.10 is highly unstable. I have managed to get it to crash 3 times already by just running this project. The 6.13s aren't available for long time use, they're test versions on the way to BOINC 7.
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111226 - Posted: 10 Nov 2011 | 15:22:35 UTC
Last modified: 10 Nov 2011 | 15:53:45 UTC

FYI, we just updated the AMD OpenCL binaries for Windows, Mac OS 10.7 and Linux (all OpenCL 1.0, minimum BOINC version: 6.13.x).

Current status:
* Windows: finally works, no known issues
* Mac OS X: no known issues (10.7 support only!)
* Linux: might use full CPU core, please confirm

Thanks for testing!

Oliver

Profile x3mEn
Send message
Joined: 21 Jun 11
Posts: 9
Credit: 10,000
RAC: 0
Message 111227 - Posted: 10 Nov 2011 | 15:45:02 UTC - in response to Message 111225.

But the messages aren't gone, they are just in a window of their own, reachable by clicking advanced->Event Log, or pressing CTRL+SHIFT+E.

I know, but it's inconvenient to use.

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111228 - Posted: 10 Nov 2011 | 16:13:20 UTC - in response to Message 111226.
Last modified: 10 Nov 2011 | 16:18:56 UTC

Current status:
* Windows: finally works, no known issues

Yes, look at that. v1.17 is finally running longer than 4 seconds. :-)
Still... not out of the woods yet. At 90 seconds now I have a computation error.

http://albert.phys.uwm.edu/result.php?resultid=38165
http://albert.phys.uwm.edu/result.php?resultid=38123

And now they all end with error at 43 seconds.

<core_client_version>6.13.10</core_client_version>
<![CDATA[
<message>
There was an error while deleting the color transform. (0x7e3) - exit code 2019 (0x7e3)
</message>
<stderr_txt>
Activated exception handling...
[17:10:04][3420][INFO ] Starting data processing...
[17:10:05][3420][INFO ] Using OpenCL platform provided by: Advanced Micro Devices, Inc.
[17:10:05][3420][INFO ] Using OpenCL device "ATI RV770" by: Advanced Micro Devices, Inc.
[17:10:07][3420][INFO ] Checkpoint file unavailable: status.cpt (No such file or directory).
------> Starting from scratch...
[17:10:07][3420][INFO ] Header contents:
------> Original WAPP file: ./p2030.20100913.G48.73+01.03.S.b6s0g0.00000_DM24.80
------> Sample time in microseconds: 65.4762
------> Observation time in seconds: 274.62705
------> Time stamp (MJD): 55453.031586060431
------> Number of samples/record: 0
------> Center freq in MHz: 1214.289551
------> Channel band in MHz: 0.33605957
------> Number of channels/record: 960
------> Nifs: 1
------> RA (J2000): 191718.8153
------> DEC (J2000): 143053.3507
------> Galactic l: 0
------> Galactic b: 0
------> Name: G48.73+01.03.S
------> Lagformat: 0
------> Sum: 1
------> Level: 3
------> AZ at start: 0
------> ZA at start: 0
------> AST at start: 0
------> LST at start: 0
------> Project ID: --
------> Observers: --
------> File size (bytes): 0
------> Data size (bytes): 0
------> Number of samples: 4194304
------> Trial dispersion measure: 24.8 cm^-3 pc
------> Scale factor: 0.114462
[17:10:17][3420][INFO ] Seed for random number generator is -1018057414.
[17:10:48][3420][INFO ] Derived global search parameters:
------> f_A probability = 0.08
------> single bin prob(P_noise > P_thr) = 9.93986e-009
------> thr1 = 18.4267
------> thr2 = 21.5421
------> thr4 = 26.5915
------> thr8 = 35.0049
------> thr16 = 49.3672
[17:10:49][3420][ERROR] Error during OpenCL kernel setup: TSMR-1 (error: -54)
[17:10:49][3420][ERROR] Demodulation failed (error: 2019)!
17:10:49 (3420): called boinc_finish

</stderr_txt>
]]>

So stop deleting the color transform. ;-)
Task list: http://albert.phys.uwm.edu/results.php?userid=7430

While running that work, GPU load is zero according to GPU-Z:
Date , GPU Core Clock [MHz] , GPU Memory Clock [MHz] , GPU Temperature [°C] , Fan Speed [%] , GPU Load [%] , Fan Speed [RPM] , GPU Temp.(DISPIO) [°C] , GPU Temp.(MEMIO) [°C] , GPU Temp.(SHADERCORE) [°C] ,

2010-07-26 08:23:45 , 500.0 , 700.0 , 39.0 , 27 , 1 , - , 39.0 , 38.5 , 34.5 ,

2010-07-26 08:23:46 , 500.0 , 700.0 , 39.0 , 27 , 1 , - , 39.0 , 38.0 , 34.5 ,

2010-07-26 08:23:47 , 500.0 , 700.0 , 39.0 , 27 , 1 , - , 39.5 , 38.5 , 34.5 ,

2010-07-26 08:26:54 , 500.0 , 700.0 , 38.0 , 27 , 0 , - , 37.5 , 36.5 , 33.5 ,

____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111229 - Posted: 10 Nov 2011 | 16:26:13 UTC - in response to Message 111228.

Thanks Jord! That's really helpful.

I need to change some settings as your GPU is not able to handle our current parallel thread block structure. I need to check the specs of AMD's GPUs again...

I'll deploy an updated binary as soon as possible.

Cheers,
Oliver

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111230 - Posted: 10 Nov 2011 | 16:29:24 UTC

I rebooted, just to make sure it wasn't something stuck in my GPU.
After reboot, I first started GPU-Z, then BOINC. No load on the GPU. Tasks don't run longer than a minute so far.

In comparison, running Furmark will do this:

2011-11-10 17:27:39 , 625.0 , 950.0 , 54.0 , 30 , - , 100 , 54.0 , 54.0 , 51.5 , 55 , 28 , 1.120 ,

2011-11-10 17:27:40 , 625.0 , 950.0 , 55.0 , 30 , - , 100 , 54.5 , 54.0 , 51.0 , 55 , 28 , 1.120 ,

2011-11-10 17:27:41 , 625.0 , 950.0 , 55.0 , 30 , - , 100 , 54.5 , 54.5 , 52.0 , 55 , 28 , 1.120 ,

2011-11-10 17:27:42 , 625.0 , 950.0 , 55.0 , 30 , - , 100 , 55.5 , 54.5 , 51.5 , 55 , 28 , 1.120 ,

2011-11-10 17:27:43 , 625.0 , 950.0 , 55.0 , 30 , - , 100 , 55.0 , 54.0 , 51.5 , 55 , 28 , 1.120 ,

See the load?
(And yes, I also see that the fan is either not recognized or not working...)
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile x3mEn
Send message
Joined: 21 Jun 11
Posts: 9
Credit: 10,000
RAC: 0
Message 111231 - Posted: 10 Nov 2011 | 17:10:38 UTC

The same result as Ageless already has showed:
http://albert.phys.uwm.edu/result.php?resultid=38235

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111232 - Posted: 10 Nov 2011 | 17:38:23 UTC - in response to Message 111231.

The same result as Ageless already has showed:
http://albert.phys.uwm.edu/result.php?resultid=38235


Yep, you got the same device series (HD 4xxx) as Jord. The issue is understood and I'm working on a fix, but it's not that easy...

You guys should opt-out of GPU tasks until the next version.


Hang in there,
Oliver

STE\/E
Send message
Joined: 18 Jan 05
Posts: 142
Credit: 3,507,007
RAC: 181
Message 111233 - Posted: 10 Nov 2011 | 19:20:41 UTC
Last modified: 10 Nov 2011 | 19:43:23 UTC

After Upgrading to v6.13.10 on my ATI Box I now have 2 Wu's running, 1 on a 5870 & 1 on a 5850 ... Very inefficient though, using only 29% & 25% Of the GPU ... Need a App File to possibly run 2 or maybe 3 @ a time if possible ...
____________
STE\/E

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111234 - Posted: 10 Nov 2011 | 22:07:56 UTC - in response to Message 111226.

FYI, we just updated the AMD OpenCL binaries for Windows, Mac OS 10.7 and Linux (all OpenCL 1.0, minimum BOINC version: 6.13.x).

Current status:
* Mac OS X: no known issues (10.7 support only!)

Oliver, Mac OS X 10.7 does OpenCL 1.1 already, for OpenCL 1.0 OS X 10.6 is also fine.
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile tazzduke
Send message
Joined: 10 Nov 11
Posts: 1
Credit: 307,410
RAC: 661
Message 111235 - Posted: 11 Nov 2011 | 1:04:39 UTC - in response to Message 111234.

Greetings

Running a Win 7 x64 box with an HD 5850 on Catalyst 11.9, Boinc 6.13.10 x64, rxing ATI OpenCL work (done so via my preferences), completed my first work unit

http://albert.phys.uwm.edu/result.php?resultid=38178 (Computer ID 1323) Pending

So far no errors, have still 47 tasks that I am working my way through,

Am glad to jump onboard and help out

Keep the work coming and Happy Crunching

Regards
Tazzduke

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111236 - Posted: 11 Nov 2011 | 8:21:28 UTC - in response to Message 111234.
Last modified: 11 Nov 2011 | 8:36:37 UTC


Oliver, Mac OS X 10.7 does OpenCL 1.1 already, for OpenCL 1.0 OS X 10.6 is also fine.


I know.

a) We have an OpenCL 1.1 version, but it would constrain the list of compatible GPUs.
b) 10.6 is buggy and Apple doesn't care about it, hence we can't support it.

Oliver

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111238 - Posted: 14 Nov 2011 | 10:04:11 UTC - in response to Message 111232.
Last modified: 14 Nov 2011 | 14:11:57 UTC

Hi guys,


Yep, you got the same device series (HD 4xxx) as Jord. The issue is understood and I'm working on a fix, but it's not that easy...

You guys should opt-out of GPU tasks until the next version.


We deployed updated binaries (v1.19) that should adjust dynamically to the specific hardware they run on. This should fix the issues you've encountered and ensure support for lower-end GPUs like the HD 4xxx series.

*Finger's crossed*

Oliver

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111244 - Posted: 14 Nov 2011 | 15:23:46 UTC - in response to Message 111238.
Last modified: 14 Nov 2011 | 15:30:42 UTC

*Finger's crossed*

Must be difficult to type with crossed fingers. I just tried it. So please uncross. It seems to be working. ;)

GPU-Z:
Date , GPU Core Clock [MHz] , GPU Memory Clock [MHz] , GPU Temperature [°C] , Fan Speed (%) [%] , Fan Speed (RPM) [RPM] , GPU Load [%] , GPU Temp.(DISPIO) [°C] , GPU Temp.(MEMIO) [°C] , GPU Temp.(SHADERCORE) [°C] , Memory Usage (Dedicated) [MB] , Memory Usage (Dynamic) [MB] , VDDC [V] ,
2011-11-14 16:22:02 , 625.0 , 950.0 , 42.0 , 27 , - , 40 , 42.5 , 42.0 , 38.0 , 495 , 45 , 1.120 ,
2011-11-14 16:22:03 , 625.0 , 950.0 , 43.0 , 27 , - , 41 , 43.0 , 42.0 , 38.5 , 495 , 45 , 1.120 ,
2011-11-14 16:22:04 , 625.0 , 950.0 , 43.0 , 27 , - , 50 , 42.5 , 41.5 , 38.0 , 495 , 45 , 1.120 ,
2011-11-14 16:22:05 , 625.0 , 950.0 , 43.0 , 27 , - , 49 , 43.0 , 42.5 , 38.5 , 495 , 45 , 1.120 ,
2011-11-14 16:22:06 , 625.0 , 950.0 , 44.0 , 27 , - , 49 , 44.5 , 42.5 , 38.5 , 495 , 45 , 1.120 ,
2011-11-14 16:22:07 , 625.0 , 950.0 , 43.0 , 27 , - , 47 , 43.5 , 42.0 , 38.0 , 495 , 45 , 1.120 ,
2011-11-14 16:22:08 , 625.0 , 950.0 , 43.0 , 27 , - , 45 , 43.5 , 43.0 , 39.5 , 495 , 61 , 1.120 ,
2011-11-14 16:22:09 , 625.0 , 950.0 , 44.0 , 27 , - , 44 , 43.5 , 42.5 , 39.5 , 495 , 45 , 1.120 ,
2011-11-14 16:22:10 , 625.0 , 950.0 , 43.0 , 27 , - , 43 , 43.5 , 42.0 , 39.0 , 495 , 45 , 1.120 ,
2011-11-14 16:22:11 , 625.0 , 950.0 , 43.0 , 27 , - , 42 , 43.0 , 42.0 , 38.0 , 495 , 45 , 1.120 ,
2011-11-14 16:22:12 , 625.0 , 950.0 , 42.0 , 27 , - , 44 , 43.0 , 42.0 , 38.0 , 495 , 45 , 1.120 ,
2011-11-14 16:22:13 , 625.0 , 950.0 , 43.0 , 27 , - , 47 , 43.0 , 42.5 , 38.0 , 495 , 45 , 1.120 ,
2011-11-14 16:22:14 , 625.0 , 950.0 , 43.0 , 27 , - , 49 , 43.5 , 42.5 , 38.5 , 495 , 45 , 1.120 ,
2011-11-14 16:22:15 , 625.0 , 950.0 , 43.0 , 27 , - , 50 , 43.0 , 42.0 , 38.5 , 495 , 45 , 1.120 ,


Though, can you explain to me why the OpenCL BRP4 app runs at below normal priority?
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111245 - Posted: 14 Nov 2011 | 16:15:21 UTC - in response to Message 111244.

Not yet.

What's <avg_ncpus> and <max_ncpus> set to in sched_reply?

BM

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111246 - Posted: 14 Nov 2011 | 17:17:23 UTC - in response to Message 111245.
Last modified: 14 Nov 2011 | 17:21:36 UTC

<avg_ncpus>0.150000</avg_ncpus>
<max_ncpus>1.000000</max_ncpus>

The task is running, but going slowly. At present rate, with it taking 2 hours to do 22%, it'll take ~9 hours to do the whole 100%. A tad slow, when compared to a similar task at Einstein done on my CPU takes ~10 hours.

I've set BOINC to use 3 CPUs, leaving one open for the GPU and everything else non-BOINC.
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111248 - Posted: 15 Nov 2011 | 10:09:31 UTC - in response to Message 111246.
Last modified: 15 Nov 2011 | 10:16:07 UTC

The task is running, but going slowly. At present rate, with it taking 2 hours to do 22%, it'll take ~9 hours to do the whole 100%. A tad slow, when compared to a similar task at Einstein done on my CPU takes ~10 hours.


You're mistaken, these are not similar tasks. If you want to compare GPU vs CPU performance you need to crunch a few of the tasks over here on your CPU. Also, the actual GPU performance varies quite a bit from the Radeon HD 4xxx to the 5xxx and the 6xxx series.

Some approx. runtime numbers:
HD 6970 (Windows): 1 hour
HD 6970 (Linux, no PGO): 1.2 hours
HD 5770 (Mac OS 10.7): 7.2 hours

You see that your 4xxx GPU blends in just fine...


Best,
Oliver

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111249 - Posted: 15 Nov 2011 | 10:13:32 UTC - in response to Message 111226.
Last modified: 18 Nov 2011 | 14:00:12 UTC

Current status as of v1.19:
* Windows: no known issues
* Mac OS X: no known issues
* Linux: might use full CPU core (known bug/feature in AMD Catalyst drivers)

Requirements:
* AMD GPU (NVIDIA will be supported by our CUDA apps for the time being)
* OpenCL 1.0
* BOINC: at least 6.13.10
* Mac OS: 10.7 only


Oliver

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111250 - Posted: 15 Nov 2011 | 11:03:58 UTC - in response to Message 111244.

Though, can you explain to me why the OpenCL BRP4 app runs at below normal priority?


Because it uses less than one CPU (avg_ncpus < 1).

If it would use a full CPU core (like CPU Apps do), it would run in "Idle" priority.

BM

TRuEQ & TuVaLu
Send message
Joined: 11 Sep 06
Posts: 74
Credit: 118,590
RAC: 236
Message 111257 - Posted: 15 Nov 2011 | 18:47:59 UTC - in response to Message 111224.

So? Just because Lunatics built a 3rd party application for Seti only that does the OpenCL detection, does not mean that the BOINC you're running does the OpenCL detection. This project uses the OpenCL detection built into the 6.13 clients to determine all OpenCL capable co-processors.

Ok. Fine. Clear.
I'll try 6.13 some later, despite the fact that I don't like > 6.10 interface without Messages tab.


You are right.
The message tab is annoying.

TRuEQ & TuVaLu
Send message
Joined: 11 Sep 06
Posts: 74
Credit: 118,590
RAC: 236
Message 111276 - Posted: 16 Nov 2011 | 22:53:41 UTC

I am now using ATI5850 Driver11.9 and SDK2.5
Boinc version 6.13.12 and app 1.19
Progress 38% in 4hours ; hopefully completed in the morning...
GPU load is rarely above 0% sometimes it uses 100% for half a second
Running on win Vista 32bit

Now I will sleep. Goodnite all
____________

robertmiles
Send message
Joined: 16 Nov 11
Posts: 17
Credit: 594,905
RAC: 2,762
Message 111277 - Posted: 16 Nov 2011 | 23:54:21 UTC
Last modified: 16 Nov 2011 | 23:57:13 UTC

Could you mention a little more about what GPUs are suitable for the current A@H application? What I've seen so far suggests that it may require AMD/ATI GPUs, and is not ready for Nvidia GPUs yet.

For example, must the GPU have hardware support for double precision?

Also, is the current CPU application compiled from OpenCL, somewhat similar the the way the current POEM@HOME application is?

I've seen rumors that the 7.0 version of BOINC will be available soon, with at least some support for OpenCL GPU workunits, but almost nothing more on just how much OpenCL GPU support.

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111278 - Posted: 17 Nov 2011 | 0:49:45 UTC - in response to Message 111277.
Last modified: 17 Nov 2011 | 0:50:22 UTC

For example, must the GPU have hardware support for double precision?

No, the floating point speed of the GPU has nothing to do with OpenCL.
For AMD/ATI GPUs, only the GPUs at the requirements page on ATI's SDK web site, will have OpenCL capability. In numbers, from the HD4300 onwards and every GPU thereafter, plus the Firepor V3800 and everyone thereafter.

For nVidia it's easier, there any GPU capable of CUDA will be able to do OpenCL. In numbers, from the GEFORCE 8300 onwards and every GPU since.

I've seen rumors that the 7.0 version of BOINC will be available soon, with at least some support for OpenCL GPU workunits, but almost nothing more on just how much OpenCL GPU support.

BOINC 7 (being tested as BOINC 6.13) will support both ATI and nVidia GPUs, with a possibility for Intel GPUs to follow once Intel gets an API out. But really, BOINC doesn't need to support OpenCL, as it doesn't do any of the work. The science application will need to do OpenCL.

All that BOINC will do is detect if your GPU is OpenCL capable and if so, which version it is compliant to. Just as it already did detect if the nVidia GPU is CUDA capable and the ATI GPU CAL/Brook+ capable.
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111281 - Posted: 17 Nov 2011 | 10:26:55 UTC - in response to Message 111277.
Last modified: 17 Nov 2011 | 10:30:43 UTC

What I've seen so far suggests that it may require AMD/ATI GPUs, and is not ready for Nvidia GPUs yet.


As a starter please see this post of mine.

Apart from that, we do have a dedicated CUDA app so we don't focus on running our OpenCL app on NVIDIA devices for the time being.

Memory-wise 512 MB cards might be already too "small" for the current app. We always strive to provide apps that run on as many GPUs as possible but there are technical and algorithmic constraints that simply don't allow us to reduce the GPU memory footprint of the OpenCL app right now.


For example, must the GPU have hardware support for double precision?


No.


Cheers,
Oliver

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111282 - Posted: 17 Nov 2011 | 10:40:30 UTC - in response to Message 111281.

Apart from that, we do have a dedicated CUDA app so we don't focus on running our OpenCL app on NVIDIA devices for the time being.

On that though, the whole point of OpenCL is to make a new standard in GPGPU and other processors, where presumably one code can work on many pieces of hardware. So in essence, you ought to be able to use the OpenCL app for an ATI GPU on an nVidia GPU, an Intel GPU, an AMD or Intel CPU, and presumably on any Tesla you throw it at. In a perfect world...
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

TRuEQ & TuVaLu
Send message
Joined: 11 Sep 06
Posts: 74
Credit: 118,590
RAC: 236
Message 111283 - Posted: 17 Nov 2011 | 11:54:02 UTC

1 task completed!!!

:)

http://albert.phys.uwm.edu/results.php?hostid=1353&offset=0&show_names=0&state=2&appid=

http://albert.phys.uwm.edu/result.php?resultid=34526


____________

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111284 - Posted: 17 Nov 2011 | 13:16:03 UTC - in response to Message 111282.


On that though, the whole point of OpenCL is to make a new standard in GPGPU and other processors, where presumably one code can work on many pieces of hardware. So in essence, you ought to be able to use the OpenCL app for an ATI GPU on an nVidia GPU, an Intel GPU, an AMD or Intel CPU, and presumably on any Tesla you throw it at. In a perfect world...


Sorry, that's a misconception. While the OpenCL framework itself indeed facilitates the general approach one still has to take into account the differences of each architecture. OpenCL doesn't help you to overcome that out of the box. While a single codebase should run on most OpenCL capable devices, it won't do so efficiently (e.g. thread parallelism vs. data parallelism on NVIDIA and AMD GPUs). Thus it boils down to the fact that you can stick to a single framework for coding/building/deployment but you still have to write architecture-aware implementations for an efficient solution.


Best,
Oliver

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111285 - Posted: 17 Nov 2011 | 13:36:29 UTC - in response to Message 111284.

Not just that. On my travels, I've seen that due to ATI's different approach to GPGPU, from Close-to-Metal through BrookGPU to OpenCL, that their drivers are mature enough to support OpenCL on their hardware.

However, for you as a project engineer to be able to make it work on all the different hardware, you need a different API, SDK, IDE, debugger and probably compiler for each different piece of hardware by vendor. So one of each for ATI, one of each for nVidia, one of each for Intel, etc.

Hence why I ended the previous comment with "In a perfect world..." :-)
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111286 - Posted: 17 Nov 2011 | 13:58:54 UTC - in response to Message 111285.

It's not that bad after all. However, while our current OpenCL app runs on NVIDIA GPUs indeed it doesn't yet produce valid results on all their models. Most likely due to some subtle differences (optimizations) in NVIDIA's runtime OpenCL compiler... We're investigating but we'll focus on OpenCL@AMD first since we got a CUDA app anyway...

Oliver

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111290 - Posted: 17 Nov 2011 | 21:38:09 UTC

Looks like the OpenCL app for ATI can't validate against the CPU app: http://albert.phys.uwm.edu/workunit.php?wuid=11889

Oh and thanks for making the initial replication 3, but leaving the third one unsent. ;-)
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111291 - Posted: 17 Nov 2011 | 21:41:41 UTC

Separate post for this, can you stop sending ATI OpenCL work to non-6.13 clients?
I see one of my other tasks is waiting for a wingman, because two people got "Failed to get OpenCL platform/device info from BOINC (error: -161)!" (6.12.34 and 6.10.60), the third had it running on the failed CUDA app.

Since you're checking for the information from the client, you may just as well make the client mandatory.
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111298 - Posted: 18 Nov 2011 | 9:25:14 UTC - in response to Message 111291.

Separate post for this, can you stop sending ATI OpenCL work to non-6.13 clients?
I see one of my other tasks is waiting for a wingman, because two people got "Failed to get OpenCL platform/device info from BOINC (error: -161)!" (6.12.34 and 6.10.60), the third had it running on the failed CUDA app.

Since you're checking for the information from the client, you may just as well make the client mandatory.


Guess what we do already? :-)
We're aware of that problem but there's no fix right now... Still investigating...


Oliver

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111299 - Posted: 18 Nov 2011 | 9:27:14 UTC - in response to Message 111290.
Last modified: 18 Nov 2011 | 9:27:39 UTC

Looks like the OpenCL app for ATI can't validate against the CPU app: http://albert.phys.uwm.edu/workunit.php?wuid=11889

We're still tuning the validator. That's part of this test, we need to sample the numerical stability/inaccuracies across various platforms and devices...


Oh and thanks for making the initial replication 3, but leaving the third one unsent. ;-)

Hm?

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111300 - Posted: 18 Nov 2011 | 11:13:58 UTC - in response to Message 111299.


Oh and thanks for making the initial replication 3, but leaving the third one unsent. ;-)

Hm?

Sneaky... now it's sent to a third party.
It wasn't for several hours last night, before I made the comment... I just needed more patience then. ;-)
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111301 - Posted: 18 Nov 2011 | 11:20:03 UTC - in response to Message 111298.

Guess what we do already? :-)
We're aware of that problem but there's no fix right now... Still investigating...

One further request then, decrease the deadline here? I see wingmen who have seemingly abandoned the cause, so now I'll have to wait 14 days before anything is resent. That's too much for a test project. Just make the deadline 3 to 5 days, that's time enough to crunch the work, even on a multi-project system, and send it back.

You want the results back fast, don't you? :-)
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111302 - Posted: 18 Nov 2011 | 13:59:47 UTC - in response to Message 111298.


We're aware of that problem but there's no fix right now... Still investigating...


Should work now...

robertmiles
Send message
Joined: 16 Nov 11
Posts: 17
Credit: 594,905
RAC: 2,762
Message 111312 - Posted: 19 Nov 2011 | 2:10:46 UTC - in response to Message 111278.
Last modified: 19 Nov 2011 | 2:20:40 UTC


I've seen rumors that the 7.0 version of BOINC will be available soon, with at least some support for OpenCL GPU workunits, but almost nothing more on just how much OpenCL GPU support.

BOINC 7 (being tested as BOINC 6.13) will support both ATI and nVidia GPUs, with a possibility for Intel GPUs to follow once Intel gets an API out. But really, BOINC doesn't need to support OpenCL, as it doesn't do any of the work. The science application will need to do OpenCL.

All that BOINC will do is detect if your GPU is OpenCL capable and if so, which version it is compliant to. Just as it already did detect if the nVidia GPU is CUDA capable and the ATI GPU CAL/Brook+ capable.


Looks adequate for computers with only one GPU.

On other BOINC projects, I've seen one thing mentioned that BOINC really needs to do if you want to be able to run both an OpenCL GPU workunit and a non-OpenCL GPU workunit at the same time on computers with more than one GPU - provide a standard way of mapping the way BOINC identifies which GPU it has assigned to a workunit to the way OpenCL identifies those GPUs. Otherwise, it's likely that those two workunits will often attempt to use the same GPU at the same time, with both failing.

Except for this, I've seen nothing on features for OpenCL that need to be in BOINC instead of the application. Adding more features would be helpful, but not required, for BOINC projects wanting OpenCL GPU workunits.

I currently don't have any AMD/ATI GPUs, so I won't be able to much for Albert@Home soon, at least until I find time to measure the size requirements for putting such a graphics board into my newest computer.

robertmiles
Send message
Joined: 16 Nov 11
Posts: 17
Credit: 594,905
RAC: 2,762
Message 111329 - Posted: 20 Nov 2011 | 21:03:17 UTC - in response to Message 111188.

Albert@home is still an unofficial, non-public test project. Don't expect anything to work here. The only type of work Albert@home is currently sending out is for a highly experimental BRP4 OpenCL application.


I've recently received some SSE and CUDA workunits. Should I consider them outdated workunits left over from Einstein@Home, or should I consider them Nvidia and CPU versions of the OpenCL workunits?

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111336 - Posted: 21 Nov 2011 | 10:30:26 UTC - in response to Message 111329.
Last modified: 21 Nov 2011 | 10:32:54 UTC

I've recently received some SSE and CUDA workunits. Should I consider them outdated workunits left over from Einstein@Home, or should I consider them Nvidia and CPU versions of the OpenCL workunits?


The latter. All BRP4 work units on albert can be considered equal and it doesn't matter whether they're run on a CPU, a CUDA device or via OpenCL. This allows us to check cross-platform/device validation and get a feeling for their relative performances. It also allows us to test BOINC's behavior in mixed-GPU (NVIDIA and AMD GPUs in one box) setups - so far the results look good.


Best,
Oliver

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111385 - Posted: 21 Nov 2011 | 23:24:51 UTC

ATI OpenCL and CUDA32 don't validate together either.
http://albert.phys.uwm.edu/workunit.php?wuid=12465
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111389 - Posted: 22 Nov 2011 | 10:26:53 UTC - in response to Message 111385.
Last modified: 22 Nov 2011 | 10:28:21 UTC

ATI OpenCL and CUDA32 don't validate together either.
http://albert.phys.uwm.edu/workunit.php?wuid=12465


Not universally true but dependent on the NVIDIA GPU. My observation: a GTX 285 (Tesla) does, a GTX 580 (Fermi) doesn't. We'll investigate and will most likely tune the validator. It just takes some time since we're running low on manpower right now and got the download server issue at first prio.


Stay tuned,
Oliver

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111403 - Posted: 23 Nov 2011 | 13:14:40 UTC - in response to Message 111389.

Another one then, http://albert.phys.uwm.edu/workunit.php?wuid=11889.
That's one OpenCL vs two CPU, guess whose work was deemed invalid? ;-)
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111405 - Posted: 24 Nov 2011 | 12:29:54 UTC

Also looks like you need more stringent GPU capability detection.
this host has an OpenCL capable GPU, but for some reason on has 384MB memory. Thus all its tasks err.

As far as I know you can set up the scheduler to check for OpenCL capability, memory on the GPU, BOINC client used, etc. GPUs like this need to be locked out, before they're sent work as else it's using unnecessary bandwidth and confuses the user to no end.
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111407 - Posted: 24 Nov 2011 | 15:07:05 UTC - in response to Message 111405.

you can set up the scheduler to check for OpenCL capability, memory on the GPU, BOINC client used, etc. GPUs like this need to be locked out, before they're sent work as else it's using unnecessary bandwidth and confuses the user to no end.


Yep, already on our TODO list (mostly done, memory requirements pending)...

Oliver

Profile x3mEn
Send message
Joined: 21 Jun 11
Posts: 9
Credit: 10,000
RAC: 0
Message 111411 - Posted: 24 Nov 2011 | 20:34:40 UTC
Last modified: 24 Nov 2011 | 20:35:55 UTC

HD4890: 1WU = 62,806.60 sec = 17.5 hours and... 500.00 credits only, I guess? )
I understand, that A@H is a test project, but for my opinion it's a waste of time for HD4890.
Don't say me that HD4890 is a low-end GPU )
HD4890 brought me 1Ms in Collatz, MilkyWay, Moo! Wrapper.
In these projects HD4890 is more effective than GTX460, which usually spend only 1 hour for the same WU at Einstein@Home.

Profile [SETI.USA]Tank_Master
Avatar
Send message
Joined: 22 Jan 05
Posts: 17
Credit: 296,000
RAC: 0
Message 111415 - Posted: 25 Nov 2011 | 18:26:21 UTC

credits for this alpha project are 500 per WU that validates, irregardless of how long it takes to run. Also, last I read this project is soon not going to export stats, if they haven't yet turned that off.
____________

Infusioned
Send message
Joined: 11 Feb 05
Posts: 45
Credit: 149,000
RAC: 0
Message 111416 - Posted: 26 Nov 2011 | 1:08:07 UTC
Last modified: 26 Nov 2011 | 1:08:50 UTC

CAL ATI Radeon HD 4700/4800 (RV740/RV770) (1024MB) driver: 0.1

It seems Ageless and I have the same video card. Thus far, I haven't been able to validate a single WU:


http://albert.phys.uwm.edu/workunit.php?wuid=12534
http://albert.phys.uwm.edu/workunit.php?wuid=12549
http://albert.phys.uwm.edu/workunit.php?wuid=12550
http://albert.phys.uwm.edu/workunit.php?wuid=12551
http://albert.phys.uwm.edu/workunit.php?wuid=12294



Also (as mentioned above) it looks like there are problems with the nVidia cards validating with themselves (wu 12294):

The GTX 260 validates with the GTX 590, but not with the GTX 580.
____________

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111417 - Posted: 26 Nov 2011 | 11:51:27 UTC

ATIOpenCL v1.19 doesn't validate with another ATIOpenCL v1.19 either:
TankMaster (HD 69xx) versus me (HD 4850) == http://albert.phys.uwm.edu/workunit.php?wuid=12850 (Completed, validation inconclusive) with a third task going out.

The next two tasks I have will be against other HD 48xx Radeons. I aborted all tasks that had CUDA or CPU as wingmen. They don't validate anyway so it's kinda useless to spend electrons on that at this time. Perhaps later again when there's been time to fine tune the validator even further.
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111418 - Posted: 26 Nov 2011 | 18:28:07 UTC - in response to Message 111417.

ATIOpenCL v1.19 doesn't validate with another ATIOpenCL v1.19 either:
TankMaster (HD 69xx) versus me (HD 4850) == http://albert.phys.uwm.edu/workunit.php?wuid=12850 (Completed, validation inconclusive) with a third task going out.

After 5+ hours the third task finally went out, now to a CUDA GPU. I can tell you up front that this isn't going to validate.

Is HR a thought?
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111419 - Posted: 27 Nov 2011 | 9:24:14 UTC
Last modified: 27 Nov 2011 | 9:31:06 UTC

Hmmm... Forget HR. Forget using any HD4850 it seems, or any OpenCL 1.0 only GPUs? I noticed that Infusioned now does have credit, but he did so with a CPU task, not his HD4850.

http://albert.phys.uwm.edu/workunit.php?wuid=12832 is my HD4850 versus Alexone's (presumably) HD4850 or 4870. Validation is inconclusive.
The difference here being my Intel i3 versus his AMD XII perhaps? Or my 8GB of RAM versus his 4GB?

A third task went out to a CUDA, we all know how that's going to end. I'm ending my extra electron burn here, I aborted the other similar task, since it would put my GPU against Alexone's again anyway. Either my GPU is broken, or I use a driver version your app doesn't like (Catalysts 11.6), or it's the direction of the ley lines crossing under my computer room that break things. I stop testing for now and turn my attention fully to Skyrim. ;-)
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111421 - Posted: 28 Nov 2011 | 13:11:29 UTC - in response to Message 111407.

you can set up the scheduler to check for OpenCL capability, memory on the GPU, BOINC client used, etc. GPUs like this need to be locked out, before they're sent work as else it's using unnecessary bandwidth and confuses the user to no end.

Yep, already on our TODO list (mostly done, memory requirements pending)...


Due to a bug in the BOINC client this has to wait until at least 6.13.13 (should be released soon).

Stay tuned,
Oliver

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111422 - Posted: 28 Nov 2011 | 13:16:25 UTC - in response to Message 111419.

Either my GPU is broken, or I use a driver version your app doesn't like (Catalysts 11.6)


You need at least Catalyst 11.7 (8.872) since we use APP SDK 2.5 until further notice.

Oliver

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111437 - Posted: 29 Nov 2011 | 21:22:46 UTC - in response to Message 111421.

Due to a bug in the BOINC client this has to wait until at least 6.13.13 (should be released soon).

Looks like there ain't gonna be a 6.13.13, but that instead it's going to be BOINC 7.0.x :-)
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111442 - Posted: 30 Nov 2011 | 10:53:45 UTC - in response to Message 111422.

Either my GPU is broken, or I use a driver version your app doesn't like (Catalysts 11.6)


You need at least Catalyst 11.7 (8.872) since we use APP SDK 2.5 until further notice.


Darn, we just updated the validator (using less strict tolerances). We won't know whether the 11.6 you used before updating to 11.7 played a role in your tasks being considered invalid :-/

Oliver

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111443 - Posted: 30 Nov 2011 | 11:22:42 UTC - in response to Message 111442.
Last modified: 30 Nov 2011 | 11:23:43 UTC

What's it you're saying, Oliver? You want me to return to 11.6?
That's no problem. I'm here to test work for you, remember? Totally not in it for the credits. :-)

I'll run the one ATIOpenCL v1.19 I have started already to completion on 11.7, then switch back to 11.6
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111446 - Posted: 30 Nov 2011 | 14:50:16 UTC - in response to Message 111443.

What's it you're saying, Oliver? You want me to return to 11.6?


No, you don't have to. You may of course do so as it wouldn't require a full core anymore. Maybe the relaxed validator settings let your tasks through now (we had to tune it anyway)...

Thanks for supporting this effort!

Oliver

steffen_moeller
Send message
Joined: 9 Feb 05
Posts: 13
Credit: 395,435
RAC: 0
Message 111450 - Posted: 30 Nov 2011 | 22:39:46 UTC

Hello,

HD5770 fails after ~30 seconds
http://albert.phys.uwm.edu/result.php?resultid=51833

[19:27:27][4969][ERROR] Error during OpenCL FFT setup (error: -5)
[19:27:27][4969][ERROR] Demodulation failed (error: 2021)!


It is Debian unstable, fglrx 11.11, amd-app 2.5, BOINC 6.13.12. The installation of amd-app is essential. Without it, neither primegrid nor albert@H binaries can be executed.

Cheers,
Steffen
____________

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111456 - Posted: 1 Dec 2011 | 10:24:54 UTC - in response to Message 111450.

[19:27:27][4969][ERROR] Error during OpenCL FFT setup (error: -5)
[19:27:27][4969][ERROR] Demodulation failed (error: 2021)!



Sorry, not enough GPU memory.


It is Debian unstable, fglrx 11.11, amd-app 2.5, BOINC 6.13.12. The installation of amd-app is essential. Without it, neither primegrid nor albert@H binaries can be executed.


Why? What happens if you don't install it? The runtime libOpenCL.so should already be installed with the driver (as of 11.9 IIRC). Hm, maybe you still need to register the ICD...

Oliver

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111471 - Posted: 2 Dec 2011 | 11:55:22 UTC - in response to Message 111446.
Last modified: 2 Dec 2011 | 11:55:52 UTC

What's it you're saying, Oliver? You want me to return to 11.6?


No, you don't have to. You may of course do so as it wouldn't require a full core anymore.

I did return to 11.6, but left SDK 2.5 on my system.
Immediately the use of the one core went back to 02-10%, instead of the full 25% it was using before that on 11.7.

Skyrim is now also back to being a bit more stable. I had many more CTDs (crash to desktop) with 11.7 than I have had with 11.6; where with 11.6 it would be perhaps once a day, with 11.7 it was 7 times yesterday alone. So after the last CTD I reverted back to 11.6 ;-)

So... will need an eye on what http://albert.phys.uwm.edu/workunit.php?wuid=13760 will go do. It's me versus two CUDA that can't decide between themselves who's right. I doubt I'll be the clincher for them. ;-)
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111477 - Posted: 3 Dec 2011 | 8:57:21 UTC - in response to Message 111471.
Last modified: 3 Dec 2011 | 9:01:34 UTC

So... will need an eye on what http://albert.phys.uwm.edu/workunit.php?wuid=13760 will go do. It's me versus two CUDA that can't decide between themselves who's right. I doubt I'll be the clincher for them. ;-)

As I thought, I wasn't the clincher. That task was crunched with catalysts 11.6

Looks like these don't validate to ATIOpenCL yet. Further fine tuning of the validator? I have suspended the last 3 tasks I have until I hear more. Although, I could of course abort them and see what the newer scheduler in 7.0.2 thinks I should get for loads of amounts of work. With a REC of 11000, too much anyway. ;-)

Remember, work that got credit is bad work. Work that didn't validate is good work. It tells the developers here their validator isn't ready yet to work in the outside angry world. :)

(and really developers, how many set it and forget it users do you have on here? ;))
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111480 - Posted: 3 Dec 2011 | 14:15:48 UTC - in response to Message 111471.

So... will need an eye on what http://albert.phys.uwm.edu/workunit.php?wuid=13760 will go do.

I see it was another dud. It took another CUDA32 to get validated, me and another CUDA32 finishing outside the points.

I have 4 new tasks.
12537 is paired against a BRP3cuda32. I think I won't even try.
15980 is two ATIOpenCL. I wish driver detection here was working so I could make a reasonable guess as to what driver the other guy is using. Driver: 0.1 is useless.
15971 also has me paired against a BRP3cuda32.
15943 also has me paired against a BRP3cuda32.

15980 it is then.
Can the developers in the mean time fix the driver detection?
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile x3mEn
Send message
Joined: 21 Jun 11
Posts: 9
Credit: 10,000
RAC: 0
Message 111484 - Posted: 3 Dec 2011 | 22:29:40 UTC

Completed, marked as invalid
WTF?

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111485 - Posted: 3 Dec 2011 | 22:37:00 UTC - in response to Message 111484.

Easily explained: http://albert.phys.uwm.edu/workunit.php?wuid=12082
You're ATIOpenCL, you were paired against two CUDA which walked away with the credits. The validator isn't tuned enough yet to see that these results may well have been the same.
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Infusioned
Send message
Joined: 11 Feb 05
Posts: 45
Credit: 149,000
RAC: 0
Message 111487 - Posted: 4 Dec 2011 | 18:30:00 UTC - in response to Message 111485.
Last modified: 4 Dec 2011 | 18:31:50 UTC

This is from the Seti@Home Beta message boards (developing an ATi OpenCL App):


http://setiweb.ssl.berkeley.edu/beta/forum_thread.php?id=1867


Please, update your hosts with this builds. ATi and NV builds got speed increase, HD5-version added, NV version added.

CPU builds got some updates too.

Known issues:
1) As with OpenCL AstroPulse, last driver version from both vendors show increased CPU usage. AMD already aknowledged this issue and promised to fix in new Catalyst releases, NV still keeping silence about this issue.

2) OpenCL NV app can silently (i.e., w/o errors in stderr) produce incorrect results (overflows). Again, situation resembles NV AstroPulse rev521 case and usually means too long kernel call. Why NV OpenCL runtime doesn't report error code for kernel enqueue runtime call - no idea. But low-end NV GPUs could be not capable to use this app. This testing should determine GPU requirements for NV app too.



Thanks, I downloaded the WU and confirm the difference.

At least part of the problem is a long-standing issue which is seen in SETI@home Enhanced between the stock CPU and stock CUDA applications too. The most efficient order to do the various searches is different for CPU and GPU, so for this kind of task with a lot of potential signals the CPU finds a different subset than the GPU does. Eric Korpela is aware of the issue and maybe if he or Jeff Cobb get a chance the Validator code will be revised to judge quick overflow results differently.

The way that's pertinent to this case is the MB7 r365 sources are primarily targeting openCL builds so the Autocorr and Spike searches are done in a different order than stock 6.97.

At first glance, that doesn't seem to explain all the differences between the MB7 CPU result and 6.97 result. I need to analyze more to really be sure what the data indicates.

FWIW, those results will be considered "weakly similar" and both get credit when the third result is returned.



Maybe this is part of the issue regarding validation?
____________

TRuEQ & TuVaLu
Send message
Joined: 11 Sep 06
Posts: 74
Credit: 118,590
RAC: 236
Message 111488 - Posted: 4 Dec 2011 | 18:40:43 UTC - in response to Message 111487.

This is from the Seti@Home Beta message boards (developing an ATi OpenCL App):


http://setiweb.ssl.berkeley.edu/beta/forum_thread.php?id=1867


Please, update your hosts with this builds. ATi and NV builds got speed increase, HD5-version added, NV version added.

CPU builds got some updates too.

Known issues:
1) As with OpenCL AstroPulse, last driver version from both vendors show increased CPU usage. AMD already aknowledged this issue and promised to fix in new Catalyst releases, NV still keeping silence about this issue.

2) OpenCL NV app can silently (i.e., w/o errors in stderr) produce incorrect results (overflows). Again, situation resembles NV AstroPulse rev521 case and usually means too long kernel call. Why NV OpenCL runtime doesn't report error code for kernel enqueue runtime call - no idea. But low-end NV GPUs could be not capable to use this app. This testing should determine GPU requirements for NV app too.



Thanks, I downloaded the WU and confirm the difference.

At least part of the problem is a long-standing issue which is seen in SETI@home Enhanced between the stock CPU and stock CUDA applications too. The most efficient order to do the various searches is different for CPU and GPU, so for this kind of task with a lot of potential signals the CPU finds a different subset than the GPU does. Eric Korpela is aware of the issue and maybe if he or Jeff Cobb get a chance the Validator code will be revised to judge quick overflow results differently.

The way that's pertinent to this case is the MB7 r365 sources are primarily targeting openCL builds so the Autocorr and Spike searches are done in a different order than stock 6.97.

At first glance, that doesn't seem to explain all the differences between the MB7 CPU result and 6.97 result. I need to analyze more to really be sure what the data indicates.

FWIW, those results will be considered "weakly similar" and both get credit when the third result is returned.



Maybe this is part of the issue regarding validation?



I think the problem might be in the FFT and CuFFT variations....
But I am not sure. I saw something about such a discussion in another thread...


____________

Ver Greeneyes
Send message
Joined: 18 Nov 11
Posts: 6
Credit: 530,237
RAC: 1,301
Message 111489 - Posted: 4 Dec 2011 | 21:38:31 UTC - in response to Message 111487.

Maybe this is part of the issue regarding validation?

Nvidia cards are still using the CUDA app though, not OpenCL.

Infusioned
Send message
Joined: 11 Feb 05
Posts: 45
Credit: 149,000
RAC: 0
Message 111490 - Posted: 5 Dec 2011 | 0:44:20 UTC - in response to Message 111489.

It doesn't mean that NVidia cards don't silently generate overflows in general.

Also, the second post I quoted details how even the CUDA app was not validating against a CPU due to the order of calculations and the validator needed tweaking to regard them as weakly similar.
____________

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111491 - Posted: 5 Dec 2011 | 9:18:38 UTC - in response to Message 111480.


Can the developers in the mean time fix the driver detection?


Sorry, not up to us. I'm not sure whether the BOINC devs can do anything about it since this might even be an AMD driver issue.

Oliver

TRuEQ & TuVaLu
Send message
Joined: 11 Sep 06
Posts: 74
Credit: 118,590
RAC: 236
Message 111493 - Posted: 5 Dec 2011 | 10:23:22 UTC - in response to Message 111491.


Can the developers in the mean time fix the driver detection?


Sorry, not up to us. I'm not sure whether the BOINC devs can do anything about it since this might even be an AMD driver issue.

Oliver



It seems to work in previous boinc manager versions. Maybe not in 6.13xx yet though....
____________

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111494 - Posted: 5 Dec 2011 | 13:47:24 UTC - in response to Message 111491.

Sorry, not up to us. I'm not sure whether the BOINC devs can do anything about it since this might even be an AMD driver issue.

Oliver

Of course it's up to you. Apparently the server back-end version that you use doesn't store the ATI/CAL driver version, but it is sent to you.

From my 7.0.2 sched_request_albert.phys.uwm.edu.xml file:
<coproc_ati>
<count>1</count>
<name>ATI Radeon HD 4700/4800 (RV740/RV770)</name>
<available_ram>1040187392.000000</available_ram>
<have_cal>1</have_cal>
<have_opencl>1</have_opencl>
<req_secs>0.000000</req_secs>
<req_instances>0.000000</req_instances>
<estimated_delay>0.000000</estimated_delay>
<peak_flops>2000000000000.000000</peak_flops>
<CALVersion>1.4.1417</CALVersion>
<target>5</target>
<localRAM>1024</localRAM>
<uncachedRemoteRAM>2047</uncachedRemoteRAM>
<cachedRemoteRAM>2047</cachedRemoteRAM>
<engineClock>625</engineClock>
<memoryClock>950</memoryClock>
<wavefrontSize>64</wavefrontSize>
<numberOfSIMD>10</numberOfSIMD>
<doublePrecision>1</doublePrecision>
<pitch_alignment>256</pitch_alignment>
<surface_alignment>4096</surface_alignment>
<maxResource1DWidth>8192</maxResource1DWidth>
<maxResource2DWidth>8192</maxResource2DWidth>
<maxResource2DHeight>8192</maxResource2DHeight>
<atirt_detected/>
<coproc_opencl>
<name>ATI RV770</name>
<vendor>Advanced Micro Devices, Inc.</vendor>
<vendor_id>4098</vendor_id>
<available>1</available>
<half_fp_config>0</half_fp_config>
<single_fp_config>62</single_fp_config>
<double_fp_config>63</double_fp_config>
<endian_little>1</endian_little>
<execution_capabilities>1</execution_capabilities>
<extensions>cl_amd_fp64 cl_khr_gl_sharing cl_amd_device_attribute_query cl_khr_d3d10_sharing </extensions>
<global_mem_size>1073741824</global_mem_size>
<local_mem_size>16384</local_mem_size>
<max_clock_frequency>625</max_clock_frequency>
<max_compute_units>10</max_compute_units>
<opencl_platform_version>OpenCL 1.1 AMD-APP-SDK-v2.5 (775.2)</opencl_platform_version>
<opencl_device_version>OpenCL 1.0 AMD-APP-SDK-v2.5 (775.2)</opencl_device_version>
<opencl_driver_version>CAL 1.4.1417</opencl_driver_version>
</coproc_opencl>
</coproc_ati>

You can even use the OpenCL information.
Then with the CAL version we can figure out which Catalysts they are. E.g. CAL 1.4.1417 is Catalysts 11.6
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

TRuEQ & TuVaLu
Send message
Joined: 11 Sep 06
Posts: 74
Credit: 118,590
RAC: 236
Message 111495 - Posted: 5 Dec 2011 | 15:05:40 UTC - in response to Message 111494.

Sorry, not up to us. I'm not sure whether the BOINC devs can do anything about it since this might even be an AMD driver issue.

Oliver

Of course it's up to you. Apparently the server back-end version that you use doesn't store the ATI/CAL driver version, but it is sent to you.

From my 7.0.2 sched_request_albert.phys.uwm.edu.xml file:
<coproc_ati>
<count>1</count>
<name>ATI Radeon HD 4700/4800 (RV740/RV770)</name>
<available_ram>1040187392.000000</available_ram>
<have_cal>1</have_cal>
<have_opencl>1</have_opencl>
<req_secs>0.000000</req_secs>
<req_instances>0.000000</req_instances>
<estimated_delay>0.000000</estimated_delay>
<peak_flops>2000000000000.000000</peak_flops>
<CALVersion>1.4.1417</CALVersion>
<target>5</target>
<localRAM>1024</localRAM>
<uncachedRemoteRAM>2047</uncachedRemoteRAM>
<cachedRemoteRAM>2047</cachedRemoteRAM>
<engineClock>625</engineClock>
<memoryClock>950</memoryClock>
<wavefrontSize>64</wavefrontSize>
<numberOfSIMD>10</numberOfSIMD>
<doublePrecision>1</doublePrecision>
<pitch_alignment>256</pitch_alignment>
<surface_alignment>4096</surface_alignment>
<maxResource1DWidth>8192</maxResource1DWidth>
<maxResource2DWidth>8192</maxResource2DWidth>
<maxResource2DHeight>8192</maxResource2DHeight>
<atirt_detected/>
<coproc_opencl>
<name>ATI RV770</name>
<vendor>Advanced Micro Devices, Inc.</vendor>
<vendor_id>4098</vendor_id>
<available>1</available>
<half_fp_config>0</half_fp_config>
<single_fp_config>62</single_fp_config>
<double_fp_config>63</double_fp_config>
<endian_little>1</endian_little>
<execution_capabilities>1</execution_capabilities>
<extensions>cl_amd_fp64 cl_khr_gl_sharing cl_amd_device_attribute_query cl_khr_d3d10_sharing </extensions>
<global_mem_size>1073741824</global_mem_size>
<local_mem_size>16384</local_mem_size>
<max_clock_frequency>625</max_clock_frequency>
<max_compute_units>10</max_compute_units>
<opencl_platform_version>OpenCL 1.1 AMD-APP-SDK-v2.5 (775.2)</opencl_platform_version>
<opencl_device_version>OpenCL 1.0 AMD-APP-SDK-v2.5 (775.2)</opencl_device_version>
<opencl_driver_version>CAL 1.4.1417</opencl_driver_version>
</coproc_opencl>
</coproc_ati>

You can even use the OpenCL information.
Then with the CAL version we can figure out which Catalysts they are. E.g. CAL 1.4.1417 is Catalysts 11.6



Jord has a point here.

And it can help users detect possible wrong drivers when they compare with other users. On the question, "which driver is the best driver for my ATI card?"


____________

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111496 - Posted: 5 Dec 2011 | 19:29:36 UTC - in response to Message 111494.

If you want to be totally confused, it does work on Einstein. See my account there.

You can't blame it on the client version either, it's merely all ATI that is affected. Examples: this host uses 6.10.58 and shows driver 0.1; this host uses 6.12.41 and shows as driver version 0.1
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111498 - Posted: 5 Dec 2011 | 21:40:51 UTC - in response to Message 111491.
Last modified: 5 Dec 2011 | 21:44:41 UTC


Can the developers in the mean time fix the driver detection?


Sorry, not up to us. I'm not sure whether the BOINC devs can do anything about it since this might even be an AMD driver issue.


You may talk about two different things here.

Jord, what exactly do you think should be fixed?

I do see that displaying the ATI CAL/driver version on the host web pages appears broken (on Albert), and possibly the string in the DB is, too.

In the scheduler the ATI "driver" version is stored as "char version[50]" and "int version_num" in coproc_ati, and in "char opencl_driver_version[32]" in opencl_device_prop. These could in principle be used in app_plan(), though we don't check this yet.

BM

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111499 - Posted: 5 Dec 2011 | 22:06:29 UTC - in response to Message 111498.

Jord, what exactly do you think should be fixed?

Showing of the CAL driver version on ATI cards on the account pages here.

Yes, sorry, I said it wrong. I asked for a fix for the driver detection. I know you don't do that, that that's up to the client. I meant that all the driver versions showing for Nvidia GPUs is correct, for all ATI GPUs it's always 0.1, which isn't correct.
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111503 - Posted: 6 Dec 2011 | 9:43:18 UTC - in response to Message 111499.
Last modified: 6 Dec 2011 | 9:51:55 UTC

Hm. On your host page I currently read:

AMD ATI Radeon HD 4700/4800 (RV740/RV770) (1024MB) driver: 1.4.1417


I don't see anything wrong with that. Maybe the previous entry was from an old Client version?

The only thing I changed this morning was (parts of) the web page code, but nothing related to the pages involved here.

BM

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111504 - Posted: 6 Dec 2011 | 10:38:04 UTC - in response to Message 111503.
Last modified: 6 Dec 2011 | 10:41:30 UTC

Well, whatever you did fixed that bug. It now shows on all hosts I checked which CAL driver version these people use. It may have been in there all this time, just not showing as such. So thanks. :)

Btw Oliver, BOINC 7.0.1 (the minimum requirement now) was never compiled and stored anywhere. The minimum minimum anyone could download was 7.0.2; only people who got the source code of branch_7.0 and compiled that on the 30th of November will have 7.0.1, all else will have 6.13.12 or 7.0.2. ;-)
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111505 - Posted: 6 Dec 2011 | 12:30:17 UTC - in response to Message 111504.


Btw Oliver, BOINC 7.0.1 (the minimum requirement now) was never compiled and stored anywhere. The minimum minimum anyone could download was 7.0.2; only people who got the source code of branch_7.0 and compiled that on the 30th of November will have 7.0.1, all else will have 6.13.12 or 7.0.2. ;-)


I know, I'm just using the exact tag/version that contains the required bug fix :-)

Oliver

robertmiles
Send message
Joined: 16 Nov 11
Posts: 17
Credit: 594,905
RAC: 2,762
Message 111510 - Posted: 8 Dec 2011 | 12:40:17 UTC - in response to Message 111498.


Can the developers in the mean time fix the driver detection?


Sorry, not up to us. I'm not sure whether the BOINC devs can do anything about it since this might even be an AMD driver issue.


You may talk about two different things here.

Jord, what exactly do you think should be fixed?

I do see that displaying the ATI CAL/driver version on the host web pages appears broken (on Albert), and possibly the string in the DB is, too.

In the scheduler the ATI "driver" version is stored as "char version[50]" and "int version_num" in coproc_ati, and in "char opencl_driver_version[32]" in opencl_device_prop. These could in principle be used in app_plan(), though we don't check this yet.

BM


I just read something related on the boinc_dev mailing list. It seems that BOINC 7.01 and 7.02 don't allocate enough digits in one of the places they store
ATI version numbers, and are therefore likely to get at least some of the version numbers wrong.

Profile Gaurav Khanna
Send message
Joined: 8 Nov 04
Posts: 12
Credit: 2,281,032
RAC: 552
Message 111511 - Posted: 8 Dec 2011 | 15:15:05 UTC

Since the upgrade to 7.0.2 I'm not getting any work for the GPUs ..

08-Dec-2011 07:20:02 [Albert@Home] Sending scheduler request: To fetch work.
08-Dec-2011 07:20:02 [Albert@Home] Requesting new tasks for ATI
08-Dec-2011 07:20:05 [Albert@Home] Scheduler request completed: got 0 new tasks
08-Dec-2011 07:20:05 [Albert@Home] No tasks sent
08-Dec-2011 07:58:09 [Albert@Home] Sending scheduler request: To fetch work.
08-Dec-2011 07:58:09 [Albert@Home] Requesting new tasks for NVIDIA
08-Dec-2011 07:58:11 [Albert@Home] Scheduler request completed: got 0 new tasks
08-Dec-2011 07:58:11 [Albert@Home] No tasks sent

host details are here:
http://albert.phys.uwm.edu/show_host_detail.php?hostid=1396

Any thoughts?
____________

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111513 - Posted: 8 Dec 2011 | 19:31:03 UTC - in response to Message 111511.

Click on the "last contact" link to see the scheduler logs.

In case of your host I see


2011-12-08 15:42:31.5128 [PID=32492] [version] Checking plan class 'atiOpenCL'
2011-12-08 15:42:31.5128 [PID=32492] [version] GPU RAM required min: 536870912.000000, supplied: 0
2011-12-08 15:42:31.5128 [PID=32492] [version] [AV#459] app_plan() returned false


Hm - looks like there's something wrong with the GPU RAM size reporting. I'll look into that tomorrow.

BM

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111514 - Posted: 8 Dec 2011 | 19:56:28 UTC - in response to Message 111513.
Last modified: 8 Dec 2011 | 19:57:07 UTC

Hm ... according to sched_request the ATI device isn't OpenCL capable:

<coproc_ati> <count>1</count> <name>ATI Radeon HD 5800 series (Cypress)</name> <available_ram>1024458752.000000</available_ram> <have_cal>1</have_cal> <have_opencl>0</have_opencl> <req_secs>0.000000</req_secs> <req_instances>0.000000</req_instances> <estimated_delay>0.000000</estimated_delay> <peak_flops>50000000000.000000</peak_flops> <CALVersion>1.4.815</CALVersion> <target>8</target> <localRAM>1024</localRAM> <uncachedRemoteRAM>1788</uncachedRemoteRAM> <cachedRemoteRAM>508</cachedRemoteRAM> <engineClock>0</engineClock> <memoryClock>0</memoryClock> <wavefrontSize>64</wavefrontSize> <numberOfSIMD>20</numberOfSIMD> <doublePrecision>1</doublePrecision> <pitch_alignment>256</pitch_alignment> <surface_alignment>256</surface_alignment> <maxResource1DWidth>16384</maxResource1DWidth> <maxResource2DWidth>16384</maxResource2DWidth> <maxResource2DHeight>16384</maxResource2DHeight> <atirt_detected/> </coproc_ati>

STE\/E
Send message
Joined: 18 Jan 05
Posts: 142
Credit: 3,507,007
RAC: 181
Message 111515 - Posted: 8 Dec 2011 | 20:14:16 UTC - in response to Message 111514.

Hm ... according to sched_request the ATI device isn't OpenCL capable:
<coproc_ati> <count>1</count> <name>ATI Radeon HD 5800 series (Cypress)</name> <available_ram>1024458752.000000</available_ram> <have_cal>1</have_cal> <have_opencl>0</have_opencl> <req_secs>0.000000</req_secs> <req_instances>0.000000</req_instances> <estimated_delay>0.000000</estimated_delay> <peak_flops>50000000000.000000</peak_flops> <CALVersion>1.4.815</CALVersion> <target>8</target> <localRAM>1024</localRAM> <uncachedRemoteRAM>1788</uncachedRemoteRAM> <cachedRemoteRAM>508</cachedRemoteRAM> <engineClock>0</engineClock> <memoryClock>0</memoryClock> <wavefrontSize>64</wavefrontSize> <numberOfSIMD>20</numberOfSIMD> <doublePrecision>1</doublePrecision> <pitch_alignment>256</pitch_alignment> <surface_alignment>256</surface_alignment> <maxResource1DWidth>16384</maxResource1DWidth> <maxResource2DWidth>16384</maxResource2DWidth> <maxResource2DHeight>16384</maxResource2DHeight> <atirt_detected/> </coproc_ati>


I don't think Cal version 1.4.815 is OpenCL Capable ...

____________
STE\/E

Profile Gaurav Khanna
Send message
Joined: 8 Nov 04
Posts: 12
Credit: 2,281,032
RAC: 552
Message 111516 - Posted: 8 Dec 2011 | 20:49:03 UTC - in response to Message 111515.

Weird .. with BOINC 6.13 the same ATI GPU completed scores of work units:

For example:
http://albert.phys.uwm.edu/result.php?resultid=53182

And I use the ATI GPU for OpenCL development all the time ..
____________

Profile Gaurav Khanna
Send message
Joined: 8 Nov 04
Posts: 12
Credit: 2,281,032
RAC: 552
Message 111517 - Posted: 8 Dec 2011 | 20:50:44 UTC

What exactly does BOINC do to check for OpenCL?
____________

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111518 - Posted: 9 Dec 2011 | 9:35:08 UTC
Last modified: 9 Dec 2011 | 10:06:54 UTC


I don't think Cal version 1.4.815 is OpenCL Capable ...


It is, but you have to install the SDK and register the OpenCL ICD yourself. Installing the driver is not sufficient.

Gaurav, have you changed anything in your setup?

Oliver

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111521 - Posted: 9 Dec 2011 | 10:10:21 UTC - in response to Message 111517.
Last modified: 9 Dec 2011 | 14:03:18 UTC

What exactly does BOINC do to check for OpenCL?


It uses libOpenCL via late binding to query a few basic properties. Using 10.8 (as you do) you have to make sure it's available as it's not installed automatically in the usual library paths. Better still, upgrade your driver to 11.7 (11.11 on Linux!). That version of the Catalyst driver installs the OpenCL runtime all by itself. Our app officially requires at least 11.7 anyway as we build it using SDK 2.5.

Oliver

Profile Gaurav Khanna
Send message
Joined: 8 Nov 04
Posts: 12
Credit: 2,281,032
RAC: 552
Message 111531 - Posted: 9 Dec 2011 | 21:22:42 UTC - in response to Message 111521.

Thanks Oliver. That was helpful. Figured it out .. it was a 64-bit Vs 32-bit issue. The ATI app is 32-bit, but the BOINC client I built is 64-bit. So, I had to make both 32 and 64 bit OpenCL libs available in my library path.

Its working now (finished a result successfully):
http://albert.phys.uwm.edu/result.php?resultid=54521

____________

Profile x3mEn
Send message
Joined: 21 Jun 11
Posts: 9
Credit: 10,000
RAC: 0
Message 111540 - Posted: 11 Dec 2011 | 13:42:59 UTC

Aborting task p2030.20100912.G57.94-00.24.S.b5s0g0.00000_416_1: exceeded elapsed time limit 6394.64 (2800000.00G/432.72G)

3 WUs were aborted after ~6395 sec of run time for the same reason:
p2030.20100912.G57.94-00.24.S.b5s0g0.00000_416_1 - 6,395.56 sec
p2030.20100913.G44.55+00.20.C.b5s0g0.00000_1824_2 - 6,395.53 sec
p2030.20100912.G57.94-00.24.S.b5s0g0.00000_744_0 - 6,395.32 sec

WTF?

Profile Bikeman (Heinz-Bernd Eggenstein)
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 28 Aug 06
Posts: 1447
Credit: 1,757,982
RAC: 2,126
Message 111541 - Posted: 11 Dec 2011 | 16:17:11 UTC - in response to Message 111540.
Last modified: 11 Dec 2011 | 16:18:45 UTC

Hi!

My guess is that the way BOINC calculates the maximum allowed elapsed time for a task to finish has changed with version 7.x. So the workunits would now have to be generated with a higher number of estimated/max floating point ops per task. Because BRP4 workunits are all created equal, no matter on which platform they will eventually get crunched, one value of estimated floating point ops must be used for CPU, NVIDIA/CUDA and ATI/OpenCL (and all supported BOINC client versions). No trivial task.

Until this gets fixed on the server side for new workunits, one could theoretically do a workaround in the client_state.xml file to prevent more WUs erroring out (and wrecking your quota):

- Stop BOINC
- open the client_state.xml file in an editor
- replace occurrences of the following two lines

<rsc_fpops_est>140000000000000.000000</rsc_fpops_est> <rsc_fpops_bound>2800000000000000.000000</rsc_fpops_bound>

with (say)
<rsc_fpops_est>1400000000000000.000000</rsc_fpops_est> <rsc_fpops_bound>28000000000000000.000000</rsc_fpops_bound>


(Actually you should check that only WUs of the Albert@Home project are changed if you use that BOINC instance for other projects as well).


CU
HBE
____________

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111554 - Posted: 13 Dec 2011 | 14:11:49 UTC - in response to Message 111541.

My guess is that the way BOINC calculates the maximum allowed elapsed time for a task to finish has changed with version 7.x.


No, it's worse - it's the new server code ("credit new") that does this. Although we are still using server-assigned credit here on Albert, the run time estimation etc. is handled by the new system. This is part of what we intend to test here.

BM

Profile Bikeman (Heinz-Bernd Eggenstein)
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 28 Aug 06
Posts: 1447
Credit: 1,757,982
RAC: 2,126
Message 111557 - Posted: 13 Dec 2011 | 19:20:44 UTC - in response to Message 111554.

oh....

Were you able to compensate for this effect? That is, will newly generated WUs have a chance to be computed in time without the manual editing I described above?

Thx,
HB
____________

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111562 - Posted: 14 Dec 2011 | 10:46:12 UTC - in response to Message 111557.
Last modified: 14 Dec 2011 | 14:09:37 UTC

So far I didn't get any help from the BOINC devs that I asked for, so I'm still analyzing and digging through the code myself.

Indeed it currently looks like some things have changed on both ends - client and server - and I still need to understand how these changes work together.

BM

Edit:

Hm, apparently rsc_fpops_est and rsc_fpops_bound pass the server code unchanged, that are still the values written by the WUG...

In the Client there is:

max_elapsed_time = rp->wup->rsc_fpops_bound/rp->avp->flops;

where rsc_fpops_bound should be what it gets passed from the server, and avp->flops the "flops" of the "app version". Apparently that's also sent by the server for the App it sends.

Bikeman, what's that in your case? There must be something like

<app_version> ... <flops>62700155339.574905</flops> ... </app_version>

in the client_state.xml you edited.

Profile Bikeman (Heinz-Bernd Eggenstein)
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 28 Aug 06
Posts: 1447
Credit: 1,757,982
RAC: 2,126
Message 111569 - Posted: 14 Dec 2011 | 19:24:29 UTC - in response to Message 111562.
Last modified: 14 Dec 2011 | 19:27:21 UTC

Hi!

Let me see...

<app_version> <app_name>einsteinbinary_BRP4</app_name> <version_num>119</version_num> <platform>i686-pc-linux-gnu</platform> <avg_ncpus>0.150000</avg_ncpus> <max_ncpus>1.000000</max_ncpus> <flops>2402526409719.028320</flops> <plan_class>atiOpenCL</plan_class>


(but that is after I made the editing).

As different users see different cut-off times, I would have expected that this scales with the BOINC benchmark result for the individual graphics card. For mine, it seems to be this:

<coproc_ati> <count>1</count> <name>ATI Radeon HD 5800 series (Cypress)</name> <available_ram>1002438656.000000</available_ram> <have_cal>1</have_cal> <have_opencl>1</have_opencl> <peak_flops>4176000000000.000000</peak_flops>


CU
HB
____________

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111570 - Posted: 14 Dec 2011 | 20:12:58 UTC - in response to Message 111569.
Last modified: 14 Dec 2011 | 20:20:57 UTC

The "peak_flops" is not benchmarked. It's merely a theoretical upper bound derived basically from the number of "cores" times the clock frequency. Thus the peak_flops should be identical for similar devices.

According to the CreditNew description "The scheduler adjusts this [peak_flops], using the elapsed time statistics, to get the app_version.flops_est it sends to the client (from which job durations are estimated)."

BM

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111571 - Posted: 14 Dec 2011 | 20:17:33 UTC - in response to Message 111569.

<app_version> <app_name>einsteinbinary_BRP4</app_name> <version_num>119</version_num> <platform>i686-pc-linux-gnu</platform> <avg_ncpus>0.150000</avg_ncpus> <max_ncpus>1.000000</max_ncpus> <flops>2402526409719.028320</flops> <plan_class>atiOpenCL</plan_class>


Yes, let's see. Local stuff, from the HD4850
<app_version> <app_name>einsteinbinary_BRP4</app_name> <version_num>109</version_num> <platform>windows_intelx86</platform> <avg_ncpus>0.200000</avg_ncpus> <max_ncpus>1.000000</max_ncpus> <flops>33321667311.708328</flops> <plan_class>ATIOpenCL</plan_class> <api_version>6.13.8</api_version>

<coproc_ati> <count>1</count> <name>ATI Radeon HD 5800 series (Cypress)</name> <available_ram>1002438656.000000</available_ram> <have_cal>1</have_cal> <have_opencl>1</have_opencl> <peak_flops>4176000000000.000000</peak_flops>

For the actual card, I'll do the whole shebang:
<coproc_ati> <count>1</count> <name>ATI Radeon HD 4700/4800 (RV740/RV770)</name> <available_ram>1040187392.000000</available_ram> <have_cal>1</have_cal> <have_opencl>1</have_opencl> <peak_flops>2000000000000.000000</peak_flops> <CALVersion>1.4.1607</CALVersion> <target>5</target> <localRAM>1024</localRAM> <uncachedRemoteRAM>2047</uncachedRemoteRAM> <cachedRemoteRAM>2047</cachedRemoteRAM> <engineClock>625</engineClock> <memoryClock>900</memoryClock> <wavefrontSize>64</wavefrontSize> <numberOfSIMD>10</numberOfSIMD> <doublePrecision>1</doublePrecision> <pitch_alignment>256</pitch_alignment> <surface_alignment>4096</surface_alignment> <maxResource1DWidth>8192</maxResource1DWidth> <maxResource2DWidth>8192</maxResource2DWidth> <maxResource2DHeight>8192</maxResource2DHeight> <atirt_detected/> <coproc_opencl> <name>ATI RV770</name> <vendor>Advanced Micro Devices, Inc.</vendor> <vendor_id>4098</vendor_id> <available>1</available> <half_fp_config>0</half_fp_config> <single_fp_config>62</single_fp_config> <double_fp_config>63</double_fp_config> <endian_little>1</endian_little> <execution_capabilities>1</execution_capabilities> <extensions>cl_amd_fp64 cl_khr_gl_sharing cl_amd_device_attribute_query cl_khr_d3d10_sharing </extensions> <global_mem_size>1073741824</global_mem_size> <local_mem_size>16384</local_mem_size> <max_clock_frequency>625</max_clock_frequency> <max_compute_units>10</max_compute_units> <opencl_platform_version>OpenCL 1.1 AMD-APP-SDK-v2.5 (793.1)</opencl_platform_version> <opencl_device_version>OpenCL 1.0 AMD-APP-SDK-v2.5 (793.1)</opencl_device_version> <opencl_driver_version>CAL 1.4.1607</opencl_driver_version> </coproc_opencl> </coproc_ati>


How come my flops are so little on the einsteinbinary? Only 33321667311 for Windows versus Heinz's 2402526409719 for Linux? Yeah I get it, those are estimated flops by the server, but heck. When the peak flops my GPU can do is 2000000000000 flops, or 60 times the estimated amount of flops my GPU is getting, no wonder why a) work is estimated so (s)low and b) I have a TDCF of 9.59! Plonk.
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Bikeman (Heinz-Bernd Eggenstein)
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 28 Aug 06
Posts: 1447
Credit: 1,757,982
RAC: 2,126
Message 111572 - Posted: 14 Dec 2011 | 21:40:30 UTC - in response to Message 111571.

Hi!

This is quite a mission impossible from a BOINC perspective, isn't it? If the initial estimation is off too much, the WUs will ALL get terminated prematurely, and the server will NEVER get a valid result to adjust the estimation of the computation performance, which is actually needed to provide a good estimation for the max elapsed time in the first place!!

I think (dreaming) that it would be best if (optionally) apps could have a self-profiling option. E.g. if the app info stuff defined for the app in question includes a special tag (say <allow_benchm>), then the BOINC client could call the app with all the WU command line items of a workunit, and the additional command line option --benchmark-only . The app would then do a short test run (app developers would know best how to do that) that returns an estimation of the runtime for the entire workunit.

HB



____________

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111573 - Posted: 14 Dec 2011 | 21:53:21 UTC - in response to Message 111572.
Last modified: 14 Dec 2011 | 22:08:06 UTC

This is quite a mission impossible from a BOINC perspective, isn't it? If the initial estimation is off too much, the WUs will ALL get terminated prematurely, and the server will NEVER get a valid result to adjust the estimation of the computation performance, which is actually needed to provide a good estimation for the max elapsed time in the first place!!

If no work ever validates, like in my situation, then the adjustment will also never happen. Why will the work never validate? That's still where I come up clueless - no hint from Oliver either. Where is he by the way, seems like he evaporated. ;-)

... The app would then do a short test run (app developers would know best how to do that) that returns an estimation of the runtime for the entire workunit.

Either that or allow that the user sets the amount of flops for all OpenCL capable hardware. Now it can only be set on CUDA work and then only when using the anonymous platform.

But really, why is the flops count for my hardware set so low on the server, when the peak flops show that it can do way better. For the HD5800 the amount of digits for the peak flops and the 'actual flops' is the same (13). For my HD4850 it's 13 for the peak flops and 11 for the actual flops. There's got to be something wrong there.

I think I'll get work, then exit BOINC, adjust the flops number in client_state.xml, restart BOINC and see if that will make a difference. Maybe that will even validate work.

Edit: hehe, I edited the flops value to <flops>1919403979592.269394</flops>, now all tasks think they'll run for 11 minutes. That's gonna wreck my DCF completely. ;-)

Maybe I should make it even less...
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111574 - Posted: 14 Dec 2011 | 22:11:07 UTC - in response to Message 111572.

This is quite a mission impossible from a BOINC perspective, isn't it? If the initial estimation is off too much, the WUs will ALL get terminated prematurely, and the server will NEVER get a valid result to adjust the estimation of the computation performance, which is actually needed to provide a good estimation for the max elapsed time in the first place!!


Yep, that occurred to me, too.

I already added some code to our plan-class stuff that should allow me to play around with the flops estimation a bit. I intend to do this tomorrow, together with some more analysis of the scheduler code (sched_version.cpp).

BM

Profile Bikeman (Heinz-Bernd Eggenstein)
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 28 Aug 06
Posts: 1447
Credit: 1,757,982
RAC: 2,126
Message 111575 - Posted: 14 Dec 2011 | 22:19:43 UTC
Last modified: 14 Dec 2011 | 22:36:02 UTC

@Jord: Well, because I fiddled manually in client_state.xml, finally a lot of workunits were completed (even validated, not sure if that matters), and my card took ca 4500 sec per WU.

I think your card takes around 33k sec to complete a WU, so no matter what the theoretical (computed) peak performance is, the server would be right to assign a ca 7 times lower performance to your card.

Why is your card slower? Actually the debugging output gives a hint that because of the physical capabilities of your card, the app was forced to re-size the internal processing layout to make it fit. I'm afraid it requires some deep analysis to find out whether this re-sizing is leading to differences in the result of the computation, and whether the differences are tolerable (==>validator adjustment) or intolerable (maybe the re-sizing has a bug).

It would be instructive to see whether the debugging output in question is common to all 4xxx series cards. Ah..but that's a different subject.

@Bernd: my host is now almost out of work (was on nomorework) so I'll give it a try tomorrow or whenever it's ready.

CU
HB
____________

robertmiles
Send message
Joined: 16 Nov 11
Posts: 17
Credit: 594,905
RAC: 2,762
Message 111576 - Posted: 14 Dec 2011 | 22:54:54 UTC

Over on GPUGRID, I saw something about them finding that the HD4xxx series cards had some type of memory access problem - a limit on the amount of graphics memory each processor on the GPU can access before it starts using a much lower bandwidth path to the computer's main memory instead. I haven't kept up with whether more recent software updates have removed this restriction.

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111577 - Posted: 14 Dec 2011 | 23:14:54 UTC - in response to Message 111575.

I think your card takes around 33k sec to complete a WU, so no matter what the theoretical (computed) peak performance is, the server would be right to assign a ca 7 times lower performance to your card.

7 or 60? Quite some difference. But OK, I am running with a changed flops value, still only 11 digits long but different than what Albert gave me. Since its estimates are all too low (you're right about the ~32k seconds) I've made it think that the tasks are actually longer, not shorter.

Just too bad I'm still quite busy with Skyrim. That hacks into the time anything else can use the GPU. ;-)
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111580 - Posted: 15 Dec 2011 | 19:02:22 UTC - in response to Message 111577.
Last modified: 15 Dec 2011 | 19:02:56 UTC

The way of calculating (projected_)flops differs largely depending on how many tasks with this app version your host has successfully computed. Maybe this value differs between your hosts.

Anyway, I did change the scheduler (the "projected_flops" supplied by th eplan classes should be much lower now). At least they should bot overestimate the actual flops now, which could lead to "maximum time exceeded" errors. Time estimates on the Client side may be far off now, though. Have a try.

BM

Profile Bikeman (Heinz-Bernd Eggenstein)
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 28 Aug 06
Posts: 1447
Credit: 1,757,982
RAC: 2,126
Message 111581 - Posted: 15 Dec 2011 | 20:44:28 UTC - in response to Message 111580.
Last modified: 15 Dec 2011 | 20:45:37 UTC

Hmmm....I get the same cut-off time as before (even tho I resetted the Albert project before allowing new work). In addition, the app now seems to be configured to use a full CPU core.

http://albert.phys.uwm.edu/result.php?resultid=66620

CU
HB
____________

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111582 - Posted: 15 Dec 2011 | 21:16:35 UTC - in response to Message 111581.

Ok, scheduler reverted. Needs further investigation.

BM

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111585 - Posted: 16 Dec 2011 | 0:32:33 UTC - in response to Message 111580.
Last modified: 16 Dec 2011 | 0:33:15 UTC

The way of calculating (projected_)flops differs largely depending on how many tasks with this app version your host has successfully computed. Maybe this value differs between your hosts.

LOL, like zero times for me? None of the tasks I do validate, remember?

As for testing your over_flops, what do I do with the extra tasks? Stupid BOINC always fetches 6 tasks, doesn't matter that it then takes ~9 days to do them... Though you now have ~9 days to come up with a better schedule(r). ;-)
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

robertmiles
Send message
Joined: 16 Nov 11
Posts: 17
Credit: 594,905
RAC: 2,762
Message 111586 - Posted: 16 Dec 2011 | 0:54:09 UTC
Last modified: 16 Dec 2011 | 0:55:32 UTC

Have you thought of starting with a certain number of dummy tasks, to be replaced with similar information from tasks actually completed as soon as there are enough of them?

Some BOINC projects limit the number of tasks any computer can have downloaded and in progress at first, with this limit relaxed as soon as there are enough tasks successfully completed by that computer to get a better idea of how often it can handle yet another workunit.

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111592 - Posted: 16 Dec 2011 | 13:21:13 UTC - in response to Message 111586.

Some BOINC projects limit the number of tasks any computer can have downloaded and in progress at first, with this limit relaxed as soon as there are enough tasks successfully completed by that computer to get a better idea of how often it can handle yet another workunit.


That's certainly an option to limit the effect of the runtime estimation / work fetch going mad. But actually I'd like to understand and fix what's going wrong in the first place.

For now i raised the FLOPS estimation and thus the FLOPS limit by a factor of 10 for newly generated workunits. It will take some time (usually about 1.5d) until the first tasks from that will be sent out, though.

BM

robertmiles
Send message
Joined: 16 Nov 11
Posts: 17
Credit: 594,905
RAC: 2,762
Message 111594 - Posted: 16 Dec 2011 | 17:30:54 UTC

I've read that at least some of the BOINC versions never initialize one of the variables often used in runtime estimation. You may want to add reporting of the variables you use so you can check for signs of this.

Profile Ageless
Avatar
Send message
Joined: 26 Jan 05
Posts: 1639
Credit: 70,000
RAC: 0
Message 111601 - Posted: 17 Dec 2011 | 1:23:32 UTC - in response to Message 111592.

That's certainly an option to limit the effect of the runtime estimation / work fetch going mad. But actually I'd like to understand and fix what's going wrong in the first place.

There is something weird going on with the amount of tasks one has per day. As you can see from my double zero credit & RAC, I haven't had one task validate yet. So by now, the amount of tasks I should be able to download for the v1.19 app should be 1, maybe 2.

Yesterday it was 26, now it is 32. Why is it going up?
I am not returning any valid work. Shouldn't it, like in the old days, continue to go down and eventually only give me 1 task per device (CPU core or GPU) per day? As with this, I can continue ad infinitum doing 'bad work'.
____________
Jord.

BOINC FAQ Service

They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake.

Profile Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 15 Oct 04
Posts: 1925
Credit: 1,921,480
RAC: 838
Message 111609 - Posted: 20 Dec 2011 | 10:26:47 UTC
Last modified: 20 Dec 2011 | 10:29:50 UTC

I incorporated D.A.s recent fix for using "conservative flops estimate" in case "we don't have enough statistics" (i.e. too few valid results) into the scheduler running on Albert.

Let's see whether this helps ...

BM

PS: Besides I added some logging that should write the Client's max runtime for every job sent to the scheduler log. You may spot it in the logs for your hosts.

Profile Bikeman (Heinz-Bernd Eggenstein)
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 28 Aug 06
Posts: 1447
Credit: 1,757,982
RAC: 2,126
Message 111622 - Posted: 22 Dec 2011 | 22:46:41 UTC - in response to Message 111609.
Last modified: 25 Dec 2011 | 21:13:49 UTC

Hi!

I just got this:

2011-12-22 22:39:37.5065 [PID=14669] [version] Checking plan class 'atiOpenCL' 2011-12-22 22:39:37.5065 [PID=14669] [version] host_flops: 2.972295e+09, speedup: 15.00, projected_flops: 4.458442e+10, peak_flops: 4.176000e+12, peak_flops_factor: 1.00


Still, the estimated CPU time as displayed by boinccmd for such a task is below 50 seconds ... :-( It will actaully take almost 100 times longer.

HB
____________

Profile Bikeman (Heinz-Bernd Eggenstein)
Volunteer moderator
Project administrator
Project developer
Avatar
Send message
Joined: 28 Aug 06
Posts: 1447
Credit: 1,757,982
RAC: 2,126
Message 111625 - Posted: 25 Dec 2011 | 21:16:15 UTC - in response to Message 111622.

I guess I got a few from the old batch.

Now everything is fine, the runtime estimate is reasonably pessimistic now and tasks validate ok.

HB
____________

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111646 - Posted: 4 Jan 2012 | 11:58:47 UTC - in response to Message 111573.

no hint from Oliver either. Where is he by the way, seems like he evaporated. ;-)


Sort of, holiday season... :-)

Happy new year!

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111647 - Posted: 4 Jan 2012 | 12:02:15 UTC - in response to Message 111575.

It would be instructive to see whether the debugging output in question is common to all 4xxx series cards. Ah..but that's a different subject.


They will. The 4xxx series doesn't support local memory, it's emulated via global memory which incurs a big impact on performance. Also, this series only allows for 64 work items per work group when local memory is used, hence the resizing. However, I doubt that the resizing actually affects the accuracy of the computation, but if it does, it needs to be fixed!

Oliver

oz
Send message
Joined: 28 Feb 05
Posts: 10
Credit: 1,060,681
RAC: 0
Message 111660 - Posted: 6 Jan 2012 | 20:38:39 UTC

Hi,

I also have aborted task due to

exceeded elapsed time limit 19036.53 (28000000.00G/1470.86G) problem
.

The GPU is in bad state with reboot required. All other downloaded openCL tasks are started by BOINC and immediately aborted with:

Output file p2030.20100913.G44.55+00.20.N.b6s0g0.00000_2424_1_3 for task p2030.20100913.G44.55+00.20.N.b6s0g0.00000_2424_1 absent


This is finished after reaching the daily quota of task
I successfully finished atiopenCL tasks with 50000s runtime.
System: Linux Ubuntu Oneiric
OpenCL: ATI GPU 0: Juniper (driver version CAL 1.4.1646, device version OpenCL 1.1 AMD-APP-SDK-v2.5 (684.213), 1024MB)
Catalyst 11.11

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111776 - Posted: 31 Jan 2012 | 11:21:05 UTC - in response to Message 111647.


They will. The 4xxx series doesn't support local memory, it's emulated via global memory which incurs a big impact on performance. Also, this series only allows for 64 work items per work group when local memory is used, hence the resizing. However, I doubt that the resizing actually affects the accuracy of the computation, but if it does, it needs to be fixed!


Well, it turned out it does indeed! We'll fix it ASAP.

Oliver

Oliver Bock
Volunteer moderator
Project administrator
Project developer
Send message
Joined: 4 Sep 07
Posts: 116
Credit: 5,965,020
RAC: 1
Message 111779 - Posted: 1 Feb 2012 | 12:19:53 UTC - in response to Message 111776.

Ok, bug fix implemented and tested. We'll release v1.20 shortly...

Oliver

Post to thread

Message boards : News : Sending work


Home · Your account · Message boards

This material is based upon work supported by the National Science Foundation (NSF) under Grant PHY-0555655 and by the Max Planck Gesellschaft (MPG). Any opinions, findings, and conclusions or recommendations expressed in this material are those of the investigators and do not necessarily reflect the views of the NSF or the MPG.

Copyright © 2013 Bruce Allen for the LIGO Scientific Collaboration