Sending work |
| log in |
Message boards : News : Sending work
| Author | Message |
|---|---|
|
Albert@home is still an unofficial, non-public test project. Don't expect anything to work here. The only type of work Albert@home is currently sending out is for a highly experimental BRP4 OpenCL application. | |
| ID: 111188 | | |
|
Great, will expect doom and hell-fire. :D | |
| ID: 111189 | | |
Albert@home is still an unofficial, non-public test project. Don't expect anything to work here. The only type of work Albert@home is currently sending out is for a highly experimental BRP4 OpenCL application. There is no option in the preferences to select ATI graphic cards so the OpenCL will only be done with Nvidia at the moment. Conan ____________ | |
| ID: 111190 | | |
|
Thanks for reporting. Although I think due to the way this setting is handled internally you shouldn't be able to opt-out of ATI 'work' right now. | |
| ID: 111192 | | |
Ok, so here as well, something breaks the signature. Enigma has the same problem. Honestly I don't care much about sigs at all (I usually view posts w/o sig), and in particular on Albert. This is about the last thing I intend to test here. The problem may occur because Albert doesn't export any stats, or something else. Can you edit your preferences and set the signature again from scratch? BM | |
| ID: 111193 | | |
|
Nah, that doesn't fix it but as you said, it's not a priority. The infinite work duration tasks that are sent... now that's another thing. ;-) Although I think due to the way this setting is handled internally you shouldn't be able to opt-out of ATI 'work' right now. Actually, that is possible with the <ignore_ati_dev/> option in the clients that can do that, and possibly with the <exclude_gpu/> option in the latest test clients. Then it'll just do: 2011-11-01 05:13:40.7956 [PID=8972 ] [version] Checking plan class 'ATIOpenCL' 2011-11-01 05:13:40.7956 [PID=8972 ] [version] No ATI devices found 2011-11-01 05:13:40.7956 [PID=8972 ] [version] [AV#443] app_plan() returned false 2011-11-01 05:13:40.7956 [PID=8972 ] [version] Checking plan class 'NVOpenCL' 2011-11-01 05:13:40.7956 [PID=8972 ] [version] No NVidia devices found 2011-11-01 05:13:40.7956 [PID=8972 ] [version] [AV#440] app_plan() returned false But really, the BOINC back-end code should self-detect whether an ATI application is available in http://albert.phys.uwm.edu/apps.php and then automatically enable the "Use ATI GPU" preference. Apparently it doesn't work yet with your used version of the back-end and OpenCL applications. ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111194 | | |
Thanks for reporting. Although I think due to the way this setting is handled internally you shouldn't be able to opt-out of ATI 'work' right now. Done. BM | |
| ID: 111195 | | |
But really, the BOINC back-end code should self-detect whether an ATI application is available in http://albert.phys.uwm.edu/apps.php and then automatically enable the "Use ATI GPU" preference. Apparently it doesn't work yet with your used version of the back-end and OpenCL applications. Yah, we didn't look into that clever piece of code when we set up the plan classes. It did only work if your plan class name contained the string "ati" (lowercase). BM | |
| ID: 111196 | | |
|
Nothing but Computation errors so far on the OpenCL Wu's after just a few seconds. Tried 11.6 & 11.9 Drivers 5850 & 5870 Cards ... | |
| ID: 111201 | | |
|
The Linux App appears to work, at least on one of our machines. | |
| ID: 111202 | | |
|
[20:05:56][5684][INFO ] Starting data processing... | |
| ID: 111204 | | |
FWIW the App works when running standalone, it crashes only when running under the BOINC Client. How do you run the app stand-alone? It's not an executable. It's called einsteinbinary_BRP4_1.09_windows_intelx86__ATIOpenCL ... no .exe Could that be the problem? D:\ProgramData\BOINC\projects\albert.phys.uwm.edu\einsteinbinary_BRP4_1.09_windows_intelx86__ATIOpenCL caused an Access Violation at location 007b481c in module D:\ProgramData\BOINC\projects\albert.phys.uwm.edu\einsteinbinary_BRP4_1.09_windows_intelx86__ATIOpenCL Writing to location 00000003. ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111208 | | |
|
OK, I got the new AtiOpenCL app 1.11, got a bucket load of work for it as well, but it doesn't start. I even suspended all my other work and still the OpenCL tasks didn't start. | |
| ID: 111209 | | |
|
successfully received work for 2011 iMac/10.7.2 (Darwin 11.2)/ATI Radeon HD 6770M/ | |
| ID: 111210 | | |
|
Hmm, OK, so all that work mysteriously ran and erred within seconds. Th error is still the same as with 1.09 D:\ProgramData\BOINC\projects\albert.phys.uwm.edu\einsteinbinary_BRP4_1.11_windows_intelx86__atiOpenCL.exe caused an Access Violation at location 007b481c in module D:\ProgramData\BOINC\projects\albert.phys.uwm.edu\einsteinbinary_BRP4_1.11_windows_intelx86__atiOpenCL.exe Writing to location 00000003. My system has gone through several reboots yesterday, prior to it attacking the 1.11 work. ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111211 | | |
|
Appear to have completed the first WU successfully: | |
| ID: 111212 | | |
|
Great! | |
| ID: 111215 | | |
|
[21:26:46][3292][INFO ] Starting data processing... | |
| ID: 111218 | | |
|
I wouldn't expect OpenCL to work with clients older than 6.13. | |
| ID: 111219 | | |
|
I got the infamous "GPU type not found in init_data.xml" errror with the recommended BOINC client version 6.12.34. Then I decided to update my BOINC client to version 6.13.10 (pre-release). | |
| ID: 111220 | | |
I wouldn't expect OpenCL to work with clients older than 6.13. FYI, look at this: http://setiathome.berkeley.edu/result.php?resultid=2146353363 OpenCL platform detected: NVIDIA Corporation and all other helpful info about OpenCL detection with BOINC 6.10.60 | |
| ID: 111221 | | |
|
So? Just because Lunatics built a 3rd party application for Seti only that does the OpenCL detection, does not mean that the BOINC you're running does the OpenCL detection. This project uses the OpenCL detection built into the 6.13 clients to determine all OpenCL capable co-processors. | |
| ID: 111223 | | |
So? Just because Lunatics built a 3rd party application for Seti only that does the OpenCL detection, does not mean that the BOINC you're running does the OpenCL detection. This project uses the OpenCL detection built into the 6.13 clients to determine all OpenCL capable co-processors. Ok. Fine. Clear. I'll try 6.13 some later, despite the fact that I don't like > 6.10 interface without Messages tab. | |
| ID: 111224 | | |
|
Then keep the boincmgr.exe and boinc.dll from your present installation. You can run a newer client with an older BOINC Manager. | |
| ID: 111225 | | |
|
FYI, we just updated the AMD OpenCL binaries for Windows, Mac OS 10.7 and Linux (all OpenCL 1.0, minimum BOINC version: 6.13.x). | |
| ID: 111226 | | |
But the messages aren't gone, they are just in a window of their own, reachable by clicking advanced->Event Log, or pressing CTRL+SHIFT+E. I know, but it's inconvenient to use. | |
| ID: 111227 | | |
Current status: Yes, look at that. v1.17 is finally running longer than 4 seconds. :-) Still... not out of the woods yet. At 90 seconds now I have a computation error. http://albert.phys.uwm.edu/result.php?resultid=38165 http://albert.phys.uwm.edu/result.php?resultid=38123 And now they all end with error at 43 seconds. <core_client_version>6.13.10</core_client_version> <![CDATA[ <message> There was an error while deleting the color transform. (0x7e3) - exit code 2019 (0x7e3) </message> <stderr_txt> Activated exception handling... [17:10:04][3420][INFO ] Starting data processing... [17:10:05][3420][INFO ] Using OpenCL platform provided by: Advanced Micro Devices, Inc. [17:10:05][3420][INFO ] Using OpenCL device "ATI RV770" by: Advanced Micro Devices, Inc. [17:10:07][3420][INFO ] Checkpoint file unavailable: status.cpt (No such file or directory). ------> Starting from scratch... [17:10:07][3420][INFO ] Header contents: ------> Original WAPP file: ./p2030.20100913.G48.73+01.03.S.b6s0g0.00000_DM24.80 ------> Sample time in microseconds: 65.4762 ------> Observation time in seconds: 274.62705 ------> Time stamp (MJD): 55453.031586060431 ------> Number of samples/record: 0 ------> Center freq in MHz: 1214.289551 ------> Channel band in MHz: 0.33605957 ------> Number of channels/record: 960 ------> Nifs: 1 ------> RA (J2000): 191718.8153 ------> DEC (J2000): 143053.3507 ------> Galactic l: 0 ------> Galactic b: 0 ------> Name: G48.73+01.03.S ------> Lagformat: 0 ------> Sum: 1 ------> Level: 3 ------> AZ at start: 0 ------> ZA at start: 0 ------> AST at start: 0 ------> LST at start: 0 ------> Project ID: -- ------> Observers: -- ------> File size (bytes): 0 ------> Data size (bytes): 0 ------> Number of samples: 4194304 ------> Trial dispersion measure: 24.8 cm^-3 pc ------> Scale factor: 0.114462 [17:10:17][3420][INFO ] Seed for random number generator is -1018057414. [17:10:48][3420][INFO ] Derived global search parameters: ------> f_A probability = 0.08 ------> single bin prob(P_noise > P_thr) = 9.93986e-009 ------> thr1 = 18.4267 ------> thr2 = 21.5421 ------> thr4 = 26.5915 ------> thr8 = 35.0049 ------> thr16 = 49.3672 [17:10:49][3420][ERROR] Error during OpenCL kernel setup: TSMR-1 (error: -54) [17:10:49][3420][ERROR] Demodulation failed (error: 2019)! 17:10:49 (3420): called boinc_finish </stderr_txt> ]]> So stop deleting the color transform. ;-) Task list: http://albert.phys.uwm.edu/results.php?userid=7430 While running that work, GPU load is zero according to GPU-Z: Date , GPU Core Clock [MHz] , GPU Memory Clock [MHz] , GPU Temperature [°C] , Fan Speed [%] , GPU Load [%] , Fan Speed [RPM] , GPU Temp.(DISPIO) [°C] , GPU Temp.(MEMIO) [°C] , GPU Temp.(SHADERCORE) [°C] , 2010-07-26 08:23:45 , 500.0 , 700.0 , 39.0 , 27 , 1 , - , 39.0 , 38.5 , 34.5 , 2010-07-26 08:23:46 , 500.0 , 700.0 , 39.0 , 27 , 1 , - , 39.0 , 38.0 , 34.5 , 2010-07-26 08:23:47 , 500.0 , 700.0 , 39.0 , 27 , 1 , - , 39.5 , 38.5 , 34.5 , 2010-07-26 08:26:54 , 500.0 , 700.0 , 38.0 , 27 , 0 , - , 37.5 , 36.5 , 33.5 , ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111228 | | |
|
Thanks Jord! That's really helpful. | |
| ID: 111229 | | |
|
I rebooted, just to make sure it wasn't something stuck in my GPU. 2011-11-10 17:27:39 , 625.0 , 950.0 , 54.0 , 30 , - , 100 , 54.0 , 54.0 , 51.5 , 55 , 28 , 1.120 , 2011-11-10 17:27:40 , 625.0 , 950.0 , 55.0 , 30 , - , 100 , 54.5 , 54.0 , 51.0 , 55 , 28 , 1.120 , 2011-11-10 17:27:41 , 625.0 , 950.0 , 55.0 , 30 , - , 100 , 54.5 , 54.5 , 52.0 , 55 , 28 , 1.120 , 2011-11-10 17:27:42 , 625.0 , 950.0 , 55.0 , 30 , - , 100 , 55.5 , 54.5 , 51.5 , 55 , 28 , 1.120 , 2011-11-10 17:27:43 , 625.0 , 950.0 , 55.0 , 30 , - , 100 , 55.0 , 54.0 , 51.5 , 55 , 28 , 1.120 , See the load? (And yes, I also see that the fan is either not recognized or not working...) ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111230 | | |
|
The same result as Ageless already has showed: | |
| ID: 111231 | | |
The same result as Ageless already has showed: Yep, you got the same device series (HD 4xxx) as Jord. The issue is understood and I'm working on a fix, but it's not that easy... You guys should opt-out of GPU tasks until the next version. Hang in there, Oliver | |
| ID: 111232 | | |
|
After Upgrading to v6.13.10 on my ATI Box I now have 2 Wu's running, 1 on a 5870 & 1 on a 5850 ... Very inefficient though, using only 29% & 25% Of the GPU ... Need a App File to possibly run 2 or maybe 3 @ a time if possible ... | |
| ID: 111233 | | |
FYI, we just updated the AMD OpenCL binaries for Windows, Mac OS 10.7 and Linux (all OpenCL 1.0, minimum BOINC version: 6.13.x). Oliver, Mac OS X 10.7 does OpenCL 1.1 already, for OpenCL 1.0 OS X 10.6 is also fine. ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111234 | | |
|
Greetings | |
| ID: 111235 | | |
I know. a) We have an OpenCL 1.1 version, but it would constrain the list of compatible GPUs. b) 10.6 is buggy and Apple doesn't care about it, hence we can't support it. Oliver | |
| ID: 111236 | | |
|
Hi guys,
We deployed updated binaries (v1.19) that should adjust dynamically to the specific hardware they run on. This should fix the issues you've encountered and ensure support for lower-end GPUs like the HD 4xxx series. *Finger's crossed* Oliver | |
| ID: 111238 | | |
*Finger's crossed* Must be difficult to type with crossed fingers. I just tried it. So please uncross. It seems to be working. ;) GPU-Z: Date , GPU Core Clock [MHz] , GPU Memory Clock [MHz] , GPU Temperature [°C] , Fan Speed (%) [%] , Fan Speed (RPM) [RPM] , GPU Load [%] , GPU Temp.(DISPIO) [°C] , GPU Temp.(MEMIO) [°C] , GPU Temp.(SHADERCORE) [°C] , Memory Usage (Dedicated) [MB] , Memory Usage (Dynamic) [MB] , VDDC [V] , Though, can you explain to me why the OpenCL BRP4 app runs at below normal priority? ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111244 | | |
|
Not yet. | |
| ID: 111245 | | |
|
<avg_ncpus>0.150000</avg_ncpus> | |
| ID: 111246 | | |
The task is running, but going slowly. At present rate, with it taking 2 hours to do 22%, it'll take ~9 hours to do the whole 100%. A tad slow, when compared to a similar task at Einstein done on my CPU takes ~10 hours. You're mistaken, these are not similar tasks. If you want to compare GPU vs CPU performance you need to crunch a few of the tasks over here on your CPU. Also, the actual GPU performance varies quite a bit from the Radeon HD 4xxx to the 5xxx and the 6xxx series. Some approx. runtime numbers: HD 6970 (Windows): 1 hour HD 6970 (Linux, no PGO): 1.2 hours HD 5770 (Mac OS 10.7): 7.2 hours You see that your 4xxx GPU blends in just fine... Best, Oliver | |
| ID: 111248 | | |
|
Current status as of v1.19: | |
| ID: 111249 | | |
Though, can you explain to me why the OpenCL BRP4 app runs at below normal priority? Because it uses less than one CPU (avg_ncpus < 1). If it would use a full CPU core (like CPU Apps do), it would run in "Idle" priority. BM | |
| ID: 111250 | | |
So? Just because Lunatics built a 3rd party application for Seti only that does the OpenCL detection, does not mean that the BOINC you're running does the OpenCL detection. This project uses the OpenCL detection built into the 6.13 clients to determine all OpenCL capable co-processors. You are right. The message tab is annoying. | |
| ID: 111257 | | |
|
I am now using ATI5850 Driver11.9 and SDK2.5 | |
| ID: 111276 | | |
|
Could you mention a little more about what GPUs are suitable for the current A@H application? What I've seen so far suggests that it may require AMD/ATI GPUs, and is not ready for Nvidia GPUs yet. | |
| ID: 111277 | | |
For example, must the GPU have hardware support for double precision? No, the floating point speed of the GPU has nothing to do with OpenCL. For AMD/ATI GPUs, only the GPUs at the requirements page on ATI's SDK web site, will have OpenCL capability. In numbers, from the HD4300 onwards and every GPU thereafter, plus the Firepor V3800 and everyone thereafter. For nVidia it's easier, there any GPU capable of CUDA will be able to do OpenCL. In numbers, from the GEFORCE 8300 onwards and every GPU since. I've seen rumors that the 7.0 version of BOINC will be available soon, with at least some support for OpenCL GPU workunits, but almost nothing more on just how much OpenCL GPU support. BOINC 7 (being tested as BOINC 6.13) will support both ATI and nVidia GPUs, with a possibility for Intel GPUs to follow once Intel gets an API out. But really, BOINC doesn't need to support OpenCL, as it doesn't do any of the work. The science application will need to do OpenCL. All that BOINC will do is detect if your GPU is OpenCL capable and if so, which version it is compliant to. Just as it already did detect if the nVidia GPU is CUDA capable and the ATI GPU CAL/Brook+ capable. ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111278 | | |
What I've seen so far suggests that it may require AMD/ATI GPUs, and is not ready for Nvidia GPUs yet. As a starter please see this post of mine. Apart from that, we do have a dedicated CUDA app so we don't focus on running our OpenCL app on NVIDIA devices for the time being. Memory-wise 512 MB cards might be already too "small" for the current app. We always strive to provide apps that run on as many GPUs as possible but there are technical and algorithmic constraints that simply don't allow us to reduce the GPU memory footprint of the OpenCL app right now.
No. Cheers, Oliver | |
| ID: 111281 | | |
Apart from that, we do have a dedicated CUDA app so we don't focus on running our OpenCL app on NVIDIA devices for the time being. On that though, the whole point of OpenCL is to make a new standard in GPGPU and other processors, where presumably one code can work on many pieces of hardware. So in essence, you ought to be able to use the OpenCL app for an ATI GPU on an nVidia GPU, an Intel GPU, an AMD or Intel CPU, and presumably on any Tesla you throw it at. In a perfect world... ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111282 | | |
|
1 task completed!!! | |
| ID: 111283 | | |
Sorry, that's a misconception. While the OpenCL framework itself indeed facilitates the general approach one still has to take into account the differences of each architecture. OpenCL doesn't help you to overcome that out of the box. While a single codebase should run on most OpenCL capable devices, it won't do so efficiently (e.g. thread parallelism vs. data parallelism on NVIDIA and AMD GPUs). Thus it boils down to the fact that you can stick to a single framework for coding/building/deployment but you still have to write architecture-aware implementations for an efficient solution. Best, Oliver | |
| ID: 111284 | | |
|
Not just that. On my travels, I've seen that due to ATI's different approach to GPGPU, from Close-to-Metal through BrookGPU to OpenCL, that their drivers are mature enough to support OpenCL on their hardware. | |
| ID: 111285 | | |
|
It's not that bad after all. However, while our current OpenCL app runs on NVIDIA GPUs indeed it doesn't yet produce valid results on all their models. Most likely due to some subtle differences (optimizations) in NVIDIA's runtime OpenCL compiler... We're investigating but we'll focus on OpenCL@AMD first since we got a CUDA app anyway... | |
| ID: 111286 | | |
|
Looks like the OpenCL app for ATI can't validate against the CPU app: http://albert.phys.uwm.edu/workunit.php?wuid=11889 | |
| ID: 111290 | | |
|
Separate post for this, can you stop sending ATI OpenCL work to non-6.13 clients? | |
| ID: 111291 | | |
Separate post for this, can you stop sending ATI OpenCL work to non-6.13 clients? Guess what we do already? :-) We're aware of that problem but there's no fix right now... Still investigating... Oliver | |
| ID: 111298 | | |
Looks like the OpenCL app for ATI can't validate against the CPU app: http://albert.phys.uwm.edu/workunit.php?wuid=11889 We're still tuning the validator. That's part of this test, we need to sample the numerical stability/inaccuracies across various platforms and devices...
Hm? | |
| ID: 111299 | | |
Sneaky... now it's sent to a third party. It wasn't for several hours last night, before I made the comment... I just needed more patience then. ;-) ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111300 | | |
Guess what we do already? :-) One further request then, decrease the deadline here? I see wingmen who have seemingly abandoned the cause, so now I'll have to wait 14 days before anything is resent. That's too much for a test project. Just make the deadline 3 to 5 days, that's time enough to crunch the work, even on a multi-project system, and send it back. You want the results back fast, don't you? :-) ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111301 | | |
Should work now... | |
| ID: 111302 | | |
Looks adequate for computers with only one GPU. On other BOINC projects, I've seen one thing mentioned that BOINC really needs to do if you want to be able to run both an OpenCL GPU workunit and a non-OpenCL GPU workunit at the same time on computers with more than one GPU - provide a standard way of mapping the way BOINC identifies which GPU it has assigned to a workunit to the way OpenCL identifies those GPUs. Otherwise, it's likely that those two workunits will often attempt to use the same GPU at the same time, with both failing. Except for this, I've seen nothing on features for OpenCL that need to be in BOINC instead of the application. Adding more features would be helpful, but not required, for BOINC projects wanting OpenCL GPU workunits. I currently don't have any AMD/ATI GPUs, so I won't be able to much for Albert@Home soon, at least until I find time to measure the size requirements for putting such a graphics board into my newest computer. | |
| ID: 111312 | | |
Albert@home is still an unofficial, non-public test project. Don't expect anything to work here. The only type of work Albert@home is currently sending out is for a highly experimental BRP4 OpenCL application. I've recently received some SSE and CUDA workunits. Should I consider them outdated workunits left over from Einstein@Home, or should I consider them Nvidia and CPU versions of the OpenCL workunits? | |
| ID: 111329 | | |
I've recently received some SSE and CUDA workunits. Should I consider them outdated workunits left over from Einstein@Home, or should I consider them Nvidia and CPU versions of the OpenCL workunits? The latter. All BRP4 work units on albert can be considered equal and it doesn't matter whether they're run on a CPU, a CUDA device or via OpenCL. This allows us to check cross-platform/device validation and get a feeling for their relative performances. It also allows us to test BOINC's behavior in mixed-GPU (NVIDIA and AMD GPUs in one box) setups - so far the results look good. Best, Oliver | |
| ID: 111336 | | |
|
ATI OpenCL and CUDA32 don't validate together either. | |
| ID: 111385 | | |
ATI OpenCL and CUDA32 don't validate together either. Not universally true but dependent on the NVIDIA GPU. My observation: a GTX 285 (Tesla) does, a GTX 580 (Fermi) doesn't. We'll investigate and will most likely tune the validator. It just takes some time since we're running low on manpower right now and got the download server issue at first prio. Stay tuned, Oliver | |
| ID: 111389 | | |
|
Another one then, http://albert.phys.uwm.edu/workunit.php?wuid=11889. | |
| ID: 111403 | | |
|
Also looks like you need more stringent GPU capability detection. | |
| ID: 111405 | | |
you can set up the scheduler to check for OpenCL capability, memory on the GPU, BOINC client used, etc. GPUs like this need to be locked out, before they're sent work as else it's using unnecessary bandwidth and confuses the user to no end. Yep, already on our TODO list (mostly done, memory requirements pending)... Oliver | |
| ID: 111407 | | |
|
HD4890: 1WU = 62,806.60 sec = 17.5 hours and... 500.00 credits only, I guess? ) | |
| ID: 111411 | | |
|
credits for this alpha project are 500 per WU that validates, irregardless of how long it takes to run. Also, last I read this project is soon not going to export stats, if they haven't yet turned that off. | |
| ID: 111415 | | |
|
CAL ATI Radeon HD 4700/4800 (RV740/RV770) (1024MB) driver: 0.1 | |
| ID: 111416 | | |
|
ATIOpenCL v1.19 doesn't validate with another ATIOpenCL v1.19 either: | |
| ID: 111417 | | |
ATIOpenCL v1.19 doesn't validate with another ATIOpenCL v1.19 either: After 5+ hours the third task finally went out, now to a CUDA GPU. I can tell you up front that this isn't going to validate. Is HR a thought? ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111418 | | |
|
Hmmm... Forget HR. Forget using any HD4850 it seems, or any OpenCL 1.0 only GPUs? I noticed that Infusioned now does have credit, but he did so with a CPU task, not his HD4850. | |
| ID: 111419 | | |
you can set up the scheduler to check for OpenCL capability, memory on the GPU, BOINC client used, etc. GPUs like this need to be locked out, before they're sent work as else it's using unnecessary bandwidth and confuses the user to no end. Due to a bug in the BOINC client this has to wait until at least 6.13.13 (should be released soon). Stay tuned, Oliver | |
| ID: 111421 | | |
Either my GPU is broken, or I use a driver version your app doesn't like (Catalysts 11.6) You need at least Catalyst 11.7 (8.872) since we use APP SDK 2.5 until further notice. Oliver | |
| ID: 111422 | | |
Due to a bug in the BOINC client this has to wait until at least 6.13.13 (should be released soon). Looks like there ain't gonna be a 6.13.13, but that instead it's going to be BOINC 7.0.x :-) ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111437 | | |
Either my GPU is broken, or I use a driver version your app doesn't like (Catalysts 11.6) Darn, we just updated the validator (using less strict tolerances). We won't know whether the 11.6 you used before updating to 11.7 played a role in your tasks being considered invalid :-/ Oliver | |
| ID: 111442 | | |
|
What's it you're saying, Oliver? You want me to return to 11.6? | |
| ID: 111443 | | |
What's it you're saying, Oliver? You want me to return to 11.6? No, you don't have to. You may of course do so as it wouldn't require a full core anymore. Maybe the relaxed validator settings let your tasks through now (we had to tune it anyway)... Thanks for supporting this effort! Oliver | |
| ID: 111446 | | |
|
Hello, [19:27:27][4969][ERROR] Error during OpenCL FFT setup (error: -5) [19:27:27][4969][ERROR] Demodulation failed (error: 2021)! It is Debian unstable, fglrx 11.11, amd-app 2.5, BOINC 6.13.12. The installation of amd-app is essential. Without it, neither primegrid nor albert@H binaries can be executed. Cheers, Steffen ____________ | |
| ID: 111450 | | |
Sorry, not enough GPU memory.
Why? What happens if you don't install it? The runtime libOpenCL.so should already be installed with the driver (as of 11.9 IIRC). Hm, maybe you still need to register the ICD... Oliver | |
| ID: 111456 | | |
What's it you're saying, Oliver? You want me to return to 11.6? I did return to 11.6, but left SDK 2.5 on my system. Immediately the use of the one core went back to 02-10%, instead of the full 25% it was using before that on 11.7. Skyrim is now also back to being a bit more stable. I had many more CTDs (crash to desktop) with 11.7 than I have had with 11.6; where with 11.6 it would be perhaps once a day, with 11.7 it was 7 times yesterday alone. So after the last CTD I reverted back to 11.6 ;-) So... will need an eye on what http://albert.phys.uwm.edu/workunit.php?wuid=13760 will go do. It's me versus two CUDA that can't decide between themselves who's right. I doubt I'll be the clincher for them. ;-) ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111471 | | |
So... will need an eye on what http://albert.phys.uwm.edu/workunit.php?wuid=13760 will go do. It's me versus two CUDA that can't decide between themselves who's right. I doubt I'll be the clincher for them. ;-) As I thought, I wasn't the clincher. That task was crunched with catalysts 11.6 Looks like these don't validate to ATIOpenCL yet. Further fine tuning of the validator? I have suspended the last 3 tasks I have until I hear more. Although, I could of course abort them and see what the newer scheduler in 7.0.2 thinks I should get for loads of amounts of work. With a REC of 11000, too much anyway. ;-) Remember, work that got credit is bad work. Work that didn't validate is good work. It tells the developers here their validator isn't ready yet to work in the outside angry world. :) (and really developers, how many set it and forget it users do you have on here? ;)) ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111477 | | |
So... will need an eye on what http://albert.phys.uwm.edu/workunit.php?wuid=13760 will go do. I see it was another dud. It took another CUDA32 to get validated, me and another CUDA32 finishing outside the points. I have 4 new tasks. 12537 is paired against a BRP3cuda32. I think I won't even try. 15980 is two ATIOpenCL. I wish driver detection here was working so I could make a reasonable guess as to what driver the other guy is using. Driver: 0.1 is useless. 15971 also has me paired against a BRP3cuda32. 15943 also has me paired against a BRP3cuda32. 15980 it is then. Can the developers in the mean time fix the driver detection? ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111480 | | |
| ID: 111484 | | |
|
Easily explained: http://albert.phys.uwm.edu/workunit.php?wuid=12082 | |
| ID: 111485 | | |
|
This is from the Seti@Home Beta message boards (developing an ATi OpenCL App): Please, update your hosts with this builds. ATi and NV builds got speed increase, HD5-version added, NV version added. Thanks, I downloaded the WU and confirm the difference. Maybe this is part of the issue regarding validation? ____________ | |
| ID: 111487 | | |
This is from the Seti@Home Beta message boards (developing an ATi OpenCL App): I think the problem might be in the FFT and CuFFT variations.... But I am not sure. I saw something about such a discussion in another thread... ____________ | |
| ID: 111488 | | |
Maybe this is part of the issue regarding validation? Nvidia cards are still using the CUDA app though, not OpenCL. | |
| ID: 111489 | | |
|
It doesn't mean that NVidia cards don't silently generate overflows in general. | |
| ID: 111490 | | |
Sorry, not up to us. I'm not sure whether the BOINC devs can do anything about it since this might even be an AMD driver issue. Oliver | |
| ID: 111491 | | |
It seems to work in previous boinc manager versions. Maybe not in 6.13xx yet though.... ____________ | |
| ID: 111493 | | |
Sorry, not up to us. I'm not sure whether the BOINC devs can do anything about it since this might even be an AMD driver issue. Of course it's up to you. Apparently the server back-end version that you use doesn't store the ATI/CAL driver version, but it is sent to you. From my 7.0.2 sched_request_albert.phys.uwm.edu.xml file: <coproc_ati> You can even use the OpenCL information. Then with the CAL version we can figure out which Catalysts they are. E.g. CAL 1.4.1417 is Catalysts 11.6 ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111494 | | |
Sorry, not up to us. I'm not sure whether the BOINC devs can do anything about it since this might even be an AMD driver issue. Jord has a point here. And it can help users detect possible wrong drivers when they compare with other users. On the question, "which driver is the best driver for my ATI card?" ____________ | |
| ID: 111495 | | |
|
If you want to be totally confused, it does work on Einstein. See my account there. | |
| ID: 111496 | | |
You may talk about two different things here. Jord, what exactly do you think should be fixed? I do see that displaying the ATI CAL/driver version on the host web pages appears broken (on Albert), and possibly the string in the DB is, too. In the scheduler the ATI "driver" version is stored as "char version[50]" and "int version_num" in coproc_ati, and in "char opencl_driver_version[32]" in opencl_device_prop. These could in principle be used in app_plan(), though we don't check this yet. BM | |
| ID: 111498 | | |
Jord, what exactly do you think should be fixed? Showing of the CAL driver version on ATI cards on the account pages here. Yes, sorry, I said it wrong. I asked for a fix for the driver detection. I know you don't do that, that that's up to the client. I meant that all the driver versions showing for Nvidia GPUs is correct, for all ATI GPUs it's always 0.1, which isn't correct. ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111499 | | |
|
Hm. On your host page I currently read: AMD ATI Radeon HD 4700/4800 (RV740/RV770) (1024MB) driver: 1.4.1417 I don't see anything wrong with that. Maybe the previous entry was from an old Client version? The only thing I changed this morning was (parts of) the web page code, but nothing related to the pages involved here. BM | |
| ID: 111503 | | |
|
Well, whatever you did fixed that bug. It now shows on all hosts I checked which CAL driver version these people use. It may have been in there all this time, just not showing as such. So thanks. :) | |
| ID: 111504 | | |
I know, I'm just using the exact tag/version that contains the required bug fix :-) Oliver | |
| ID: 111505 | | |
I just read something related on the boinc_dev mailing list. It seems that BOINC 7.01 and 7.02 don't allocate enough digits in one of the places they store ATI version numbers, and are therefore likely to get at least some of the version numbers wrong. | |
| ID: 111510 | | |
|
Since the upgrade to 7.0.2 I'm not getting any work for the GPUs .. | |
| ID: 111511 | | |
|
Click on the "last contact" link to see the scheduler logs. 2011-12-08 15:42:31.5128 [PID=32492] [version] Checking plan class 'atiOpenCL' 2011-12-08 15:42:31.5128 [PID=32492] [version] GPU RAM required min: 536870912.000000, supplied: 0 2011-12-08 15:42:31.5128 [PID=32492] [version] [AV#459] app_plan() returned false Hm - looks like there's something wrong with the GPU RAM size reporting. I'll look into that tomorrow. BM | |
| ID: 111513 | | |
|
Hm ... according to sched_request the ATI device isn't OpenCL capable:
<coproc_ati>
<count>1</count>
<name>ATI Radeon HD 5800 series (Cypress)</name>
<available_ram>1024458752.000000</available_ram>
<have_cal>1</have_cal>
<have_opencl>0</have_opencl>
<req_secs>0.000000</req_secs>
<req_instances>0.000000</req_instances>
<estimated_delay>0.000000</estimated_delay>
<peak_flops>50000000000.000000</peak_flops>
<CALVersion>1.4.815</CALVersion>
<target>8</target>
<localRAM>1024</localRAM>
<uncachedRemoteRAM>1788</uncachedRemoteRAM>
<cachedRemoteRAM>508</cachedRemoteRAM>
<engineClock>0</engineClock>
<memoryClock>0</memoryClock>
<wavefrontSize>64</wavefrontSize>
<numberOfSIMD>20</numberOfSIMD>
<doublePrecision>1</doublePrecision>
<pitch_alignment>256</pitch_alignment>
<surface_alignment>256</surface_alignment>
<maxResource1DWidth>16384</maxResource1DWidth>
<maxResource2DWidth>16384</maxResource2DWidth>
<maxResource2DHeight>16384</maxResource2DHeight>
<atirt_detected/>
</coproc_ati>
| |
| ID: 111514 | | |
Hm ... according to sched_request the ATI device isn't OpenCL capable: I don't think Cal version 1.4.815 is OpenCL Capable ... ____________ STE\/E | |
| ID: 111515 | | |
|
Weird .. with BOINC 6.13 the same ATI GPU completed scores of work units: | |
| ID: 111516 | | |
|
What exactly does BOINC do to check for OpenCL? | |
| ID: 111517 | | |
It is, but you have to install the SDK and register the OpenCL ICD yourself. Installing the driver is not sufficient. Gaurav, have you changed anything in your setup? Oliver | |
| ID: 111518 | | |
What exactly does BOINC do to check for OpenCL? It uses libOpenCL via late binding to query a few basic properties. Using 10.8 (as you do) you have to make sure it's available as it's not installed automatically in the usual library paths. Better still, upgrade your driver to 11.7 (11.11 on Linux!). That version of the Catalyst driver installs the OpenCL runtime all by itself. Our app officially requires at least 11.7 anyway as we build it using SDK 2.5. Oliver | |
| ID: 111521 | | |
|
Thanks Oliver. That was helpful. Figured it out .. it was a 64-bit Vs 32-bit issue. The ATI app is 32-bit, but the BOINC client I built is 64-bit. So, I had to make both 32 and 64 bit OpenCL libs available in my library path. | |
| ID: 111531 | | |
Aborting task p2030.20100912.G57.94-00.24.S.b5s0g0.00000_416_1: exceeded elapsed time limit 6394.64 (2800000.00G/432.72G) 3 WUs were aborted after ~6395 sec of run time for the same reason: p2030.20100912.G57.94-00.24.S.b5s0g0.00000_416_1 - 6,395.56 sec p2030.20100913.G44.55+00.20.C.b5s0g0.00000_1824_2 - 6,395.53 sec p2030.20100912.G57.94-00.24.S.b5s0g0.00000_744_0 - 6,395.32 sec WTF? | |
| ID: 111540 | | |
|
Hi!
<rsc_fpops_est>140000000000000.000000</rsc_fpops_est>
<rsc_fpops_bound>2800000000000000.000000</rsc_fpops_bound>
with (say)
<rsc_fpops_est>1400000000000000.000000</rsc_fpops_est>
<rsc_fpops_bound>28000000000000000.000000</rsc_fpops_bound>
(Actually you should check that only WUs of the Albert@Home project are changed if you use that BOINC instance for other projects as well). CU HBE ____________ | |
| ID: 111541 | | |
My guess is that the way BOINC calculates the maximum allowed elapsed time for a task to finish has changed with version 7.x. No, it's worse - it's the new server code ("credit new") that does this. Although we are still using server-assigned credit here on Albert, the run time estimation etc. is handled by the new system. This is part of what we intend to test here. BM | |
| ID: 111554 | | |
|
oh.... | |
| ID: 111557 | | |
|
So far I didn't get any help from the BOINC devs that I asked for, so I'm still analyzing and digging through the code myself. max_elapsed_time = rp->wup->rsc_fpops_bound/rp->avp->flops;
where rsc_fpops_bound should be what it gets passed from the server, and avp->flops the "flops" of the "app version". Apparently that's also sent by the server for the App it sends. Bikeman, what's that in your case? There must be something like <app_version>
...
<flops>62700155339.574905</flops>
...
</app_version>
in the client_state.xml you edited. | |
| ID: 111562 | | |
|
Hi!
<app_version>
<app_name>einsteinbinary_BRP4</app_name>
<version_num>119</version_num>
<platform>i686-pc-linux-gnu</platform>
<avg_ncpus>0.150000</avg_ncpus>
<max_ncpus>1.000000</max_ncpus>
<flops>2402526409719.028320</flops>
<plan_class>atiOpenCL</plan_class>
(but that is after I made the editing). As different users see different cut-off times, I would have expected that this scales with the BOINC benchmark result for the individual graphics card. For mine, it seems to be this:
<coproc_ati>
<count>1</count>
<name>ATI Radeon HD 5800 series (Cypress)</name>
<available_ram>1002438656.000000</available_ram>
<have_cal>1</have_cal>
<have_opencl>1</have_opencl>
<peak_flops>4176000000000.000000</peak_flops>
CU HB ____________ | |
| ID: 111569 | | |
|
The "peak_flops" is not benchmarked. It's merely a theoretical upper bound derived basically from the number of "cores" times the clock frequency. Thus the peak_flops should be identical for similar devices. | |
| ID: 111570 | | |
Yes, let's see. Local stuff, from the HD4850
<app_version>
<app_name>einsteinbinary_BRP4</app_name>
<version_num>109</version_num>
<platform>windows_intelx86</platform>
<avg_ncpus>0.200000</avg_ncpus>
<max_ncpus>1.000000</max_ncpus>
<flops>33321667311.708328</flops>
<plan_class>ATIOpenCL</plan_class>
<api_version>6.13.8</api_version>
For the actual card, I'll do the whole shebang:
<coproc_ati>
<count>1</count>
<name>ATI Radeon HD 4700/4800 (RV740/RV770)</name>
<available_ram>1040187392.000000</available_ram>
<have_cal>1</have_cal>
<have_opencl>1</have_opencl>
<peak_flops>2000000000000.000000</peak_flops>
<CALVersion>1.4.1607</CALVersion>
<target>5</target>
<localRAM>1024</localRAM>
<uncachedRemoteRAM>2047</uncachedRemoteRAM>
<cachedRemoteRAM>2047</cachedRemoteRAM>
<engineClock>625</engineClock>
<memoryClock>900</memoryClock>
<wavefrontSize>64</wavefrontSize>
<numberOfSIMD>10</numberOfSIMD>
<doublePrecision>1</doublePrecision>
<pitch_alignment>256</pitch_alignment>
<surface_alignment>4096</surface_alignment>
<maxResource1DWidth>8192</maxResource1DWidth>
<maxResource2DWidth>8192</maxResource2DWidth>
<maxResource2DHeight>8192</maxResource2DHeight>
<atirt_detected/>
<coproc_opencl>
<name>ATI RV770</name>
<vendor>Advanced Micro Devices, Inc.</vendor>
<vendor_id>4098</vendor_id>
<available>1</available>
<half_fp_config>0</half_fp_config>
<single_fp_config>62</single_fp_config>
<double_fp_config>63</double_fp_config>
<endian_little>1</endian_little>
<execution_capabilities>1</execution_capabilities>
<extensions>cl_amd_fp64 cl_khr_gl_sharing cl_amd_device_attribute_query cl_khr_d3d10_sharing </extensions>
<global_mem_size>1073741824</global_mem_size>
<local_mem_size>16384</local_mem_size>
<max_clock_frequency>625</max_clock_frequency>
<max_compute_units>10</max_compute_units>
<opencl_platform_version>OpenCL 1.1 AMD-APP-SDK-v2.5 (793.1)</opencl_platform_version>
<opencl_device_version>OpenCL 1.0 AMD-APP-SDK-v2.5 (793.1)</opencl_device_version>
<opencl_driver_version>CAL 1.4.1607</opencl_driver_version>
</coproc_opencl>
</coproc_ati>
How come my flops are so little on the einsteinbinary? Only 33321667311 for Windows versus Heinz's 2402526409719 for Linux? Yeah I get it, those are estimated flops by the server, but heck. When the peak flops my GPU can do is 2000000000000 flops, or 60 times the estimated amount of flops my GPU is getting, no wonder why a) work is estimated so (s)low and b) I have a TDCF of 9.59! Plonk. ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111571 | | |
|
Hi! | |
| ID: 111572 | | |
This is quite a mission impossible from a BOINC perspective, isn't it? If the initial estimation is off too much, the WUs will ALL get terminated prematurely, and the server will NEVER get a valid result to adjust the estimation of the computation performance, which is actually needed to provide a good estimation for the max elapsed time in the first place!! If no work ever validates, like in my situation, then the adjustment will also never happen. Why will the work never validate? That's still where I come up clueless - no hint from Oliver either. Where is he by the way, seems like he evaporated. ;-) ... The app would then do a short test run (app developers would know best how to do that) that returns an estimation of the runtime for the entire workunit. Either that or allow that the user sets the amount of flops for all OpenCL capable hardware. Now it can only be set on CUDA work and then only when using the anonymous platform. But really, why is the flops count for my hardware set so low on the server, when the peak flops show that it can do way better. For the HD5800 the amount of digits for the peak flops and the 'actual flops' is the same (13). For my HD4850 it's 13 for the peak flops and 11 for the actual flops. There's got to be something wrong there. I think I'll get work, then exit BOINC, adjust the flops number in client_state.xml, restart BOINC and see if that will make a difference. Maybe that will even validate work. Edit: hehe, I edited the flops value to <flops>1919403979592.269394</flops>, now all tasks think they'll run for 11 minutes. That's gonna wreck my DCF completely. ;-) Maybe I should make it even less... ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111573 | | |
This is quite a mission impossible from a BOINC perspective, isn't it? If the initial estimation is off too much, the WUs will ALL get terminated prematurely, and the server will NEVER get a valid result to adjust the estimation of the computation performance, which is actually needed to provide a good estimation for the max elapsed time in the first place!! Yep, that occurred to me, too. I already added some code to our plan-class stuff that should allow me to play around with the flops estimation a bit. I intend to do this tomorrow, together with some more analysis of the scheduler code (sched_version.cpp). BM | |
| ID: 111574 | | |
|
@Jord: Well, because I fiddled manually in client_state.xml, finally a lot of workunits were completed (even validated, not sure if that matters), and my card took ca 4500 sec per WU. | |
| ID: 111575 | | |
|
Over on GPUGRID, I saw something about them finding that the HD4xxx series cards had some type of memory access problem - a limit on the amount of graphics memory each processor on the GPU can access before it starts using a much lower bandwidth path to the computer's main memory instead. I haven't kept up with whether more recent software updates have removed this restriction. | |
| ID: 111576 | | |
I think your card takes around 33k sec to complete a WU, so no matter what the theoretical (computed) peak performance is, the server would be right to assign a ca 7 times lower performance to your card. 7 or 60? Quite some difference. But OK, I am running with a changed flops value, still only 11 digits long but different than what Albert gave me. Since its estimates are all too low (you're right about the ~32k seconds) I've made it think that the tasks are actually longer, not shorter. Just too bad I'm still quite busy with Skyrim. That hacks into the time anything else can use the GPU. ;-) ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111577 | | |
|
The way of calculating (projected_)flops differs largely depending on how many tasks with this app version your host has successfully computed. Maybe this value differs between your hosts. | |
| ID: 111580 | | |
|
Hmmm....I get the same cut-off time as before (even tho I resetted the Albert project before allowing new work). In addition, the app now seems to be configured to use a full CPU core. | |
| ID: 111581 | | |
|
Ok, scheduler reverted. Needs further investigation. | |
| ID: 111582 | | |
The way of calculating (projected_)flops differs largely depending on how many tasks with this app version your host has successfully computed. Maybe this value differs between your hosts. LOL, like zero times for me? None of the tasks I do validate, remember? As for testing your over_flops, what do I do with the extra tasks? Stupid BOINC always fetches 6 tasks, doesn't matter that it then takes ~9 days to do them... Though you now have ~9 days to come up with a better schedule(r). ;-) ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111585 | | |
|
Have you thought of starting with a certain number of dummy tasks, to be replaced with similar information from tasks actually completed as soon as there are enough of them? | |
| ID: 111586 | | |
Some BOINC projects limit the number of tasks any computer can have downloaded and in progress at first, with this limit relaxed as soon as there are enough tasks successfully completed by that computer to get a better idea of how often it can handle yet another workunit. That's certainly an option to limit the effect of the runtime estimation / work fetch going mad. But actually I'd like to understand and fix what's going wrong in the first place. For now i raised the FLOPS estimation and thus the FLOPS limit by a factor of 10 for newly generated workunits. It will take some time (usually about 1.5d) until the first tasks from that will be sent out, though. BM | |
| ID: 111592 | | |
|
I've read that at least some of the BOINC versions never initialize one of the variables often used in runtime estimation. You may want to add reporting of the variables you use so you can check for signs of this. | |
| ID: 111594 | | |
That's certainly an option to limit the effect of the runtime estimation / work fetch going mad. But actually I'd like to understand and fix what's going wrong in the first place. There is something weird going on with the amount of tasks one has per day. As you can see from my double zero credit & RAC, I haven't had one task validate yet. So by now, the amount of tasks I should be able to download for the v1.19 app should be 1, maybe 2. Yesterday it was 26, now it is 32. Why is it going up? I am not returning any valid work. Shouldn't it, like in the old days, continue to go down and eventually only give me 1 task per device (CPU core or GPU) per day? As with this, I can continue ad infinitum doing 'bad work'. ____________ Jord. BOINC FAQ Service They say most of your brain shuts down in cryo-sleep. All but the primitive side, the animal side. No wonder I'm still awake. | |
| ID: 111601 | | |
|
I incorporated D.A.s recent fix for using "conservative flops estimate" in case "we don't have enough statistics" (i.e. too few valid results) into the scheduler running on Albert. | |
| ID: 111609 | | |
|
Hi!
2011-12-22 22:39:37.5065 [PID=14669] [version] Checking plan class 'atiOpenCL'
2011-12-22 22:39:37.5065 [PID=14669] [version] host_flops: 2.972295e+09, speedup: 15.00, projected_flops: 4.458442e+10, peak_flops: 4.176000e+12, peak_flops_factor: 1.00
Still, the estimated CPU time as displayed by boinccmd for such a task is below 50 seconds ... :-( It will actaully take almost 100 times longer. HB ____________ | |
| ID: 111622 | | |
|
I guess I got a few from the old batch. | |
| ID: 111625 | | |
no hint from Oliver either. Where is he by the way, seems like he evaporated. ;-) Sort of, holiday season... :-) Happy new year! | |
| ID: 111646 | | |
It would be instructive to see whether the debugging output in question is common to all 4xxx series cards. Ah..but that's a different subject. They will. The 4xxx series doesn't support local memory, it's emulated via global memory which incurs a big impact on performance. Also, this series only allows for 64 work items per work group when local memory is used, hence the resizing. However, I doubt that the resizing actually affects the accuracy of the computation, but if it does, it needs to be fixed! Oliver | |
| ID: 111647 | | |
|
Hi, exceeded elapsed time limit 19036.53 (28000000.00G/1470.86G) problem. The GPU is in bad state with reboot required. All other downloaded openCL tasks are started by BOINC and immediately aborted with: Output file p2030.20100913.G44.55+00.20.N.b6s0g0.00000_2424_1_3 for task p2030.20100913.G44.55+00.20.N.b6s0g0.00000_2424_1 absent This is finished after reaching the daily quota of task I successfully finished atiopenCL tasks with 50000s runtime. System: Linux Ubuntu Oneiric OpenCL: ATI GPU 0: Juniper (driver version CAL 1.4.1646, device version OpenCL 1.1 AMD-APP-SDK-v2.5 (684.213), 1024MB) Catalyst 11.11 | |
| ID: 111660 | | |
Well, it turned out it does indeed! We'll fix it ASAP. Oliver | |
| ID: 111776 | | |
|
Ok, bug fix implemented and tested. We'll release v1.20 shortly... | |
| ID: 111779 | | |
Message boards :
News :
Sending work