
Project server code update

Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Joined: 15 Oct 04
Posts: 1956
Credit: 6,218,130
RAC: 0
Message 112845 - Posted: 2 Jun 2014, 8:59:18 UTC
Last modified: 2 Jun 2014, 9:00:04 UTC

The project will be taken down in about an hour to perform an update of the BOINC server code. Ideally you shouldn't notice anything, but usually the world isn't ideal. See you again on the other side.
ID: 112845
Trotador
Joined: 15 May 13
Posts: 6
Credit: 26,130,548
RAC: 0
Message 112846 - Posted: 2 Jun 2014, 19:04:14 UTC - in response to Message 112845.  

Scheduler request failed: HTTP internal server error

is what I get
ID: 112846
nenym
Joined: 13 Jun 11
Posts: 14
Credit: 10,001,988
RAC: 0
Message 112847 - Posted: 2 Jun 2014, 23:02:31 UTC - in response to Message 112846.  

Scheduler request failed: HTTP internal server error

is what I get

The same here
ID: 112847
Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Joined: 15 Oct 04
Posts: 1956
Credit: 6,218,130
RAC: 0
Message 112850 - Posted: 4 Jun 2014, 6:45:39 UTC - in response to Message 112845.  

Does the problem persist?

We are testing the behavior of "CreditNew" on this project and will try to fix it if necessary. Be prepared for the unexpected!

BM
ID: 112850
Claggy
Joined: 29 Dec 06
Posts: 78
Credit: 4,040,969
RAC: 0
Message 112851 - Posted: 4 Jun 2014, 8:45:30 UTC - in response to Message 112845.  
Last modified: 4 Jun 2014, 8:45:43 UTC

Intel GPUs are now being shown by the project in the computer details pages:

Computer 9008


But 'Use Intel GPU' isn't shown on the Albert project preferences page, even though Intel GPU apps are available. Perhaps those apps need their settings adjusted?

Claggy
ID: 112851
nenym
Joined: 13 Jun 11
Posts: 14
Credit: 10,001,988
RAC: 0
Message 112852 - Posted: 4 Jun 2014, 9:02:18 UTC - in response to Message 112850.  
Last modified: 4 Jun 2014, 9:15:33 UTC

Does the problem persist?

We are testing the behavior of "CreditNew" on this project and will try to fix it if necessary. Be prepared for the unexpected!

BM

04/06/2014 10:53:32 | Albert@Home | Sending scheduler request: Requested by user.
04/06/2014 10:53:32 | Albert@Home | Requesting new tasks for CPU and NVIDIA GPU
04/06/2014 10:53:36 | Albert@Home | Scheduler request failed: HTTP internal server error
The machine has been restarted.

Note: local time is UTC+2 (Prague).

If you are going to use Dave's random number generator, I will leave the project. Some CPU projects have fixed it so that it generates numbers in an expected and acceptable range, but no GPU project has managed to do that. Good luck.

EDIT: Before leaving I'll try my favorite joke: using app_info to get a BRP4 CPU task crunched by intel_gpu. I expect a credit of 0.5 instead of 62.5. It can be seen as WU 590960.
ID: 112852
Richard Haselgrove
Joined: 10 Dec 05
Posts: 450
Credit: 5,409,572
RAC: 0
Message 112853 - Posted: 4 Jun 2014, 9:12:54 UTC - in response to Message 112852.  

If you are going to use Dave's random number generator, I will leave the project. Some CPU projects have fixed it so that it generates numbers in an expected and acceptable range, but no GPU project has managed to do that. Good luck.

We know all that. The purpose of this test is, very specifically, to test and try out some fixes to CreditNew that some volunteers have spent the last nine months developing.

It would be most helpful if you would remain attached to the project, to generate some baseline data from a good range of hosts.

Albert has been chosen for this task specifically because it's a test project where nothing is expected to work anyway!
ID: 112853
Eyrie
Joined: 20 Feb 14
Posts: 47
Credit: 2,410
RAC: 0
Message 112854 - Posted: 4 Jun 2014, 9:16:55 UTC - in response to Message 112852.  

If you are going to use Dave's random number generator, I will leave the project. Some CPU projects have fixed it so that it generates numbers in an expected and acceptable range, but no GPU project has managed to do that. Good luck.

EDIT: Before leaving I'll try my favorite joke: using app_info to get a BRP4 CPU task crunched by intel_gpu. I expect a credit of 0.5 instead of 62.5.


Hold your horses, please!

What we are specifically trying to do is to make something that actually WORKS out of David's RNG!

But for that we need to verify first that we see on Albert what we know from SETI main, before we can go on to stick a proper algorithm into it.

So, PLEASE, bear with us while we establish the system works as expected (i.e. is crap) and then apply the patches into the critical areas.
Queen of Aliasses, wielder of the SETI rolling pin, Mistress of the red shoes, Guardian of the orange tree, Slayer of very small dragons.
ID: 112854
Richard Haselgrove
Joined: 10 Dec 05
Posts: 450
Credit: 5,409,572
RAC: 0
Message 112855 - Posted: 4 Jun 2014, 9:17:49 UTC - in response to Message 112850.  

Does the problem persist?

We are testing the behavior of "CreditNew" on this project and will try to fix it if necessary. Be prepared for the unexpected!

BM

I'm getting work OK on 'CPU only' requests.

But I've attached some extra hosts, which are requesting NVidia work as part of project initialisation - that's returning 'internal server error' (see email). I'll have to find some way of blocking that initial request - seems to be fine after that's out of the way, using a 'CPU only' venue.
ID: 112855
nenym
Joined: 13 Jun 11
Posts: 14
Credit: 10,001,988
RAC: 0
Message 112856 - Posted: 4 Jun 2014, 9:20:16 UTC - in response to Message 112853.  

OK, if Albert is for testing not only applications but also the credit system (like SetiBeta and ralph), I have no problem helping to generate a baseline. That is important to know. In that case I have no problem with low and random credit.
ID: 112856
Eyrie
Joined: 20 Feb 14
Posts: 47
Credit: 2,410
RAC: 0
Message 112857 - Posted: 4 Jun 2014, 9:57:15 UTC - in response to Message 112855.  

Does the problem persist?

We are testing the behavior of "CreditNew" on this project and will try to fix it if necessary. Be prepared for the unexpected!

BM

I'm getting work OK on 'CPU only' requests.

But I've attached some extra hosts, which are requesting NVidia work as part of project initialisation - that's returning 'internal server error' (see email). I'll have to find some way of blocking that initial request - seems to be fine after that's out of the way, using a 'CPU only' venue.

It only worked after moving the host to a _fresh_ venue that has only CPU ticked.


Queen of Aliasses, wielder of the SETI rolling pin, Mistress of the red shoes, Guardian of the orange tree, Slayer of very small dragons.
ID: 112857
Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Joined: 15 Oct 04
Posts: 1956
Credit: 6,218,130
RAC: 0
Message 112858 - Posted: 4 Jun 2014, 10:22:17 UTC

Found and fixed a bug in the scheduler.

Please try again.
ID: 112858
nenym
Joined: 13 Jun 11
Posts: 14
Credit: 10,001,988
RAC: 0
Message 112859 - Posted: 4 Jun 2014, 10:39:00 UTC - in response to Message 112858.  

Seems to be OK.
04/06/2014 12:37:42 | Albert@Home | Sending scheduler request: Requested by user.
04/06/2014 12:37:42 | Albert@Home | Requesting new tasks for CPU and NVIDIA GPU
04/06/2014 12:37:45 | Albert@Home | Scheduler request completed: got 0 new tasks
04/06/2014 12:37:45 | Albert@Home | No tasks sent
04/06/2014 12:37:45 | Albert@Home | Tasks for CPU are available, but your preferences are set to not accept them

ID: 112859
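The event-log excerpts quoted in the posts above all follow a `timestamp | project | message` layout, so the failing and succeeding scheduler requests can be picked out mechanically. The following is a minimal sketch, assuming that three-field layout holds for every line; the function names `parse_line` and `scheduler_outcomes` are hypothetical helpers and not part of BOINC. The two sample lines are copied from the logs posted in this thread.

```python
# Sketch only: assumes each event-log line is "timestamp | project | message".
# parse_line and scheduler_outcomes are hypothetical helper names.

SAMPLE_LOG = """\
04/06/2014 10:53:32 | Albert@Home | Sending scheduler request: Requested by user.
04/06/2014 10:53:36 | Albert@Home | Scheduler request failed: HTTP internal server error
04/06/2014 12:37:42 | Albert@Home | Sending scheduler request: Requested by user.
04/06/2014 12:37:45 | Albert@Home | Scheduler request completed: got 0 new tasks
"""


def parse_line(line):
    """Split one log line into (timestamp, project, message), or return None."""
    parts = [part.strip() for part in line.split("|", 2)]
    if len(parts) != 3:
        return None
    return tuple(parts)


def scheduler_outcomes(log_text):
    """Return (project, status, detail) for every scheduler reply in the log."""
    outcomes = []
    for line in log_text.splitlines():
        parsed = parse_line(line)
        if parsed is None:
            continue
        _, project, message = parsed
        for status in ("failed", "completed"):
            prefix = "Scheduler request " + status + ":"
            if message.startswith(prefix):
                outcomes.append((project, status, message[len(prefix):].strip()))
    return outcomes


if __name__ == "__main__":
    for project, status, detail in scheduler_outcomes(SAMPLE_LOG):
        print(project, status, detail)
```

Running it over a full client log would list each scheduler reply next to its status, which makes it easier to see whether a server-side fix has taken effect on a given host.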
Claggy
Joined: 29 Dec 06
Posts: 78
Credit: 4,040,969
RAC: 0
Message 112864 - Posted: 4 Jun 2014, 11:05:47 UTC
Last modified: 4 Jun 2014, 11:16:54 UTC

My i7-2600K got ATI/AMD work, but when I suspend Seti (where it's crunching OpenCL Seti v7 work), nothing happens: the ATI/AMD WU isn't started.

https://albert.phys.uwm.edu/show_host_detail.php?hostid=8143

Edit: Finally I managed to get it to error:

Activated exception handling...
[12:06:40][13760][INFO ] Starting data processing...
GPU type not found in init_data.xml
[12:06:40][13760][ERROR] Failed to get OpenCL platform/device info from BOINC (error: -161)!
[12:06:40][13760][ERROR] Demodulation failed (error: -161)!
12:06:40 (13760): called boinc_finish

</stderr_txt>

https://albert.phys.uwm.edu/result.php?resultid=1453772

Claggy
ID: 112864
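The stderr output above says the application could not find a GPU type in init_data.xml. A quick way to check a slot's init_data.xml by hand might look like the sketch below. It assumes the file is well-formed XML and that the GPU type is stored in a `<gpu_type>` element; the sample documents, the `app_init_data` root element, and the value "ATI" are hypothetical stand-ins for illustration, and `find_gpu_type` is not a BOINC function.

```python
# Sketch only: the element names and sample documents are assumptions,
# not copied from a real init_data.xml.
import xml.etree.ElementTree as ET

# Hypothetical file contents with and without a GPU type entry.
WITH_GPU = (
    "<app_init_data>"
    "<gpu_type>ATI</gpu_type>"
    "<gpu_device_num>0</gpu_device_num>"
    "</app_init_data>"
)
WITHOUT_GPU = (
    "<app_init_data>"
    "<gpu_device_num>0</gpu_device_num>"
    "</app_init_data>"
)


def find_gpu_type(init_data_xml):
    """Return the text of the first <gpu_type> element, or None if absent or empty."""
    root = ET.fromstring(init_data_xml)
    node = root.find(".//gpu_type")
    if node is None:
        return None
    text = (node.text or "").strip()
    return text or None


if __name__ == "__main__":
    print(find_gpu_type(WITH_GPU))
    print(find_gpu_type(WITHOUT_GPU))
```

If the second case matches what is on disk, the client never told the application which GPU to use, which would be consistent with a problem on the scheduling side rather than in the application itself.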
Richard Haselgrove
Joined: 10 Dec 05
Posts: 450
Credit: 5,409,572
RAC: 0
Message 112865 - Posted: 4 Jun 2014, 11:13:37 UTC

My host 11361 got some CUDA work (like result 1448685), which failed with 'No suitable CUDA device available!' - although there's a fully functional "NVIDIA GeForce GTX 750 Ti (2047MB) driver: 335.28", which crunches CUDA at other projects.
ID: 112865
Bernd Machenschalk
Volunteer moderator
Project administrator
Project developer
Joined: 15 Oct 04
Posts: 1956
Credit: 6,218,130
RAC: 0
Message 112866 - Posted: 4 Jun 2014, 11:55:00 UTC
Last modified: 4 Jun 2014, 12:30:51 UTC

Our plan class specs, which were (semi-)automatically converted for the new server code, were somewhat broken, probably causing all kinds of oddities for GPU tasks. This should be fixed now.

BM
ID: 112866
nenym
Joined: 13 Jun 11
Posts: 14
Credit: 10,001,988
RAC: 0
Message 112867 - Posted: 4 Jun 2014, 13:08:11 UTC

A BRP4G CUDA task is running OK on a 9600GT/XP 32-bit, driver 335.28.
ID: 112867
zombie67 [MM]
Joined: 10 Oct 06
Posts: 130
Credit: 30,924,459
RAC: 0
Message 112868 - Posted: 4 Jun 2014, 13:17:45 UTC - in response to Message 112851.  

Intel GPUs are now being shown by the project in the computer details pages:

Computer 9008


But 'Use Intel GPU' isn't shown on the Albert project preferences page, even though Intel GPU apps are available. Perhaps those apps need their settings adjusted?

Claggy

+1. There is still no setting in preferences to select Intel GPU, as there is for AMD or NVIDIA. Other GPU projects have this, even Einstein.
Dublin, California
Team: SETI.USA

ID: 112868
Claggy
Joined: 29 Dec 06
Posts: 78
Credit: 4,040,969
RAC: 0
Message 112873 - Posted: 4 Jun 2014, 19:48:47 UTC - in response to Message 112866.  
Last modified: 4 Jun 2014, 19:49:16 UTC

Our plan class specs, which were (semi-)automatically converted for the new server code, were somewhat broken, probably causing all kinds of oddities for GPU tasks. This should be fixed now.

BM

All my ATI/AMD tasks are predicted to take six seconds; when they get to 2 minutes 6 seconds they error:

https://albert.phys.uwm.edu/result.php?resultid=1455248

<core_client_version>7.2.42</core_client_version>
<![CDATA[
<message>
Maximum elapsed time exceeded
</message>
<stderr_txt>
Activated exception handling...
[20:37:27][14488][INFO ] Starting data processing...
[20:37:27][14488][INFO ] Using OpenCL platform provided by: Advanced Micro Devices, Inc.
[20:37:27][14488][INFO ] Using OpenCL device "Capeverde" by: Advanced Micro Devices, Inc.
[20:37:27][14488][INFO ] Checkpoint file unavailable: status.cpt (No such file or directory).
------> Starting from scratch...
[20:37:27][14488][INFO ] Header contents:
------> Original WAPP file: ./p2030.20131124.G176.16-01.04.S.b4s0g0.00000_DM336.00
------> Sample time in microseconds: 65.4762
------> Observation time in seconds: 274.62705
------> Time stamp (MJD): 56620.250187503654
------> Number of samples/record: 0
------> Center freq in MHz: 1214.289551
------> Channel band in MHz: 0.336182022
------> Number of channels/record: 960
------> Nifs: 1
------> RA (J2000): 53157.0385017
------> DEC (J2000): 314116.710699
------> Galactic l: 0
------> Galactic b: 0
------> Name: G176.16-01.04.S
------> Lagformat: 0
------> Sum: 1
------> Level: 3
------> AZ at start: 0
------> ZA at start: 0
------> AST at start: 0
------> LST at start: 0
------> Project ID: --
------> Observers: --
------> File size (bytes): 0
------> Data size (bytes): 0
------> Number of samples: 4194304
------> Trial dispersion measure: 336 cm^-3 pc
------> Scale factor: 7.48281e-005
[20:37:29][14488][INFO ] Seed for random number generator is 1203156450.
[20:37:31][14488][INFO ] Derived global search parameters:
------> f_A probability = 0.08
------> single bin prob(P_noise > P_thr) = 1.32531e-008
------> thr1 = 18.139
------> thr2 = 21.241
------> thr4 = 26.2686
------> thr8 = 34.6478
------> thr16 = 48.9581
[20:38:27][14488][INFO ] Checkpoint committed!
[20:39:27][14488][INFO ] Checkpoint committed!
[20:39:44][14488][INFO ] OpenCL shutdown complete!
[20:39:44][14488][WARN ] BOINC wants us to quit prematurely or we lost contact! Exiting...

</stderr_txt>
]]>


Claggy
ID: 112873
Richard Haselgrove
Joined: 10 Dec 05
Posts: 450
Credit: 5,409,572
RAC: 0
Message 112874 - Posted: 4 Jun 2014, 20:08:12 UTC

Holmis reported the same for BRP4G-cuda32-nv301 in the problems area, except he inoculated his against "Exit status 197 EXIT_TIME_LIMIT_EXCEEDED" with a big boost to rsc_fpops_bound.

I guess one of us (and that probably means me) should fire up a GPU fetch and compare the calculations in the server log with what actually ends up in client_state.xml.
ID: 112874
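The link between a six-second prediction, the "Maximum elapsed time exceeded" abort, and the rsc_fpops_bound workaround can be illustrated with a little arithmetic. The sketch below assumes a simplified model in which the predicted runtime is the estimated operation count divided by the assumed device speed, and the elapsed-time limit is rsc_fpops_bound divided by that same speed. The speed, the operation counts, and the 20x margin are hypothetical values chosen so the prediction comes out at six seconds; they are not taken from the server log or from client_state.xml.

```python
# Sketch only: a simplified model of the runtime estimate and time limit.
# Every number below is hypothetical and chosen purely for illustration.


def estimated_runtime(rsc_fpops_est, flops):
    """Predicted runtime in seconds: estimated operations / assumed speed."""
    return rsc_fpops_est / flops


def max_elapsed_time(rsc_fpops_bound, flops):
    """Elapsed-time limit in seconds: operation bound / assumed speed."""
    return rsc_fpops_bound / flops


def exceeds_time_limit(elapsed, rsc_fpops_bound, flops):
    """True if a task that has run for `elapsed` seconds is past the limit."""
    return elapsed > max_elapsed_time(rsc_fpops_bound, flops)


FLOPS = 1e12                        # assumed (far too optimistic) device speed
FPOPS_EST = 6e12                    # gives a six-second prediction
FPOPS_BOUND = 20 * FPOPS_EST        # assumed safety margin of 20x
BOOSTED_BOUND = 100 * FPOPS_BOUND   # a "big boost" to rsc_fpops_bound

if __name__ == "__main__":
    print("predicted runtime (s):", estimated_runtime(FPOPS_EST, FLOPS))
    print("time limit (s):", max_elapsed_time(FPOPS_BOUND, FLOPS))
    print("aborted at 126 s:", exceeds_time_limit(126, FPOPS_BOUND, FLOPS))
    print("aborted after boost:", exceeds_time_limit(126, BOOSTED_BOUND, FLOPS))
```

Under these assumptions an overestimated device speed shrinks both the prediction and the limit, so a task that really needs several minutes is aborted almost immediately. Raising rsc_fpops_bound lifts the limit without correcting the underlying speed estimate.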



This material is based upon work supported by the National Science Foundation (NSF) under Grant PHY-0555655 and by the Max Planck Gesellschaft (MPG). Any opinions, findings, and conclusions or recommendations expressed in this material are those of the investigators and do not necessarily reflect the views of the NSF or the MPG.

Copyright © 2019 Bruce Allen for the LIGO Scientific Collaboration