[New release] BRP app v1.28 feedback thread

Vlatko

Joined: 9 Aug 12
Posts: 4
Credit: 149,500
RAC: 0
Message 112195 - Posted: 24 Aug 2012, 19:39:43 UTC

The speed is amazing. Going from 3800s to 2200s for a single task, and 3500s for 2 tasks. That is on PCIe x16 v1.1, not on v3.0.

Great work

Holmis
Joined: 4 Jan 05
Posts: 104
Credit: 2,104,736
RAC: 0
Message 112196 - Posted: 25 Aug 2012, 16:02:27 UTC

Yesterday I upgraded my GPU to a factory-overclocked GTX 660Ti.

I've run about 15 units, two at a time, and observed a speedup of about 700s compared to the BRP4 v1.25 on Einstein. At the same time the CPU time per result has decreased by about 400s; that's more than half on my system.

v1.28 on Albert, x2, run time ~2170s and CPU time ~360s.
v1.25 on Einstein, x2, run time ~2900s and CPU time ~770s.

GPU-load also increased from ~80% to 95+%.

Great work!
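
For anyone who wants to compare figures like these across hosts, here is a minimal Python sketch of the arithmetic. The function names are illustrative and not part of BOINC or the BRP app; the inputs are the x2 run times and CPU times quoted above.

```python
def speedup(old_seconds: float, new_seconds: float) -> float:
    """Ratio of old to new time; values above 1 mean the new app is faster."""
    return old_seconds / new_seconds


def per_task_seconds(run_time_seconds: float, concurrent_tasks: int) -> float:
    """Effective wall-clock time per finished task when tasks share one GPU."""
    return run_time_seconds / concurrent_tasks


# Figures from the post above, both measured with two tasks at a time.
old_run, new_run = 2900.0, 2170.0   # v1.25 vs v1.28 run time
old_cpu, new_cpu = 770.0, 360.0     # v1.25 vs v1.28 CPU time

print(f"run-time speedup: {speedup(old_run, new_run):.2f}x")      # 1.34x
print(f"effective s/task: {per_task_seconds(new_run, 2):.0f}")    # 1085
print(f"CPU time saved:   {1 - new_cpu / old_cpu:.0%}")           # 53%
```

Dividing the run time by the number of concurrent tasks is what makes x1 and x2 results comparable: an x2 run time of ~2170s corresponds to roughly 1085s of GPU wall-clock time per finished task.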

Vlatko
Joined: 9 Aug 12
Posts: 4
Credit: 149,500
RAC: 0
Message 112197 - Posted: 26 Aug 2012, 17:45:21 UTC - in response to Message 112196.  

How about a single work unit? I want to see if a 660Ti is faster than a GTX 580.

Holmis
Joined: 4 Jan 05
Posts: 104
Credit: 2,104,736
RAC: 0
Message 112198 - Posted: 26 Aug 2012, 18:06:12 UTC - in response to Message 112197.  

How about a single work unit? I want to see if a 660Ti is faster than a GTX 580.

I'd like to test that, but as of right now I can't get any more work for BRP4, probably because the server status page says 0 tasks to send...

Alex
Joined: 1 Mar 05
Posts: 88
Credit: 398,734
RAC: 0
Message 112199 - Posted: 26 Aug 2012, 18:12:35 UTC

BRP3Cuda32
GTX550Ti 1793 / 303 single wu
i3 win7/64 7.0.31 x16 slot

Jeroen
Joined: 25 Nov 05
Posts: 12
Credit: 638,256
RAC: 0
Message 112200 - Posted: 26 Aug 2012, 23:51:06 UTC

I ran the new CUDA 1.28 app on one of my Windows systems today. I have not been able to get much work today, but the two tasks that ran on my GTX 580 completed in 834 seconds each. This is with one task running at a time. GPU load was at approximately 90-91% while running one task.

If memory serves me right, the previous 1.25 app ran at around 1360 seconds per task on this system. This is a very decent improvement in performance. Thanks for the work put into optimizing the BRP4 applications.

Bikeman (Heinz-Bernd Eggenstein)
Volunteer moderator
Project administrator
Project developer
Joined: 28 Aug 06
Posts: 1483
Credit: 1,864,017
RAC: 0
Message 112201 - Posted: 27 Aug 2012, 8:50:08 UTC

Thanks for the feedback. That's actually a bit more of a speedup than I had expected based on some tests on slower hardware. Definitely in relative terms, the speedup is more pronounced on faster cards.

I will now install the Linux CUDA app on Albert, stand by for more tests. I'm eager to see those GTX 680 .... ;-)

Cheers
HB


Holmis
Joined: 4 Jan 05
Posts: 104
Credit: 2,104,736
RAC: 0
Message 112202 - Posted: 27 Aug 2012, 11:57:54 UTC - in response to Message 112197.  

How about a single work unit? I want to see if a 660Ti is faster than a GTX 580.

Got hold of 2 BRP4 tasks and ran them one at a time on my overclocked GTX 660Ti (Core@1201.9 MHz).
Run time: 1183.69 and 1175.39 seconds for an average of 1179.54 s.
GPU load ~84%.
This is on Win7 x64, PCI-E 3.0 x16.

Jeroen
Joined: 25 Nov 05
Posts: 12
Credit: 638,256
RAC: 0
Message 112204 - Posted: 28 Aug 2012, 2:40:06 UTC - in response to Message 112201.  
Last modified: 28 Aug 2012, 2:41:07 UTC

Here are some preliminary numbers for the GTX 680.

One task per GPU

System #1 - Single GPU

x16 3.0 - 721 seconds

System #2 - Multi GPU

x16 3.0 - 785 seconds
x8 3.0 - 901 seconds

Overall, the performance looks great so far. I want to do some more testing with multiple tasks running at once, different PCI-E configurations, and with the CPU dedicated for BRP4 GPU only. The above tests were done with ~50% CPU load from running other CPU tasks at the same time.

archae86
Joined: 6 Dec 05
Posts: 414
Credit: 67,924
RAC: 0
Message 112205 - Posted: 28 Aug 2012, 3:27:55 UTC

I have a possible finding--not even a little bit sure--but am posting in case others might spot such a thing.

I've got two different hosts with the same GPU, a GTX 460. Neither has had downclocking problems for some weeks, but I found both downclocked severely today, with the problem persisting through system reboot.

It might just barely be possible that running the current Albert BRP 1.28 CUDA app, or on an even less likely note the Albert 0.29 Gamma Ray Pulsar application--or switching back and forth from those to the current Einstein applications--was involved.

More likely something else in my system's history was the problem, but I thought I'd post the suspicion in case someone else sees something.

I'm not even sure what the true downclock frequency was, as different sources reported different numbers, but it was 405 MHz or less. The reduction in power consumption and GPU temperature, combined with exceedingly high reported GPU utilization but very slow progress on the WU, was persuasive that downclocking was at hand.
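
The symptom described here (high reported utilization, a low core clock, slow progress) can be turned into a rough check. This is a minimal sketch, assuming clock and utilization readings are already available from some monitoring tool. The `GpuSample` type, the thresholds, and the 700 MHz nominal clock in the usage lines are hypothetical illustration values, not figures from NVIDIA or from this thread; only the 405 MHz reading comes from the post.

```python
from dataclasses import dataclass


@dataclass
class GpuSample:
    """One reading taken from a monitoring tool (hypothetical container)."""
    core_clock_mhz: float
    utilization_pct: float


def looks_downclocked(sample: GpuSample,
                      nominal_clock_mhz: float,
                      clock_ratio: float = 0.7,
                      busy_pct: float = 80.0) -> bool:
    """Flag a GPU that reports heavy load while running far below nominal clock.

    An idle GPU dropping its clock is normal power saving, so a low clock
    alone is not flagged; both conditions must hold.
    """
    busy = sample.utilization_pct >= busy_pct
    slow = sample.core_clock_mhz < clock_ratio * nominal_clock_mhz
    return busy and slow


# 405 MHz under ~99% load, against an assumed 700 MHz nominal clock.
print(looks_downclocked(GpuSample(405.0, 99.0), nominal_clock_mhz=700.0))
# Idle card that has clocked down to save power: not flagged.
print(looks_downclocked(GpuSample(50.0, 0.0), nominal_clock_mhz=700.0))
```

Requiring both conditions is the point of the sketch: a low clock on an idle card is expected, whereas a low clock together with near-full utilization matches the behaviour reported above.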

Vlatko
Joined: 9 Aug 12
Posts: 4
Credit: 149,500
RAC: 0
Message 112206 - Posted: 28 Aug 2012, 12:39:00 UTC - in response to Message 112205.  

I sometimes have the same issue. It only happens when I overclock the GPU by +130 MHz from baseline. After several hours the screen goes blank and GPU-Z shows a core speed of 400 MHz; then I reboot and everything goes back to normal.
I think it is some fail-safe method that Nvidia uses. Also, I noticed that when BOINC is not running the core speed drops to 50 MHz and goes to 800 MHz instantly if demanding GPU work is needed.

zombie67 [MM]
Joined: 10 Oct 06
Posts: 130
Credit: 30,924,459
RAC: 0
Message 112207 - Posted: 29 Aug 2012, 0:55:26 UTC - in response to Message 112177.  
Last modified: 29 Aug 2012, 1:01:52 UTC

Another idea: What I'm always a bit suspicious about (BOINC-performance-wise) is OSX's power saving feature to automatically switch between a CPU built-in GPU (e.g. the Intel stuff in Sandy/Ivy Bridge CPUs) and the dedicated graphics card. Could it be that this feature is set differently on those two hosts perhaps?


Just to be clear, that feature can be turned off, so that the higher performance GPU is always used.

Also, be sure to be clear on which GPUs we are talking about. For example, there is the HD 5850 and there is the Mobility HD 5850. While the numbers are the same, they are completely different things. The HD 5850 uses the Cypress PRO core, and is 2088 GFLOPS. The Mobility HD 5850 is the Juniper chip, and is 800-1000 GFLOPS, depending on the clock. The mobility versions are used in laptops, including the MacBooks.
Dublin, California
Team: SETI.USA

Stephan Goll
Joined: 13 Dec 05
Posts: 19
Credit: 1,874,367
RAC: 0
Message 112208 - Posted: 30 Aug 2012, 9:49:55 UTC
Last modified: 30 Aug 2012, 10:03:35 UTC

After some time I decided to look for Albert again, this time with my nvidia. The reason was:
30-Aug-2012 08:53:39 [Albert@Home] Requesting new tasks for ATI
30-Aug-2012 08:53:44 [Albert@Home] Scheduler request completed: got 0 new tasks

To my surprise there was work:

30-Aug-2012 10:41:08 [Albert@Home] Started download of einsteinbinary_BRP4_1.28_i686-pc-linux-gnu__BRP3cuda32nv270

Hmmm. It looks like there is not only a 1.28 OpenCL binary; it looks like we can expect a newer CUDA binary too. At the moment we have 1.24 on Einstein.

Bernd or Bikeman, can you please give a bit more information? :)
Thanks,
Stephan
PS: Ah ... I see that there was a bit of information. But mostly for Windows. Now let's see what my Linux box can do.
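
The client log lines quoted in this post follow a regular "timestamp [project] message" layout, so scheduler replies can be picked out with a short script. This is a sketch only: the pattern is inferred from the three lines quoted here, not from any BOINC specification, and the helper names are made up for illustration.

```python
import re
from datetime import datetime

# Layout inferred from the quoted lines: "30-Aug-2012 08:53:44 [Project] text"
LINE_RE = re.compile(
    r"^(?P<stamp>\d{2}-[A-Za-z]{3}-\d{4} \d{2}:\d{2}:\d{2}) "
    r"\[(?P<project>[^\]]+)\] (?P<text>.*)$"
)
TASKS_RE = re.compile(r"got (\d+) new tasks")


def parse_line(line):
    """Split one log line into (timestamp, project, message), or None."""
    match = LINE_RE.match(line.strip())
    if match is None:
        return None
    # %b parses English month abbreviations under the default C locale.
    stamp = datetime.strptime(match["stamp"], "%d-%b-%Y %H:%M:%S")
    return stamp, match["project"], match["text"]


def tasks_received(message):
    """Number of tasks in a scheduler reply, or None for other messages."""
    match = TASKS_RE.search(message)
    return int(match.group(1)) if match else None


line = ("30-Aug-2012 08:53:44 [Albert@Home] "
        "Scheduler request completed: got 0 new tasks")
stamp, project, message = parse_line(line)
print(project, stamp.isoformat(), tasks_received(message))
```

Returning `None` for lines that do not match keeps the helpers safe to run over a whole log, where most lines are not scheduler replies.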

Bikeman (Heinz-Bernd Eggenstein)
Volunteer moderator
Project administrator
Project developer
Joined: 28 Aug 06
Posts: 1483
Credit: 1,864,017
RAC: 0
Message 112209 - Posted: 31 Aug 2012, 8:13:27 UTC - in response to Message 112208.  

Hi!

Indeed, the 1.28 CUDA app will be released on Einstein@Home shortly ...probably today.

Cheers
HB


Jeroen
Joined: 25 Nov 05
Posts: 12
Credit: 638,256
RAC: 0
Message 112211 - Posted: 31 Aug 2012, 14:06:51 UTC
Last modified: 31 Aug 2012, 14:07:10 UTC

The older cards are also running well with the new version.

8800GT G92 512 MB - x16 slot @ 5.0 GT/s

1.28: 2940 seconds
1.24: ~3600 seconds



This material is based upon work supported by the National Science Foundation (NSF) under Grant PHY-0555655 and by the Max Planck Gesellschaft (MPG). Any opinions, findings, and conclusions or recommendations expressed in this material are those of the investigators and do not necessarily reflect the views of the NSF or the MPG.

Copyright © 2024 Bruce Allen for the LIGO Scientific Collaboration