WARNING: This website is obsolete! Please follow this link to get to the new Albert@Home website!

Posts by oz

1) Message boards : Problems and Bug Reports : [New release] BRP app v1.22 feedback thread (Message 111929)
Posted 15 Mar 2012 by oz
Post:
Hi,
what did you do exactly on client_state.xml? If I change the <flops> entry in the ati_openCL application section it was automatically reset by the application after a while and tasks end up before finishing.

<app_version>
    <app_name>einsteinbinary_BRP4</app_name>
    <version_num>122</version_num>
    <platform>i686-pc-linux-gnu</platform>
    <avg_ncpus>0.150000</avg_ncpus>
    <max_ncpus>1.000000</max_ncpus>
    <flops>4127438621653.708496</flops>
    <plan_class>atiOpenCL</plan_class>
    <api_version>7.0.18</api_version>
    <file_ref>
        <file_name>einsteinbinary_BRP4_1.22_i686-pc-linux-gnu__atiOpenCL</file_name>
        <main_program/>
    </file_ref>
    <file_ref>
        <file_name>einsteinbinary_BRP4_1.00_graphics_i686-pc-linux-gnu</file_name>
        <open_name>graphics_app</open_name>
    </file_ref>
    <coproc>
        <type>ATI</type>
        <count>1.000000</count>
    </coproc>
    <gpu_ram>377487360.000000</gpu_ram>
</app_version>
2) Message boards : Problems and Bug Reports : [New release] BRP app v1.22 feedback thread (Message 111914)
Posted 8 Mar 2012 by oz
Post:
Today I had a lot of atiOpenCL tasks aborted after exactly 24:14 min.

133328  43214   7 Mar 2012 | 16:58:05 UTC       8 Mar 2012 | 6:11:00 UTC        Error while computing   1,454.57        580.95  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
133327  39414   7 Mar 2012 | 16:59:13 UTC       8 Mar 2012 | 6:11:00 UTC        Error while computing   1,453.71        578.01  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
133326  39395   7 Mar 2012 | 16:59:13 UTC       8 Mar 2012 | 6:11:00 UTC        Error while computing   1,454.23        582.54  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
133325  39432   7 Mar 2012 | 16:59:13 UTC       8 Mar 2012 | 8:30:35 UTC        Error while computing   1,453.80        582.52  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
133324  39441   7 Mar 2012 | 16:59:13 UTC       8 Mar 2012 | 8:30:35 UTC        Error while computing   1,453.70        586.05  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
133323  43314   7 Mar 2012 | 16:58:05 UTC       8 Mar 2012 | 6:11:00 UTC        Error while computing   1,454.00        584.04  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
133321  39279   7 Mar 2012 | 16:55:50 UTC       8 Mar 2012 | 6:11:00 UTC        Error while computing   1,453.96        582.06  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
133320  37403   7 Mar 2012 | 17:00:24 UTC       8 Mar 2012 | 8:30:35 UTC        Error while computing   1,454.57        614.77  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
133319  36932   7 Mar 2012 | 17:00:24 UTC       8 Mar 2012 | 8:30:35 UTC        Error while computing   1,453.83        607.69  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
133318  44053   7 Mar 2012 | 17:00:25 UTC       8 Mar 2012 | 10:38:10 UTC       Error while computing   1,453.83        662.62  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
133317  38006   7 Mar 2012 | 17:01:34 UTC       8 Mar 2012 | 12:57:42 UTC       Error while computing   1,454.31        665.01  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
133316  43437   7 Mar 2012 | 16:58:05 UTC       8 Mar 2012 | 6:11:00 UTC        Error while computing   1,454.19        582.07  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)


Bikemans:
133387  44311   7 Mar 2012 | 17:42:19 UTC       8 Mar 2012 | 9:42:23 UTC        Error while computing   947.54  846.04  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
133348  44229   7 Mar 2012 | 17:42:19 UTC       8 Mar 2012 | 9:42:23 UTC        Error while computing   946.95  840.90  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
133346  44226   7 Mar 2012 | 17:43:26 UTC       8 Mar 2012 | 10:07:15 UTC       Error while computing   947.58  838.51  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
133314  39550   7 Mar 2012 | 17:41:09 UTC       8 Mar 2012 | 4:31:47 UTC        Error while computing   946.85  838.29  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
133274  44093   7 Mar 2012 | 17:41:09 UTC       8 Mar 2012 | 4:31:47 UTC        Error while computing   947.08  842.30  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
130790  44395   7 Mar 2012 | 17:43:27 UTC       8 Mar 2012 | 10:43:45 UTC       Error while computing   946.86  844.07  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)
130749  44374   7 Mar 2012 | 17:42:19 UTC       8 Mar 2012 | 9:42:23 UTC        Error while computing   947.30  837.06  ---     Binary Radio Pulsar Search v1.22 (atiOpenCL)


PS.:
Bikemans end up earlier due to better hardware
3) Message boards : Problems and Bug Reports : [New release] BRP app v1.22 feedback thread (Message 111892)
Posted 1 Mar 2012 by oz
Post:
This happens in 7.0.18:
The scheduler requests new jobs, and 3 seconds later it starts S6LV1. I can not imagine that the download of the task is completed then? Normally we can see something like this
Started download of p2030.20111110.G39.19-00.79.N.b2s0g0.00000_3648.binary
Finished download of p2030.20111110.G39.19-00.79.N.b2s0g0.00000_3648.binary

first.

01-Mar-2012 08:39:39 [Albert@Home] Sending scheduler request: To fetch work.
01-Mar-2012 08:39:39 [Albert@Home] Reporting 4 completed tasks, requesting new tasks for CPU
01-Mar-2012 08:39:48 [Albert@Home] Scheduler request completed: got 1 new tasks
01-Mar-2012 08:39:48 [Albert@Home] Resent lost task h1_0059.95_S6GC1__39_S6LV1A_1
01-Mar-2012 08:39:51 [Albert@Home] Starting task h1_0059.95_S6GC1__39_S6LV1A_1 using einstein_S6LV1 version 110 (SSE2) in slot 10
01-Mar-2012 08:39:52 [Albert@Home] Computation for task h1_0059.95_S6GC1__39_S6LV1A_1 finished
01-Mar-2012 08:39:52 [Albert@Home] Output file h1_0059.95_S6GC1__39_S6LV1A_1_0 for task h1_0059.95_S6GC1__39_S6LV1A_1 absent
01-Mar-2012 08:41:39 [Albert@Home] Sending scheduler request: To fetch work.
01-Mar-2012 08:41:39 [Albert@Home] Reporting 1 completed tasks, requesting new tasks for CPU
01-Mar-2012 08:41:41 [Albert@Home] Scheduler request completed: got 4 new tasks
01-Mar-2012 08:41:43 [Albert@Home] Starting task h1_0059.95_S6GC1__35_S6LV1A_1 using einstein_S6LV1 version 110 (SSE2) in slot 10
01-Mar-2012 08:41:43 [Albert@Home] Starting task h1_0059.95_S6GC1__33_S6LV1A_1 using einstein_S6LV1 version 110 (SSE2) in slot 11
01-Mar-2012 08:41:43 [Albert@Home] Starting task h1_0059.95_S6GC1__34_S6LV1A_1 using einstein_S6LV1 version 110 (SSE2) in slot 12
01-Mar-2012 08:41:44 [Albert@Home] Computation for task h1_0059.95_S6GC1__35_S6LV1A_1 finished
01-Mar-2012 08:41:44 [Albert@Home] Output file h1_0059.95_S6GC1__35_S6LV1A_1_0 for task h1_0059.95_S6GC1__35_S6LV1A_1 absent
01-Mar-2012 08:41:44 [Albert@Home] Starting task h1_0059.95_S6GC1__36_S6LV1A_1 using einstein_S6LV1 version 110 (SSE2) in slot 10
01-Mar-2012 08:41:45 [Albert@Home] Computation for task h1_0059.95_S6GC1__33_S6LV1A_1 finished
01-Mar-2012 08:41:45 [Albert@Home] Output file h1_0059.95_S6GC1__33_S6LV1A_1_0 for task h1_0059.95_S6GC1__33_S6LV1A_1 absent
01-Mar-2012 08:41:46 [Albert@Home] Computation for task h1_0059.95_S6GC1__34_S6LV1A_1 finished
01-Mar-2012 08:41:46 [Albert@Home] Output file h1_0059.95_S6GC1__34_S6LV1A_1_0 for task h1_0059.95_S6GC1__34_S6LV1A_1 absent
01-Mar-2012 08:41:47 [Albert@Home] Computation for task h1_0059.95_S6GC1__36_S6LV1A_1 finished
01-Mar-2012 08:41:47 [Albert@Home] Output file h1_0059.95_S6GC1__36_S6LV1A_1_0 for task h1_0059.95_S6GC1__36_S6LV1A_1 absent


4) Message boards : Problems and Bug Reports : [OpenCL] app v1.20/v1.21 feedback thread (Message 111857)
Posted 16 Feb 2012 by oz
Post:
After getting it running at all, I finished some atiOpenCL Tasks on two ATI cards (GPU0=Screen0,GPU1=Screen1(SingleDesktop) are both ATI 5700) simultaniously. The runtime of the task on GPU0 is using 3 times of the task on GPU1. After starting ubuntu with "unity2d" without 3D WindowManager both task are running fast. So I blamed compiz for the slow performance on GPU0.

I had no success in a non X setup. (I do not need X here)
Stopping lighdm so that all modules are loaded and DISPLAY (COMPUTE) variable set to ":0" atiOpenCL apps do not start on the commandline in a console.
5) Message boards : Problems and Bug Reports : [OpenCL] app v1.20/v1.21 feedback thread (Message 111805)
Posted 3 Feb 2012 by oz
Post:
Yes, i' am running Linux on my MacPro. (3.1)
So here we are:
BOINC 1GB
FGLRX 1GB
AMD-SDK 512MB
Card label, Apple 1GB

I installed the driver AFTER the AMD-SDK., so the openCL runtime libs maybe replaced by the driver installation.

And now linux kernel:

id:
display
description: VGA compatible controller
product: Juniper [Radeon HD 5700 Series]
vendor: ATI Technologies Inc
physical id:
0
bus info:
pci@0000:01:00.0
version: 00
width: 64 bits
clock: 33MHz
capabilities: pm pciexpress msi vga_controller bus_master cap_list rom
configuration:
driver = fglrx_pci
latency = 0
resources:
irq : 60
memory : 80000000-9fffffff
memory : c0b00000-c0b1ffff
ioport : 3000(size=256)
memory : c0b20000-c0b3ffff

Someone with power of 2 capability may calculate ram from here
6) Message boards : Problems and Bug Reports : [OpenCL] app v1.20/v1.21 feedback thread (Message 111803)
Posted 3 Feb 2012 by oz
Post:
According to apple specs it's a MacPro with the standard graphics card ATI Radeon 5770 1GB video mem (one auxilary power cable). Maybe AMD-APP-SDK limits to 50%?
7) Message boards : Problems and Bug Reports : [OpenCL] app v1.20/v1.21 feedback thread (Message 111801)
Posted 3 Feb 2012 by oz
Post:
Oops you're right, video memory is reported as 1024MB for both cards. With 817MB, 983MB available, but global memory for OpenCL is reported from AMD-APP-SDK as 512MB , but (strange) boinc says 817MB, 983MB available). Is there a tweak in OpenCL configuration? amdccle (Catalyst Control Center) says 1024MB Video Memory for both cards. Hmm...?
8) Message boards : Problems and Bug Reports : [OpenCL] app v1.20/v1.21 feedback thread (Message 111797)
Posted 2 Feb 2012 by oz
Post:
Have tried several Catalyst driver/AMD-APP-SDK/Boinc combinations.
OS = Ubuntu/oneiric, 2 x AMD 5770 (Juniper) Cards

ATI GPU 0: ATI Radeon HD 5700 series (Juniper) (CAL version 1.4.1664, 1024MB, 817MB available, 2720 GFLOPS peak)
ATI GPU 1: ATI Radeon HD 5700 series (Juniper) (CAL version 1.4.1664, 1024MB, 983MB available, 2720 GFLOPS peak)
02-Feb-2012 16:45:01 [---] OpenCL: ATI GPU 0: Juniper (driver version CAL 1.4.1664, device version OpenCL 1.1 AMD-APP (851.4), 512MB, 817MB available)
02-Feb-2012 16:45:01 [---] OpenCL: ATI GPU 1: Juniper (driver version CAL 1.4.1664, device version OpenCL 1.1 AMD-APP (851.4), 512MB, 983MB available)


Last combination is Catalyst 12.1, AMD-APP-SDK-v2.6, (without OpenCL v1.2 support) boinc 7.0.12. Results are like=>

http://albert.phys.uwm.edu/result.php?resultid=114016

clinfo reports:clinfo
Number of platforms:                             1
  Platform Profile:                              FULL_PROFILE
  Platform Version:                              OpenCL 1.1 AMD-APP (851.4)
  Platform Name:                                 AMD Accelerated Parallel Processing
  Platform Vendor:                               Advanced Micro Devices, Inc.
  Platform Extensions:                           cl_khr_icd cl_amd_event_callback cl_amd_offline_devices


  Platform Name:                                 AMD Accelerated Parallel Processing
Number of devices:                               3
  Device Type:                                   CL_DEVICE_TYPE_GPU
  Device ID:                                     4098
  Board name:                                    ATI Radeon HD 5700 Series
  Device Topology:                               PCI[ B#2, D#0, F#0 ]
  Max compute units:                             10
  Max work items dimensions:                     3
    Max work items[0]:                           256
    Max work items[1]:                           256
    Max work items[2]:                           256
  Max work group size:                           256
  Preferred vector width char:                   16
  Preferred vector width short:                  8
  Preferred vector width int:                    4
  Preferred vector width long:                   2
  Preferred vector width float:                  4
  Preferred vector width double:                 0
  Native vector width char:                      16
  Native vector width short:                     8
  Native vector width int:                       4
  Native vector width long:                      2
  Native vector width float:                     4
  Native vector width double:                    0
  Max clock frequency:                           0Mhz
  Address bits:                                  32
  Max memory allocation:                         134217728
  Image support:                                 Yes
  Max number of images read arguments:           128
  Max number of images write arguments:          8
  Max image 2D width:                            8192
  Max image 2D height:                           8192
  Max image 3D width:                            2048
  Max image 3D height:                           2048
  Max image 3D depth:                            2048
  Max samplers within kernel:                    16
  Max size of kernel argument:                   1024
  Alignment (bits) of base address:              2048
  Minimum alignment (bytes) for any datatype:    128
  Single precision floating point capability
    Denorms:                                     No
    Quiet NaNs:                                  Yes
    Round to nearest even:                       Yes
    Round to zero:                               Yes
    Round to +ve and infinity:                   Yes
    IEEE754-2008 fused multiply-add:             Yes
  Cache type:                                    None
  Cache line size:                               0
  Cache size:                                    0
  Global memory size:                            536870912
  Constant buffer size:                          65536
  Max number of constant args:                   8
  Local memory type:                             Scratchpad
  Local memory size:                             32768
  Kernel Preferred work group size multiple:     64
  Error correction support:                      0
  Unified memory for Host and Device:            0
  Profiling timer resolution:                    1
  Device endianess:                              Little
  Available:                                     Yes
  Compiler available:                            Yes
  Execution capabilities:                                
    Execute OpenCL kernels:                      Yes
    Execute native function:                     No
  Queue properties:                              
    Out-of-Order:                                No
    Profiling :                                  Yes
  Platform ID:                                   0x7fedcd03c100
  Name:                                          Juniper
  Vendor:                                        Advanced Micro Devices, Inc.
  Device OpenCL C version:                       OpenCL C 1.1 
  Driver version:                                CAL 1.4.1664
  Profile:                                       FULL_PROFILE
  Version:                                       OpenCL 1.1 AMD-APP (851.4)
  Extensions:                                    cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_atomic_counters_32 cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_popcnt 
9) Message boards : News : Sending work (Message 111660)
Posted 6 Jan 2012 by oz
Post:
Hi,

I also have aborted task due to
exceeded elapsed time limit 19036.53 (28000000.00G/1470.86G) problem
.

The GPU is in bad state with reboot required. All other downloaded openCL tasks are started by BOINC and immediately aborted with:

Output file p2030.20100913.G44.55+00.20.N.b6s0g0.00000_2424_1_3 for task p2030.20100913.G44.55+00.20.N.b6s0g0.00000_2424_1 absent


This is finished after reaching the daily quota of task
I successfully finished atiopenCL tasks with 50000s runtime.
System: Linux Ubuntu Oneiric
OpenCL: ATI GPU 0: Juniper (driver version CAL 1.4.1646, device version OpenCL 1.1 AMD-APP-SDK-v2.5 (684.213), 1024MB)
Catalyst 11.11
10) Message boards : Problems and Bug Reports : Running on ATI (Message 111595)
Posted 16 Dec 2011 by oz
Post:
Same Codebase different platform / Compiler .. ?

Linux: Intel(R) Xeon(R) CPU X5472 @ 3.00GHz [2] AMD ATI Radeon HD 5700 series (Juniper) (1024MB) driver: 1.4.1646 runtime 45000 - 61000 sec

Windows:Intel(R) Core(TM)2 Quad CPU Q6600 @ 2.40GHz [2] AMD ATI Radeon HD 5700 series (Juniper) (1024MB) driver: 1.4.1607 runtime 7100 - 8800 sec
hmm... .
I would expect ~ 8000sec for juniper radeon

The successors (cypress, cayman) looking good.






This material is based upon work supported by the National Science Foundation (NSF) under Grant PHY-0555655 and by the Max Planck Gesellschaft (MPG). Any opinions, findings, and conclusions or recommendations expressed in this material are those of the investigators and do not necessarily reflect the views of the NSF or the MPG.

Copyright © 2024 Bruce Allen for the LIGO Scientific Collaboration