6 Replies Latest reply on Mar 13, 2012 10:48 AM by grabner

    What does this mean for OpenCL on CPU "Internal Error:  as failed"

    zhuzxy

       I am using OpenCL on the CPU. Yesterday I could run my CL code on the CPU, but today I cannot; the error message is "Internal Error:  as failed". What does this mean?

       The only difference I can think of is that I updated the CAL software today. Does this matter?

      My SDK version is 2.5, the platform is an A8-3850, and I am using the integrated GPU. The graphics driver version information follows.

      The current software information is:

      Driver Packaging Version 8.881-110728a-124445C-ATI 
      Catalyst Version 11.8 
      Provider ATI Technologies Inc. 
      2D Driver Version 8.01.01.1178 
      Direct3D Version 7.14.10.0855 
      OpenGL Version 6.14.10.11005 
      AMD VISION Engine Control Center Version 2011.0728.1756.30366 
       

      The original information (yesterday):

      driver version: 8.871-110627a-121835E-ATI 
      Catalyst version 11.7 
      2D driver 8.01.01.1171 
      Direct3D version 7.14.10.0847 
      OpenGL version 6.14.10.10907 
      AMD VISION Engine Control Center version 2011.0627.1558.26805 

        • What does this mean for OpenCL on CPU "Internal Error:  as failed"
          genaganna

           


          It looks like you have installed the OpenCL runtime packages again from the driver. Can you run CLInfo and paste the log here?

          I am not sure how this was affected. Did you stop the installation partway through?

          Removing the runtime package and reinstalling it could work.

            • What does this mean for OpenCL on CPU "Internal Error:  as failed"
              zhuzxy

              I did remove the 'SDK runtime' from the install manager and installed it again. I still get the error message when running the sample app 'bitonicsort' for OpenCL on the CPU. The error message is the same; building the program failed.

              Can anyone help with a solution?

              My clinfo result is:

              Number of platforms:                             1
                Platform Profile:                              FULL_PROFILE
                Platform Version:                              OpenCL 1.1 AMD-APP-SDK-v2.5 (709.2)
                Platform Name:                                 AMD Accelerated Parallel Processing
                Platform Vendor:                               Advanced Micro Devices, Inc.
                Platform Extensions:                           cl_khr_icd cl_amd_event_callback cl_amd_offline_devices cl_khr_d3d10_sharing


                Platform Name:                                 AMD Accelerated Parallel Processing
              Number of devices:                               2
                Device Type:                                   CL_DEVICE_TYPE_GPU
                Device ID:                                     4098
                Max compute units:                             5
                Max work items dimensions:                     3
                  Max work items[0]:                           256
                  Max work items[1]:                           256
                  Max work items[2]:                           256
                Max work group size:                           256
                Preferred vector width char:                   16
                Preferred vector width short:                  8
                Preferred vector width int:                    4
                Preferred vector width long:                   2
                Preferred vector width float:                  4
                Preferred vector width double:                 0
                Native vector width char:                      16
                Native vector width short:                     8
                Native vector width int:                       4
                Native vector width long:                      2
                Native vector width float:                     4
                Native vector width double:                    0
                Max clock frequency:                           600Mhz
                Address bits:                                  32
                Max memory allocation:                         536870912
                Image support:                                 Yes
                Max number of images read arguments:           128
                Max number of images write arguments:          8
                Max image 2D width:                            8192
                Max image 2D height:                           8192
                Max image 3D width:                            2048
                Max image 3D height:                           2048
                Max image 3D depth:                            2048
                Max samplers within kernel:                    16
                Max size of kernel argument:                   1024
                Alignment (bits) of base address:              32768
                Minimum alignment (bytes) for any datatype:    128
                Single precision floating point capability
                  Denorms:                                     No
                  Quiet NaNs:                                  Yes
                  Round to nearest even:                       Yes
                  Round to zero:                               Yes
                  Round to +ve and infinity:                   Yes
                  IEEE754-2008 fused multiply-add:             Yes
                Cache type:                                    None
                Cache line size:                               0
                Cache size:                                    0
                Global memory size:                            536870912
                Constant buffer size:                          65536
                Max number of constant args:                   8
                Local memory type:                             Scratchpad
                Local memory size:                             32768
                Kernel Preferred work group size multiple:     64
                Error correction support:                      0
                Unified memory for Host and Device:            1
                Profiling timer resolution:                    1
                Device endianess:                              Little
                Available:                                     Yes
                Compiler available:                            Yes
                Execution capabilities:
                  Execute OpenCL kernels:                      Yes
                  Execute native function:                     No
                Queue properties:
                  Out-of-Order:                                No
                  Profiling :                                  Yes
                Platform ID:                                   000007FEE6E0D348
                Name:                                          BeaverCreek
                Vendor:                                        Advanced Micro Devices, Inc.
                Device OpenCL C version:                       OpenCL C 1.1
                Driver version:                                CAL 1.4.1457 (VM)
                Profile:                                       FULL_PROFILE
                Version:                                       OpenCL 1.1 AMD-APP-SDK-v2.5 (709.2)
                Extensions:                                    cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_popcnt cl_khr_d3d10_sharing


                Device Type:                                   CL_DEVICE_TYPE_CPU
                Device ID:                                     4098
                Max compute units:                             4
                Max work items dimensions:                     3
                  Max work items[0]:                           1024
                  Max work items[1]:                           1024
                  Max work items[2]:                           1024
                Max work group size:                           1024
                Preferred vector width char:                   16
                Preferred vector width short:                  8
                Preferred vector width int:                    4
                Preferred vector width long:                   2
                Preferred vector width float:                  4
                Preferred vector width double:                 0
                Native vector width char:                      16
                Native vector width short:                     8
                Native vector width int:                       4
                Native vector width long:                      2
                Native vector width float:                     4
                Native vector width double:                    0
                Max clock frequency:                           2899Mhz
                Address bits:                                  64
                Max memory allocation:                         2147483648
                Image support:                                 Yes
                Max number of images read arguments:           128
                Max number of images write arguments:          8
                Max image 2D width:                            8192
                Max image 2D height:                           8192
                Max image 3D width:                            2048
                Max image 3D height:                           2048
                Max image 3D depth:                            2048
                Max samplers within kernel:                    16
                Max size of kernel argument:                   4096
                Alignment (bits) of base address:              1024
                Minimum alignment (bytes) for any datatype:    128
                Single precision floating point capability
                  Denorms:                                     Yes
                  Quiet NaNs:                                  Yes
                  Round to nearest even:                       Yes
                  Round to zero:                               Yes
                  Round to +ve and infinity:                   Yes
                  IEEE754-2008 fused multiply-add:             No
                Cache type:                                    Read/Write
                Cache line size:                               64
                Cache size:                                    65536
                Global memory size:                            8050302976
                Constant buffer size:                          65536
                Max number of constant args:                   8
                Local memory type:                             Global
                Local memory size:                             32768
                Kernel Preferred work group size multiple:     1
                Error correction support:                      0
                Unified memory for Host and Device:            1
                Profiling timer resolution:                    353
                Device endianess:                              Little
                Available:                                     Yes
                Compiler available:                            Yes
                Execution capabilities:
                  Execute OpenCL kernels:                      Yes
                  Execute native function:                     Yes
                Queue properties:
                  Out-of-Order:                                No
                  Profiling :                                  Yes
                Platform ID:                                   000007FEE6E0D348
                Name:                                          AMD A8-3850 APU with Radeon(tm) HD Graphics
                Vendor:                                        AuthenticAMD
                Device OpenCL C version:                       OpenCL C 1.1
                Driver version:                                2.0
                Profile:                                       FULL_PROFILE
                Version:                                       OpenCL 1.1 AMD-APP-SDK-v2.5 (709.2)
                Extensions:                                    cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_device_fission cl_amd_device_attribute_query cl_amd_vec3 cl_amd_media_ops cl_amd_popcnt cl_amd_printf cl_khr_d3d10_sharing

               

               

                • What does this mean for OpenCL on CPU "Internal Error:  as failed"
                  genaganna

                   


                  Please install the runtime package from the SDK package and see whether it works.

                  At present you have the runtime package from the driver installed.

                    • What does this mean for OpenCL on CPU "Internal Error:  as failed"
                      zhuzxy

                      What do you mean by 'installed runtime package from driver'? I installed it from the install package, which first extracts its files to directories such as c:\ati or c:\amd.

                      I have also tried removing the ATI components and reinstalling from my old package. The error still occurs with the same message.

                      My steps were as follows:

                      Run the AMD Catalyst Install Manager and remove these components: AMD APP SDK Developer, AMD APP SDK Runtime, AMD APP Profiler 2.3, AMD APP KernelAnalyzer 1.9, AMD APP SDK Samples.

                      Rename the c:\ati and c:\amd directories to other names.

                      Run AMD's driver install file 11-7_vista64_win7_64_dd_cc_ocl.exe.

                      Run the SDK install file adm-app-sdk-v2.5-rc2-windows-64.

                      Then test the AMD SDK sample project with OpenCL running on the CPU. I got the error message "Internal Error:  as failed".

                       

                      Can anyone tell me how to reinstall the whole thing completely? My guess is that when I use the AMD Catalyst Install Manager to remove the old OpenCL-related components, it keeps some old files on the system, so I cannot do a completely clean installation.

                • What does this mean for OpenCL on CPU "Internal Error:  as failed"
                  twentz

                  This may be EXACTLY the problem I'm having. I can't run OpenCL code on my CPU (A8-3850).

                  If it worked with an older version, it might be just that: the new version might be broken.

                    • Re: What does this mean for OpenCL on CPU "Internal Error:  as failed"
                      grabner

                      I just had the same problem and found a solution, so if you are still interested, here it is. For some reason, the "PATH" environment variable was not updated properly during installation of the AMD APP SDK. You can verify this by opening a command prompt and typing the command "amdocl_as.exe --version". If the PATH is correct, the output of this command will tell you that AMD packaged the GNU assembler with their software ;-); otherwise you will just get an error message saying that the command could not be found. On my system, "amdocl_as.exe" is installed in the directory "C:\Program Files (x86)\AMD APP\bin\x86_64"; adding this directory to the PATH environment variable fixed the problem for me. BTW, the same problem occurs immediately after installing the AMD APP SDK if you run OpenCL programs from a command prompt that was opened before the installation, because the environment of a running prompt is not automatically updated when an external process (e.g., a software installer) modifies it.

                       

                      Kind regards,

                      Markus
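
For anyone who wants to script the PATH check described in the post above, here is a minimal sketch in Python. It only assumes what the post states: the tool is called "amdocl_as.exe" and, on that poster's machine, lives in "C:\Program Files (x86)\AMD APP\bin\x86_64" (your install directory may differ). The helper names locate_tool and path_with_dir are made up for this illustration and are not part of any AMD tool.

```python
import os
import shutil

# Tool name and directory come from the post above; the directory is
# specific to that poster's system and is only an example here.
ASSEMBLER = "amdocl_as.exe"
EXAMPLE_BIN_DIR = r"C:\Program Files (x86)\AMD APP\bin\x86_64"


def locate_tool(name, search_path=None):
    """Return the full path of `name` if it is on the search path, else None.

    With search_path=None, shutil.which uses the PATH of the current process.
    """
    return shutil.which(name, path=search_path)


def path_with_dir(directory, search_path=None):
    """Return a PATH string with `directory` appended unless already listed.

    Hypothetical helper: uses a plain exact-string comparison, so it does not
    detect the same directory written with different case or a trailing slash.
    """
    if search_path is None:
        search_path = os.environ.get("PATH", "")
    entries = [e for e in search_path.split(os.pathsep) if e]
    if directory not in entries:
        entries.append(directory)
    return os.pathsep.join(entries)


if __name__ == "__main__":
    found = locate_tool(ASSEMBLER)
    if found:
        print(f"{ASSEMBLER} found at {found}")
    else:
        print(f"{ASSEMBLER} is not on PATH; a PATH including the example directory would be:")
        print(path_with_dir(EXAMPLE_BIN_DIR))
```

The script only reports what PATH would need to look like; it does not change the system setting. As noted in the post, a prompt that was already open keeps its old environment, so run the check from a newly opened command prompt after changing PATH.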