5 Replies Latest reply on Jan 12, 2018 5:01 PM by mirh

    carrizo not support opencl2.0?

    mikewolf_gkd


      my HP elite g3 notebook with newest drivers, seems it is carrizo displaycard.

       

      I run opencl 2.0 sampels in  amd app sdk samples.

      it gave me inforamtion:

       

      carrizo Device is xxxxx

      Unsupported device! required CL_DEVICE_OPENCL_C+VERSION 2.0 or higher.

       

      I think carrizo is same arch with fiji, no reason not support opencl2.0, why know reasons?

       

       

      thanks,

        • Re: carrizo not support opencl2.0?
          dipak

          Please make sure that you've installed the latest driver from here: Download Drivers . After driver installation, please run "clinfo" command and check  "Device OpenCL C version" parameter in clinfo output. If the value is "OpenCL C 2.0", then OpenCL 2.0 samples will work, otherwise not.

            • Re: carrizo not support opencl2.0?
              mikewolf_gkd

              hi, dipak

               

              thanks quick reply.

               

              actually, my driver is newest version 17.12.1

               

              from clinfo, seems gpu support opencl2.0, but from app sdk opencl2.0 sample(simplepipe),

              I found below lines, its return deviceVersion is 0x013615b8 "OpenCL 1.2 AMD-APP (2527.7)", very strange, seems driver some issues.

              CLUitl.hpp:2356

                          status = clGetDeviceInfo(

                                       deviceId,

                                       CL_DEVICE_VERSION,

                                       sizeof(char) * tempSize,

                                       deviceVersion,

                                       NULL);

               

               

              Number of platforms:1
                Platform Profile:FULL_PROFILE
                Platform Version:OpenCL 2.1 AMD-APP (2527.7)
                Platform Name:AMD Accelerated Parallel Processing
                Platform Vendor:Advanced Micro Devices, Inc.
                Platform Extensions:cl_khr_icd cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_amd_event_callback cl_amd_offline_devices

               

               

              detail clinfo

               

                Platform Name:AMD Accelerated Parallel Processing
              Number of devices:2
                Device Type:CL_DEVICE_TYPE_GPU
                Vendor ID:1002h
                Board name:AMD Radeon R7 Graphics
                Device Topology:PCI[ B#0, D#0, F#1 ]
                Max compute units:8
                Max work items dimensions:3
                  Max work items[0]:1024
                  Max work items[1]:1024
                  Max work items[2]:1024
                Max work group size:256
                Preferred vector width char:4
                Preferred vector width short:2
                Preferred vector width int:1
                Preferred vector width long:1
                Preferred vector width float:1
                Preferred vector width double:1
                Native vector width char:4
                Native vector width short:2
                Native vector width int:1
                Native vector width long:1
                Native vector width float:1
                Native vector width double:1
                Max clock frequency:720Mhz
                Address bits:64
                Max memory allocation:1368352358
                Image support:Yes
                Max number of images read arguments:128
                Max number of images write arguments:64
                Max image 2D width:16384
                Max image 2D height:16384
                Max image 3D width:2048
                Max image 3D height:2048
                Max image 3D depth:2048
                Max samplers within kernel:16
                Max size of kernel argument:1024
                Alignment (bits) of base address:2048
                Minimum alignment (bytes) for any datatype:128

                Single precision floating point capability

                  Denorms:No
                  Quiet NaNs:Yes
                  Round to nearest even:Yes
                  Round to zero:Yes
                  Round to +ve and infinity:Yes
                  IEEE754-2008 fused multiply-add:Yes
                Cache type:Read/Write
                Cache line size:64
                Cache size:16384
                Global memory size:2146697216
                Constant buffer size:1368352358
                Max number of constant args:8
                Local memory type:Scratchpad
                Local memory size:32768
                Max pipe arguments:16
                Max pipe active reservations:16
                Max pipe packet size:1368352358
                Max global variable size:1231516928
                Max global variable preferred total size:2146697216
                Max read/write image args:64
                Max on device events:1024
                Queue on device max size:8388608
                Max on device queues:1
                Queue on device preferred size:262144
                SVM capabilities:
                  Coarse grain buffer:Yes
                  Fine grain buffer:Yes
                  Fine grain system:Yes
                  Atomics:Yes
                Preferred platform atomic alignment:0
                Preferred global atomic alignment:0
                Preferred local atomic alignment:0
                Kernel Preferred work group size multiple:64
                Error correction support:0
                Unified memory for Host and Device:1
                Profiling timer resolution:1
                Device endianess:Little
                Available:Yes
                Compiler available:Yes
                Execution capabilities:
                  Execute OpenCL kernels:Yes
                  Execute native function:No
                Queue on Host properties:
                  Out-of-Order:No
                  Profiling :Yes
                Queue on Device properties:
                  Out-of-Order:Yes
                  Profiling :Yes
                Platform ID:00007FFAFC334AB0
                Name:Carrizo
                Vendor:Advanced Micro Devices, Inc.
                Device OpenCL C version:OpenCL C 2.0
                Driver version:2527.7
                Profile:FULL_PROFILE
                Version:OpenCL 2.0 AMD-APP (2527.7)
                Extensions:cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_fp16 cl_khr_gl_sharing cl_khr_gl_depth_images cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_d3d11_sharing cl_khr_dx9_media_sharing cl_khr_image2d_from_buffer cl_khr_spir cl_khr_subgroups cl_khr_gl_event cl_khr_depth_images cl_khr_mipmap_image cl_khr_mipmap_image_writes cl_amd_liquid_flash

               

               

               

               

                Device Type:CL_DEVICE_TYPE_CPU
                Vendor ID:1002h
                Board name:
                Max compute units:4
                Max work items dimensions:3
                  Max work items[0]:1024
                  Max work items[1]:1024
                  Max work items[2]:1024
                Max work group size:1024
                Preferred vector width char:16
                Preferred vector width short:8
                Preferred vector width int:4
                Preferred vector width long:2
                Preferred vector width float:8
                Preferred vector width double:4
                Native vector width char:16
                Native vector width short:8
                Native vector width int:4
                Native vector width long:2
                Native vector width float:8
                Native vector width double:4
                Max clock frequency:2096Mhz
                Address bits:64
                Max memory allocation:2147483648
                Image support:Yes
                Max number of images read arguments:128
                Max number of images write arguments:64
                Max image 2D width:8192
                Max image 2D height:8192
                Max image 3D width:2048
                Max image 3D height:2048
                Max image 3D depth:2048
                Max samplers within kernel:16
                Max size of kernel argument:4096
                Alignment (bits) of base address:1024
                Minimum alignment (bytes) for any datatype:128

                Single precision floating point capability

                  Denorms:Yes
                  Quiet NaNs:Yes
                  Round to nearest even:Yes
                  Round to zero:Yes
                  Round to +ve and infinity:Yes
                  IEEE754-2008 fused multiply-add:Yes
                Cache type:Read/Write
                Cache line size:64
                Cache size:32768
                Global memory size:7963320320
                Constant buffer size:65536
                Max number of constant args:8
                Local memory type:Global
                Local memory size:32768
                Max pipe arguments:16
                Max pipe active reservations:16
                Max pipe packet size:2147483648
                Max global variable size:1879048192
                Max global variable preferred total size:1879048192
                Max read/write image args:64
                Max on device events:0
                Queue on device max size:0
                Max on device queues:0
                Queue on device preferred size:0
                SVM capabilities:
                  Coarse grain buffer:No
                  Fine grain buffer:No
                  Fine grain system:No
                  Atomics:No
                Preferred platform atomic alignment:0
                Preferred global atomic alignment:0
                Preferred local atomic alignment:0
                Kernel Preferred work group size multiple:1
                Error correction support:0
                Unified memory for Host and Device:1
                Profiling timer resolution:488
                Device endianess:Little
                Available:Yes
                Compiler available:Yes
                Execution capabilities:
                  Execute OpenCL kernels:Yes
                  Execute native function:Yes
                Queue on Host properties:
                  Out-of-Order:No
                  Profiling :Yes
                Queue on Device properties:
                  Out-of-Order:No
                  Profiling :No
                Platform ID:00007FFAFC334AB0
                Name:AMD PRO A12-8800B R7, 12 Compute Cores 4C+8G 
                Vendor:AuthenticAMD
                Device OpenCL C version:OpenCL C 1.2
                Driver version:2527.7 (sse2,avx,fma4)
                Profile:FULL_PROFILE
                Version:OpenCL 1.2 AMD-APP (2527.7)
                Extensions:cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_device_fission cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops cl_amd_media_ops2 cl_amd_popcnt cl_khr_d3d10_sharing cl_khr_spir cl_khr_gl_event