13 Replies Latest reply on May 22, 2012 5:22 AM by jatinderkaur

    Executing OpenCL Samples

    jatinderkaur

      Hello Everyone

       

      I have installed AMD APP SDK on my system (i3 pro, AMD Radeon HD 6470 GPU) and visual studio 2010.

      I am trying to execute openCL Sample for Matrix application. It is not detecting GPU device. This is the output :

       

       

       

      Can anyone help please?

        • Re: Executing OpenCL Samples
          Wenju

          Hi,jatinderkaur

          Do you input some arguments when you running it,and have you ever modified the program before you running it? It runs well on my computer(i5, AMD Radeon HD 5650 GPU).Maybe you should debug the program,and you can find line 233 in "MatrixMultiplication.cpp",the message just from here,so you'd try!

          Thank you!

          • Re: Executing OpenCL Samples
            scharupa

            SDK 2..7 does not have GPU support. You need to install Catalyst -12.4.

             

            Also, pls check clinfo details. (C:/windows/system32/clinfo.exe)

              • Re: Executing OpenCL Samples
                jatinderkaur

                Thx for replies.

                 

                I have opened clinfo.exe. But on my system path of file is different. It is:

                C:\Program Files (x86)\AMD APP\bin\x86_64\clinfo.exe

                 

                But problem is when i open this exe file. It displays some information and closes very quickly.

                So I am unable to read it.

                  • Re: Executing OpenCL Samples
                    scharupa

                    Run the clinfo.exe from command window.

                     

                    Check for GPU details

                      • Re: Executing OpenCL Samples
                        jatinderkaur

                        On my system I have amd-graphic-driver (ATI Technologies Inc.  Driver Version 8.830.6.3000) . It doesn't work?

                        I have also run through cmd. Now output is:

                         

                        C:\Program Files (x86)\AMD APP\bin\x86_64>clinfo.exe

                        Number of platforms:                             2

                          Platform Profile:                              FULL_PROFILE

                          Platform Version:                              OpenCL 1.1 AMD-APP (831.4)

                          Platform Name:                                 AMD Accelerated Parallel Proces

                        sing

                          Platform Vendor:                               Advanced Micro Devices, Inc.

                          Platform Extensions:                           cl_khr_icd cl_amd_event_callbac

                        k cl_amd_offline_devices cl_khr_d3d10_sharing

                          Platform Profile:                              FULL_PROFILE

                          Platform Version:                              OpenCL 1.1

                          Platform Name:                                 Intel(R) OpenCL

                          Platform Vendor:                               Intel(R) Corporation

                          Platform Extensions:                           cl_khr_fp64 cl_khr_icd cl_khr_g

                        lobal_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32

                        _base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store

                        cl_intel_printf cl_ext_device_fission cl_intel_exec_by_local_thread cl_khr_gl_sh

                        aring cl_intel_dx9_media_sharing

                         

                         

                          Platform Name:                                 AMD Accelerated Parallel Proces

                        sing

                        Number of devices:                               1

                          Device Type:                                   CL_DEVICE_TYPE_CPU

                          Device ID:                                     4098

                          Board name:

                          Max compute units:                             4

                          Max work items dimensions:                     3

                            Max work items[0]:                           1024

                            Max work items[1]:                           1024

                            Max work items[2]:                           1024

                          Max work group size:                           1024

                          Preferred vector width char:                   16

                          Preferred vector width short:                  8

                          Preferred vector width int:                    4

                          Preferred vector width long:                   2

                          Preferred vector width float:                  4

                          Preferred vector width double:                 0

                          Native vector width char:                      16

                          Native vector width short:                     8

                          Native vector width int:                       4

                          Native vector width long:                      2

                          Native vector width float:                     4

                          Native vector width double:                    0

                          Max clock frequency:                           2195Mhz

                          Address bits:                                  64

                          Max memory allocation:                         2147483648

                          Image support:                                 Yes

                          Max number of images read arguments:           128

                          Max number of images write arguments:          8

                          Max image 2D width:                            8192

                          Max image 2D height:                           8192

                          Max image 3D width:                            2048

                          Max image 3D height:                           2048

                          Max image 3D depth:                            2048

                          Max samplers within kernel:                    16

                          Max size of kernel argument:                   4096

                          Alignment (bits) of base address:              1024

                          Minimum alignment (bytes) for any datatype:    128

                          Single precision floating point capability

                            Denorms:                                     Yes

                            Quiet NaNs:                                  Yes

                            Round to nearest even:                       Yes

                            Round to zero:                               Yes

                            Round to +ve and infinity:                   Yes

                            IEEE754-2008 fused multiply-add:             Yes

                          Cache type:                                    Read/Write

                          Cache line size:                               64

                          Cache size:                                    32768

                          Global memory size:                            4240293888

                          Constant buffer size:                          65536

                          Max number of constant args:                   8

                          Local memory type:                             Global

                          Local memory size:                             32768

                          Kernel Preferred work group size multiple:     1

                          Error correction support:                      0

                          Unified memory for Host and Device:            1

                          Profiling timer resolution:                    466

                          Device endianess:                              Little

                          Available:                                     Yes

                          Compiler available:                            Yes

                          Execution capabilities:

                            Execute OpenCL kernels:                      Yes

                            Execute native function:                     Yes

                          Queue properties:

                            Out-of-Order:                                No

                            Profiling :                                  Yes

                          Platform ID:                                   000007FEE702C9C8

                          Name:                                                 Intel(R) Core(TM) i3-233

                        0M CPU @ 2.20GHz

                          Vendor:                                        GenuineIntel

                          Device OpenCL C version:                       OpenCL C 1.1

                          Driver version:                                2.0

                          Profile:                                       FULL_PROFILE

                          Version:                                       OpenCL 1.1 AMD-APP (831.4)

                          Extensions:                                    cl_khr_fp64 cl_amd_fp64 cl_khr_

                        global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int3

                        2_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_

                        khr_int64_extended_atomics cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ex

                        t_device_fission cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_

                        media_ops cl_amd_popcnt cl_khr_d3d10_sharing

                         

                         

                          Platform Name:                                 Intel(R) OpenCL

                        Number of devices:                               1

                          Device Type:                                   CL_DEVICE_TYPE_CPU

                          Device ID:                                     32902

                          Max compute units:                             4

                          Max work items dimensions:                     3

                            Max work items[0]:                           1024

                            Max work items[1]:                           1024

                            Max work items[2]:                           1024

                          Max work group size:                           1024

                          Preferred vector width char:                   16

                          Preferred vector width short:                  8

                          Preferred vector width int:                    4

                          Preferred vector width long:                   2

                          Preferred vector width float:                  4

                          Preferred vector width double:                 2

                          Native vector width char:                      16

                          Native vector width short:                     8

                          Native vector width int:                       4

                          Native vector width long:                      2

                          Native vector width float:                     4

                          Native vector width double:                    2

                          Max clock frequency:                           2200Mhz

                          Address bits:                                  64

                          Max memory allocation:                         1060073472

                          Image support:                                 Yes

                          Max number of images read arguments:           480

                          Max number of images write arguments:          480

                          Max image 2D width:                            8192

                          Max image 2D height:                           8192

                          Max image 3D width:                            2048

                          Max image 3D height:                           2048

                          Max image 3D depth:                            2048

                          Max samplers within kernel:                    480

                          Max size of kernel argument:                   3840

                          Alignment (bits) of base address:              1024

                          Minimum alignment (bytes) for any datatype:    128

                          Single precision floating point capability

                            Denorms:                                     Yes

                            Quiet NaNs:                                  Yes

                            Round to nearest even:                       Yes

                            Round to zero:                               No

                            Round to +ve and infinity:                   No

                            IEEE754-2008 fused multiply-add:             No

                          Cache type:                                    Read/Write

                          Cache line size:                               64

                          Cache size:                                    262144

                          Global memory size:                            4240293888

                          Constant buffer size:                          131072

                          Max number of constant args:                   480

                          Local memory type:                             Global

                          Local memory size:                             32768

                          Kernel Preferred work group size multiple:     128

                          Error correction support:                      0

                          Unified memory for Host and Device:            1

                          Profiling timer resolution:                    466

                          Device endianess:                              Little

                          Available:                                     Yes

                          Compiler available:                            Yes

                          Execution capabilities:

                            Execute OpenCL kernels:                      Yes

                            Execute native function:                     Yes

                          Queue properties:

                            Out-of-Order:                                Yes

                            Profiling :                                  Yes

                          Platform ID:                                   0000000000716080

                          Name:                                                 Intel(R) Core(TM) i3-233

                        0M CPU @ 2.20GHz

                          Vendor:                                        Intel(R) Corporation

                          Device OpenCL C version:                       OpenCL C 1.1

                          Driver version:                                1.1

                          Profile:                                       FULL_PROFILE

                          Version:                                       OpenCL 1.1 (Build 31360.31441)

                          Extensions:                                    cl_khr_fp64 cl_khr_icd cl_khr_g

                        lobal_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32

                        _base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store

                        cl_intel_printf cl_ext_device_fission cl_intel_exec_by_local_thread cl_khr_gl_sh

                        aring cl_intel_dx9_media_sharing