cancel
Showing results for 
Search instead for 
Did you mean: 

Archives Discussions

jatinderkaur
Adept I

Executing OpenCL Samples

Jump to solution

Hello Everyone

I have installed AMD APP SDK on my system (i3 pro, AMD Radeon HD 6470 GPU) and visual studio 2010.

I am trying to execute openCL Sample for Matrix application. It is not detecting GPU device. This is the output :

pastedImage_0.png

Can anyone help please?

0 Likes
1 Solution

Accepted Solutions
jatinderkaur
Adept I

Re: Executing OpenCL Samples

Jump to solution

Thanks all. I have installed ATI driver 8.882 from hp website. Now clinfo has detected my GPU device and o/p is:

pastedImage_0.png

View solution in original post

0 Likes
13 Replies
Wenju
Elite

Re: Executing OpenCL Samples

Jump to solution

Hi,jatinderkaur

Do you input some arguments when you running it,and have you ever modified the program before you running it? It runs well on my computer(i5, AMD Radeon HD 5650 GPU).Maybe you should debug the program,and you can find line 233 in "MatrixMultiplication.cpp",the message just from here,so you'd try!

Thank you!

0 Likes
scharupa
Staff
Staff

Re: Executing OpenCL Samples

Jump to solution

SDK 2..7 does not have GPU support. You need to install Catalyst -12.4.

Also, pls check clinfo details. (C:/windows/system32/clinfo.exe)

0 Likes
jatinderkaur
Adept I

Re: Executing OpenCL Samples

Jump to solution

Thx for replies.

I have opened clinfo.exe. But on my system path of file is different. It is:

C:\Program Files (x86)\AMD APP\bin\x86_64\clinfo.exe

But problem is when i open this exe file. It displays some information and closes very quickly.

So I am unable to read it.

0 Likes
scharupa
Staff
Staff

Re: Executing OpenCL Samples

Jump to solution

Run the clinfo.exe from command window.

Check for GPU details

0 Likes
jatinderkaur
Adept I

Re: Executing OpenCL Samples

Jump to solution

On my system I have amd-graphic-driver (ATI Technologies Inc.  Driver Version 8.830.6.3000) . It doesn't work?

I have also run through cmd. Now output is:

C:\Program Files (x86)\AMD APP\bin\x86_64>clinfo.exe

Number of platforms:                             2

  Platform Profile:                              FULL_PROFILE

  Platform Version:                              OpenCL 1.1 AMD-APP (831.4)

  Platform Name:                                 AMD Accelerated Parallel Proces

sing

  Platform Vendor:                               Advanced Micro Devices, Inc.

  Platform Extensions:                           cl_khr_icd cl_amd_event_callbac

k cl_amd_offline_devices cl_khr_d3d10_sharing

  Platform Profile:                              FULL_PROFILE

  Platform Version:                              OpenCL 1.1

  Platform Name:                                 Intel(R) OpenCL

  Platform Vendor:                               Intel(R) Corporation

  Platform Extensions:                           cl_khr_fp64 cl_khr_icd cl_khr_g

lobal_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32

_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store

cl_intel_printf cl_ext_device_fission cl_intel_exec_by_local_thread cl_khr_gl_sh

aring cl_intel_dx9_media_sharing

  Platform Name:                                 AMD Accelerated Parallel Proces

sing

Number of devices:                               1

  Device Type:                                   CL_DEVICE_TYPE_CPU

  Device ID:                                     4098

  Board name:

  Max compute units:                             4

  Max work items dimensions:                     3

    Max work items[0]:                           1024

    Max work items[1]:                           1024

    Max work items[2]:                           1024

  Max work group size:                           1024

  Preferred vector width char:                   16

  Preferred vector width short:                  8

  Preferred vector width int:                    4

  Preferred vector width long:                   2

  Preferred vector width float:                  4

  Preferred vector width double:                 0

  Native vector width char:                      16

  Native vector width short:                     8

  Native vector width int:                       4

  Native vector width long:                      2

  Native vector width float:                     4

  Native vector width double:                    0

  Max clock frequency:                           2195Mhz

  Address bits:                                  64

  Max memory allocation:                         2147483648

  Image support:                                 Yes

  Max number of images read arguments:           128

  Max number of images write arguments:          8

  Max image 2D width:                            8192

  Max image 2D height:                           8192

  Max image 3D width:                            2048

  Max image 3D height:                           2048

  Max image 3D depth:                            2048

  Max samplers within kernel:                    16

  Max size of kernel argument:                   4096

  Alignment (bits) of base address:              1024

  Minimum alignment (bytes) for any datatype:    128

  Single precision floating point capability

    Denorms:                                     Yes

    Quiet NaNs:                                  Yes

    Round to nearest even:                       Yes

    Round to zero:                               Yes

    Round to +ve and infinity:                   Yes

    IEEE754-2008 fused multiply-add:             Yes

  Cache type:                                    Read/Write

  Cache line size:                               64

  Cache size:                                    32768

  Global memory size:                            4240293888

  Constant buffer size:                          65536

  Max number of constant args:                   8

  Local memory type:                             Global

  Local memory size:                             32768

  Kernel Preferred work group size multiple:     1

  Error correction support:                      0

  Unified memory for Host and Device:            1

  Profiling timer resolution:                    466

  Device endianess:                              Little

  Available:                                     Yes

  Compiler available:                            Yes

  Execution capabilities:

    Execute OpenCL kernels:                      Yes

    Execute native function:                     Yes

  Queue properties:

    Out-of-Order:                                No

    Profiling :                                  Yes

  Platform ID:                                   000007FEE702C9C8

  Name:                                                 Intel(R) Core(TM) i3-233

0M CPU @ 2.20GHz

  Vendor:                                        GenuineIntel

  Device OpenCL C version:                       OpenCL C 1.1

  Driver version:                                2.0

  Profile:                                       FULL_PROFILE

  Version:                                       OpenCL 1.1 AMD-APP (831.4)

  Extensions:                                    cl_khr_fp64 cl_amd_fp64 cl_khr_

global_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int3

2_base_atomics cl_khr_local_int32_extended_atomics cl_khr_int64_base_atomics cl_

khr_int64_extended_atomics cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ex

t_device_fission cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_

media_ops cl_amd_popcnt cl_khr_d3d10_sharing

  Platform Name:                                 Intel(R) OpenCL

Number of devices:                               1

  Device Type:                                   CL_DEVICE_TYPE_CPU

  Device ID:                                     32902

  Max compute units:                             4

  Max work items dimensions:                     3

    Max work items[0]:                           1024

    Max work items[1]:                           1024

    Max work items[2]:                           1024

  Max work group size:                           1024

  Preferred vector width char:                   16

  Preferred vector width short:                  8

  Preferred vector width int:                    4

  Preferred vector width long:                   2

  Preferred vector width float:                  4

  Preferred vector width double:                 2

  Native vector width char:                      16

  Native vector width short:                     8

  Native vector width int:                       4

  Native vector width long:                      2

  Native vector width float:                     4

  Native vector width double:                    2

  Max clock frequency:                           2200Mhz

  Address bits:                                  64

  Max memory allocation:                         1060073472

  Image support:                                 Yes

  Max number of images read arguments:           480

  Max number of images write arguments:          480

  Max image 2D width:                            8192

  Max image 2D height:                           8192

  Max image 3D width:                            2048

  Max image 3D height:                           2048

  Max image 3D depth:                            2048

  Max samplers within kernel:                    480

  Max size of kernel argument:                   3840

  Alignment (bits) of base address:              1024

  Minimum alignment (bytes) for any datatype:    128

  Single precision floating point capability

    Denorms:                                     Yes

    Quiet NaNs:                                  Yes

    Round to nearest even:                       Yes

    Round to zero:                               No

    Round to +ve and infinity:                   No

    IEEE754-2008 fused multiply-add:             No

  Cache type:                                    Read/Write

  Cache line size:                               64

  Cache size:                                    262144

  Global memory size:                            4240293888

  Constant buffer size:                          131072

  Max number of constant args:                   480

  Local memory type:                             Global

  Local memory size:                             32768

  Kernel Preferred work group size multiple:     128

  Error correction support:                      0

  Unified memory for Host and Device:            1

  Profiling timer resolution:                    466

  Device endianess:                              Little

  Available:                                     Yes

  Compiler available:                            Yes

  Execution capabilities:

    Execute OpenCL kernels:                      Yes

    Execute native function:                     Yes

  Queue properties:

    Out-of-Order:                                Yes

    Profiling :                                  Yes

  Platform ID:                                   0000000000716080

  Name:                                                 Intel(R) Core(TM) i3-233

0M CPU @ 2.20GHz

  Vendor:                                        Intel(R) Corporation

  Device OpenCL C version:                       OpenCL C 1.1

  Driver version:                                1.1

  Profile:                                       FULL_PROFILE

  Version:                                       OpenCL 1.1 (Build 31360.31441)

  Extensions:                                    cl_khr_fp64 cl_khr_icd cl_khr_g

lobal_int32_base_atomics cl_khr_global_int32_extended_atomics cl_khr_local_int32

_base_atomics cl_khr_local_int32_extended_atomics cl_khr_byte_addressable_store

cl_intel_printf cl_ext_device_fission cl_intel_exec_by_local_thread cl_khr_gl_sh

aring cl_intel_dx9_media_sharing

0 Likes
scharupa
Staff
Staff

Re: Executing OpenCL Samples

Jump to solution

Looks to be you have installed both AMD APP and Intel OpenCL packages on the machine. AMD opencl files might be replaced with Intel OpenCL binaries. Intel OpenCL does not support GPU.

And ATI Driver Version 8.830 was an out dated driver(its catalyst 11.03).

Install Catalyst-12.4 on you machine then run the sample.

jatinderkaur
Adept I

Re: Executing OpenCL Samples

Jump to solution

Ok Thx. I will install catalyst-12.4 . And do i need to uninstall Intel OpenCL also to run samples or there is an alternative?

0 Likes
scharupa
Staff
Staff

Re: Executing OpenCL Samples

Jump to solution

No need.

0 Likes
jatinderkaur
Adept I

Re: Executing OpenCL Samples

Jump to solution

Ok I have tried updating from :

Control Panel > Graphic Adapters > Radeon HD 6470 > Driver > Update Driver

>Search Automatically for update driver

Result was:

The best driver s/w for your hardware is already installed. Windows has determined that your driver is uptodate.

IInd Method Tried:

Used AMD Driver Autodetect : http://support.amd.com/us/gpudownload/windows/Pages/auto_detect.aspx

It has again given h/w compatibility issues. And

result was:

pastedImage_0.png

0 Likes