5 Replies Latest reply on Dec 16, 2014 7:34 AM by humasama

    FineGrainSVM fails on Kaveri

    humasama

      I have installed omega driver and SDK3.0(platform: ubuntu 14.10 && Kaveri), but when I try the example FineGrainSVM, it reports that "Unsupported device! Device does not support SVM Atomics". Then I use clinfo, it says "Fine grain buffer: yes, Fine grain system: no, Atomics: no".

      Does it mean that Kaveri doesn't support atomic operations? If so, which platform will support atomic operations? I'm so confused...

       

      Any help will be greatly appreciated.

        • Re: FineGrainSVM fails on Kaveri
          dipak

          Currently, fine grained SVM with atomics is only supported on AMD Kaveri APU on Linux 64-bit OS. Are you using 64bit OS? Please share your setup details and clinfo output.

            • Re: FineGrainSVM fails on Kaveri
              humasama

              Thank for your reply~

               

              1. setup information:

              OS: Linux 3.2.0-54-generic x86_64 Ubuntu14.10

              X.org: 1.16.0

              glibc: 2.19

               

              2. clinfo

               

              Number of platforms: 1

              Platform Profile: FULL_PROFILE

              Platform Version: OpenCL 2.0 AMD-APP (1642.5)

              Platform Name: AMD Accelerated Parallel Processing

              Platform Vendor: Advanced Micro Devices, Inc.

              Platform Extensions: cl_khr_icd cl_amd_event_callback cl_amd_offline_devices

              Platform Name: AMD Accelerated Parallel Processing

              Number of devices: 2

               

               

              Device Type: CL_DEVICE_TYPE_GPU

              Vendor ID:1002h

              Board name:AMD Radeon(TM) R7 Graphics

              Device Topology: PCI[ B#0, D#1, F#0 ]

              Max compute units:8

              Max work items dimensions:3

              Max work items[0]:256

              Max work items[1]:256

              Max work items[2]:256

              Max work group size:256

              Preferred vector width char:4

              Preferred vector width short:2

              Preferred vector width int:1

              Preferred vector width long:1

              Preferred vector width float:1

              Preferred vector width double:1

              Native vector width char:4

              Native vector width short:2

              Native vector width int:1

              Native vector width long:1

              Native vector width float:1

              Native vector width double:1

              Max clock frequency:720Mhz

              Address bits:64

              Max memory allocation:451411968

              Image support:Yes

              Max number of images read arguments:128

              Max number of images write arguments:64

              Max image 2D width:16384

              Max image 2D height:16384

              Max image 3D width:2048

              Max image 3D height:2048

              Max image 3D depth:2048

              Max samplers within kernel:16

              Max size of kernel argument:1024

              Alignment (bits) of base address:2048

              Minimum alignment (bytes) for any datatype:128

              Single precision floating point capability

              Denorms:No

              Quiet NaNs:Yes

              Round to nearest even:Yes

              Round to zero:Yes

              Round to +ve and infinity:Yes

              IEEE754-2008 fused multiply-add:Yes

              Cache type:Read/Write

              Cache line size:64

              Cache size:16384

              Global memory size:1803550720

              Constant buffer size:65536

              Max number of constant args:8

              Local memory type:Scratchpad

              Local memory size:32768

              Max pipe arguments:16

              Max pipe active reservations:16

              Max pipe packet size:451411968

              Max global variable size:406270720

              Max global variable preferred total size: 1803550720

              Max read/write image args:64

              Max on device events:1024

              Queue on device max size:524288

              Max on device queues:1

              Queue on device preferred size:16384

              SVM capabilities:

              Coarse grain buffer:Yes

              Fine grain buffer:Yes

              Fine grain system:No

              Atomics:No

              Preferred platform atomic alignment:0

              Preferred global atomic alignment:0

              Preferred local atomic alignment:0

              Kernel Preferred work group size multiple:64

              Error correction support:0

              Unified memory for Host and Device:1

              Profiling timer resolution:1

              Device endianess:Little

              Available:Yes

              Compiler available:Yes

              Execution capabilities:

              Execute OpenCL kernels:Yes

              Execute native function:No

              Queue on Host properties:

              Out-of-Order:No

              Profiling :Yes

              Queue on Device properties:

              Out-of-Order:Yes

              Profiling :Yes

              Platform ID:0x7fc554fdcfd0

              Name:Spectre

              Vendor:Advanced Micro Devices, Inc.

              Device OpenCL C version:OpenCL C 2.0

              Driver version:1642.5 (VM)

              Profile:FULL_PROFILE

              Version:OpenCL 2.0 AMD-APP (1642.5)

              Extensions:

              cl_khr_fp64 cl_amd_fp64 cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics

              cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics

              cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes

              cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_atomic_counters_32

              cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops

              cl_amd_media_ops2 cl_amd_popcnt cl_khr_image2d_from_buffer cl_khr_spir cl_khr_subgroups

              cl_khr_gl_event cl_khr_depth_images

               

               

               

              Device Type:CL_DEVICE_TYPE_CPU

              Vendor ID:1002h

              Board name:

              Max compute units:4

              Max work items dimensions:3

              Max work items[0]:1024

              Max work items[1]:1024

              Max work items[2]:1024

              Max work group size:1024

              Preferred vector width char:16

              Preferred vector width short:8

              Preferred vector width int:4

              Preferred vector width long:2

              Preferred vector width float:8

              Preferred vector width double:4

              Native vector width char:16

              Native vector width short:8

              Native vector width int:4

              Native vector width long:2

              Native vector width float:8

              Native vector width double:4

              Max clock frequency:1700Mhz

              Address bits:64

              Max memory allocation:3909684224

              Image support:Yes

              Max number of images read arguments:128

              Max number of images write arguments:64

              Max image 2D width:8192

              Max image 2D height:8192

              Max image 3D width:2048

              Max image 3D height:2048

              Max image 3D depth:2048

              Max samplers within kernel:16

              Max size of kernel argument:4096

              Alignment (bits) of base address:1024

              Minimum alignment (bytes) for any datatype:128

              Single precision floating point capability

              Denorms:Yes

              Quiet NaNs:Yes

              Round to nearest even:Yes

              Round to zero:Yes

              Round to +ve and infinity:Yes

              IEEE754-2008 fused multiply-add:Yes

              Cache type:Read/Write

              Cache line size:64

              Cache size:16384

              Global memory size:15638736896

              Constant buffer size:65536

              Max number of constant args:8

              Local memory type:Global

              Local memory size:32768

              Max pipe arguments:16

              Max pipe active reservations:16

              Max pipe packet size:3909684224

              Max global variable size:1879048192

              Max global variable preferred total size: 1879048192

              Max read/write image args:64

              Max on device events:0

              Queue on device max size:0

              Max on device queues:0

              Queue on device preferred size:0

               

              SVM capabilities:

              Coarse grain buffer:Yes

              Fine grain buffer:Yes

              Fine grain system:Yes

              Atomics:Yes

              Preferred platform atomic alignment:0

              Preferred global atomic alignment:0

              Preferred local atomic alignment:0

              Kernel Preferred work group size multiple:1

              Error correction support:0

              Unified memory for Host and Device:1

              Profiling timer resolution:1

              Device endianess:Little

              Available:Yes

              Compiler available:Yes

              Execution capabilities:

              Execute OpenCL kernels:Yes

              Execute native function:Yes

              Queue on Host properties:

              Out-of-Order:No

              Profiling :Yes

              Queue on Device properties:

              Out-of-Order:No

              Profiling :No

              Platform ID:0x7fc554fdcfd0

              Name:AMD A10-7850K Radeon R7, 12 Compute

              Cores 4C+8G

              Vendor:AuthenticAMD

              Device OpenCL C version:OpenCL C 1.2

              Driver version:1642.5 (sse2,avx,fma4)

              Profile:FULL_PROFILE

              Version:OpenCL 1.2 AMD-APP (1642.5)

              Extensions:

              cl_khr_fp64 cl_amd_fp64

              cl_khr_global_int32_base_atomics cl_khr_global_int32_extended_atomics

              cl_khr_local_int32_base_atomics cl_khr_local_int32_extended_atomics

              cl_khr_int64_base_atomics cl_khr_int64_extended_atomics cl_khr_3d_image_writes

              cl_khr_byte_addressable_store cl_khr_gl_sharing cl_ext_device_fission

              cl_amd_device_attribute_query cl_amd_vec3 cl_amd_printf cl_amd_media_ops

              cl_amd_media_ops2 cl_amd_popcnt cl_khr_spir cl_khr_gl_event