
GPU_MAX_ALLOC_PERCENT and 13.1 drivers failure

Question asked by liwoog on Mar 15, 2013
Latest reply on Sep 6, 2015 by nirv_knox

My code ran well in production with the 12.4 drivers and GPU_MAX_ALLOC_PERCENT set as high as 100. With the 13.1 drivers it fails with CL_OUT_OF_RESOURCES (the code allocates up to 90% of the card's memory). Lowering GPU_MAX_ALLOC_PERCENT from 100 to 80 made no difference.
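For anyone trying to reproduce this, here is a minimal sketch of how the variable can be set per run from a launcher script. The helper names `env_with_alloc_percent` and `run_with_alloc_percent` are made up for illustration, and the child command is only a stand-in that echoes the variable, not an OpenCL program. It assumes the runtime reads GPU_MAX_ALLOC_PERCENT from the process environment when it starts, so the value has to be in place before the program launches.

```python
import os
import subprocess
import sys


def env_with_alloc_percent(percent, base_env=None):
    """Return a copy of the environment with GPU_MAX_ALLOC_PERCENT set."""
    if not 0 < percent <= 100:
        raise ValueError("percent must be in (0, 100]")
    # Copy so the caller's environment is never modified in place.
    env = dict(os.environ if base_env is None else base_env)
    env["GPU_MAX_ALLOC_PERCENT"] = str(percent)
    return env


def run_with_alloc_percent(cmd, percent):
    """Launch cmd (hypothetically an OpenCL program) with the variable set."""
    return subprocess.run(cmd, env=env_with_alloc_percent(percent)).returncode


# Stand-in child process: it only echoes the variable it inherited.
child = [sys.executable, "-c",
         "import os; print(os.environ['GPU_MAX_ALLOC_PERCENT'])"]
exit_code = run_with_alloc_percent(child, 80)
```

Swapping `child` for the real binary makes it easy to compare runs at 80 and 100 without touching the shell profile.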

 

Being able to use only 2 GB of the 3 GB on each card would make it useless for my next project. I need every bit of memory I can get.

Is there a workaround?

 

Machine:

4x HD 7970

Catalyst 13.1 driver on CentOS 6.3


Operating System Version (name), Linux version 2.6.32-279.19.1.el6.centos.plus.x86_64 (mockbuild@c6b7.bsys.dev.centos.org) (gcc version 4.4.6 20120305 (Red Hat 4.4.6-4) (GCC) ) #1 SMP Wed Dec 19 06:20:23 UTC 2012

 

Operating System Version (number), 2.6.32

Number Of Processors, 32

System Type, Intel(R) Xeon(R) CPU E5-2670 0 @ 2.60GHz

Total Physical Memory, 64392 MB

Available Physical Memory, 62184 MB

Total Virtual Memory, 33554431 MB

Available Virtual Memory, 33519322 MB

Total Page Files, 8191 MB

Available Page Files, 8191 MB

 

Platform ID, 1, 1, 1, 1, 1

Device Type, GPU, GPU, GPU, GPU, CPU

Device Name, Tahiti, Tahiti, Tahiti, Tahiti, Intel(R) Xeon(R) CPU E5-2670 0 @ 2.60GHz

Vendor, Advanced Micro Devices, Inc., Advanced Micro Devices, Inc., Advanced Micro Devices, Inc., Advanced Micro Devices, Inc., GenuineIntel

Command Queue Properties, Queue profiling, Queue profiling, Queue profiling, Queue profiling, Queue profiling

Is Available, Yes, Yes, Yes, Yes, Yes

Is Compiler Available, Yes, Yes, Yes, Yes, Yes

Is Little Endian, Yes, Yes, Yes, Yes, Yes

Error Correction Support, No, No, No, No, No

Execution Capabilities, Kernel Execution, Kernel Execution, Kernel Execution, Kernel Execution, Kernel Execution + Native Kernel Execution

Global Memory Cache Size, 16 KB, 16 KB, 16 KB, 16 KB, 32 KB

Memory Cache Type, Read Write, Read Write, Read Write, Read Write, Read Write

Global Memory Cache Line Size, 64 bytes, 64 bytes, 64 bytes, 64 bytes, 64 bytes

Global Memory Size, 2,048 MB, 2,048 MB, 2,048 MB, 2,048 MB, 64,393 MB

Host Unified Memory, No, No, No, No, Yes

Are Images Supported, Yes, Yes, Yes, Yes, Yes

Max Image 2D Dimensions, (256w, 256h), (256w, 256h), (256w, 256h), (256w, 256h), (1024w, 1024h)

Max Image 3D Dimensions, (256w, 256h, 256d), (256w, 256h, 256d), (256w, 256h, 256d), (256w, 256h, 256d), (1024w, 1024h, 1024d)

Local Memory Size, 32 KB, 32 KB, 32 KB, 32 KB, 32 KB

Local Memory Type, Local, Local, Local, Local, Global

Max Clock Frequency, 1050, 1050, 1050, 1050, 1200

Max Compute Units, 32, 32, 32, 32, 32

Max Constant Arguments, 8, 8, 8, 8, 8

Max Constant Buffer Size, 64 KB, 64 KB, 64 KB, 64 KB, 64 KB

Max Memory Allocation Size, 512 MB, 512 MB, 512 MB, 512 MB, 16,099 MB

Max Parameter Size, 1,024 bytes, 1,024 bytes, 1,024 bytes, 1,024 bytes, 4 KB

Read Image Arguments, 128, 128, 128, 128, 128

Max Samplers, 16, 16, 16, 16, 16

Max Workgroup Size, 256, 256, 256, 256, 1024

Max Work Item Dimensions, 3, 3, 3, 3, 3

Max Work Item Sizes, (256,256,256), (256,256,256), (256,256,256), (256,256,256), (1024,1024,1024)

Max Write Image Arguments, 8, 8, 8, 8, 8

Memory Base Address Alignment, 2048, 2048, 2048, 2048, 1024

Minimal Data Type Alignment Size, 128 bytes, 128 bytes, 128 bytes, 128 bytes, 128 bytes

OpenCL C Version, OpenCL C 1.2 , OpenCL C 1.2 , OpenCL C 1.2 , OpenCL C 1.2 , OpenCL C 1.2

Native Char Vector Width, 4, 4, 4, 4, 16

Native Short Vector Width, 2, 2, 2, 2, 8

Native Int Vector Width, 1, 1, 1, 1, 4

Native Long Vector Width, 1, 1, 1, 1, 2

Native Float Vector Width, 1, 1, 1, 1, 8

Native Double Vector Width, 1, 1, 1, 1, 4

Native Half Vector Width, 1, 1, 1, 1, 4

Preferred Char Vector Width, 4, 4, 4, 4, 16

Preferred Short Vector Width, 2, 2, 2, 2, 8

Preferred Int Vector Width, 1, 1, 1, 1, 4

Preferred Long Vector Width, 1, 1, 1, 1, 2

Preferred Float Vector Width, 1, 1, 1, 1, 8

Preferred Double Vector Width, 1, 1, 1, 1, 4

Preferred Half Vector Width, 1, 1, 1, 1, 4

Profile, FULL_PROFILE, FULL_PROFILE, FULL_PROFILE, FULL_PROFILE, FULL_PROFILE

Profiling Timer Resolution, 1, 1, 1, 1, 1

Vendor ID, OpenCL 1.2 AMD-APP (1113.2), OpenCL 1.2 AMD-APP (1113.2), OpenCL 1.2 AMD-APP (1113.2), OpenCL 1.2 AMD-APP (1113.2), OpenCL 1.2 AMD-APP (1113.2)
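To put numbers on the gap, here is a rough arithmetic sketch using the figures from the dump above (2,048 MB global memory and 512 MB max allocation per Tahiti device) and the 3 GB per card stated in the post. The function `allocation_plan` is a hypothetical helper, not part of any OpenCL API. It only does the arithmetic; whether the runtime would accept such requests is exactly the open question.

```python
MB = 1024 * 1024


def allocation_plan(target_bytes, max_alloc_bytes):
    """Split target_bytes into chunks no larger than max_alloc_bytes."""
    if target_bytes < 0 or max_alloc_bytes <= 0:
        raise ValueError("sizes must be positive")
    full, rest = divmod(target_bytes, max_alloc_bytes)
    return [max_alloc_bytes] * full + ([rest] if rest else [])


physical = 3 * 1024 * MB          # 3 GB on the card, per the post
reported_global = 2048 * MB       # "Global Memory Size" in the dump
reported_max_alloc = 512 * MB     # "Max Memory Allocation Size" in the dump

target = physical * 90 // 100     # the 90% the code tries to allocate
plan = allocation_plan(target, reported_max_alloc)

# True when the request is larger than what the driver reports as global memory.
exceeds_reported = target > reported_global
print(len(plan), "buffers needed;", "exceeds reported global memory:", exceeds_reported)
```

With these inputs the 90% target is larger than the 2,048 MB the driver reports, which would be consistent with an out-of-resources failure if the reported limits are the ones actually enforced.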
