cancel
Showing results for 
Search instead for 
Did you mean: 

OpenCL

himanshu_gautam
Grandmaster

Re: Problems with Cat 12.10 and up and HD7xxx (and not only) GPUs

Hi,

Still not able to access this link as my company network does not allow it.

Please re-share the test case by attaching it here itself as a zip file.

EDIT: Use advanced text editor to attach the testcase.

Message was edited by: Himanshu Gautam

0 Likes
Reply
Raistmer
Adept II

Re: Problems with Cat 12.10 and up and HD7xxx (and not only) GPUs

Here it is.

0 Likes
Reply
himanshu_gautam
Grandmaster

Re: Problems with Cat 12.10 and up and HD7xxx (and not only) GPUs

Thanks Raistmer.

I guess the problem is already reported by binying, but I was not able to find a tracking number for it. Will let you know the status now.

0 Likes
Reply
himanshu_gautam
Grandmaster

Re: Problems with Cat 12.10 and up and HD7xxx (and not only) GPUs

Hi Raistmer,

I tried the two executables you had shared (MB7_win_x86_SSE_OpenCL_ATi_r1726_verbose.exe and setiathome_6.99_windows_intelx86__opencl_ati_sah.exe) on Drivers 13.1, 12.8 and 12.3. Both the applications always resulted in driver crash.

My system details: HD 7970, Driver: 13.1,12.8,12.3, CPU: FX4100

Anyways I will report it to AMD Team Again. Sorry could not find the reference to the old bug.

0 Likes
Reply
Raistmer
Adept II

Re: Problems with Cat 12.10 and up and HD7xxx (and not only) GPUs

Are you sure you was able to downgrade recent drivers properly.

Inability of AMD Catalyst installer to properly do OpenCL runtime downgrade is known bug and was reported by Claggy on these forums too.

Very possible that all variants you tried were on the same recent 13.1 OpenCL runtime that fails to compile at all indeed.

Real OpenCL runtime from Cat 12.8 has no issues with app. And to check initial problem with invalid computations you should try Catalyst 12.10 drivers, not ones you tried.

0 Likes
Reply
himanshu_gautam
Grandmaster

Re: Problems with Cat 12.10 and up and HD7xxx (and not only) GPUs

I have not seen any problems in downgrading to old drivers with a clean system. I could see proper driver versions in CCC.

Anyways will check with 12.10 too. (As per you, i should get invalid results with 12.10 driver, how to verify that?).

I will check once more with 12.8 with more rigorous cleanup. Thanks for your support.

0 Likes
Reply
Raistmer
Adept II

Re: Problems with Cat 12.10 and up and HD7xxx (and not only) GPUs

Driver version and OpenCL runtime version are quite different things. Be careful to refer the right one (OpenCL runtime).

I have reports of success with running Catalyst 13.1 video (and perhaps sound and so on) driver but with OpenCL runtime taken from Cat 12.8. Only OpenCL compiler works incorrectly. BTW, did you remove *.bin* files between runs? If you will run with compiled binaries (under old driver) on new driver you will get correct results too (cause again, OpenCL compiler broken, not OpenCL runtime per se. If one already have right binary it will be executed OK.

For now to check if app works differently check stderr.txt file for number of found signals.

I will attach validating tool later.

0 Likes
Reply
Raistmer
Adept II

Re: Problems with Cat 12.10 and up and HD7xxx (and not only) GPUs

To check validness of computation one can use attached tool and reference result (inside archive).

usage:

rescmpv5.exe ref-setiathome_6.98_windows_intelx86.exe-PG0395_v7.wu.res result.sah

where result.sah is the file generated after app full run.

tool output self-explaining. In case of big result differencies it will show table with quality of found signals between 2 files.

Examples of usage:

Cat 12.8 run:

E:\123>rescmpv5.exe ref-setiathome_6.98_windows_intelx86.exe-PG0395_v7.wu.res result.sah

Result      : Strongly similar,  Q= 99.41%

not 100% similarity almost inevitable between CPU and GPU long floating point computations but similarity good enough.

Cat 12.8 but binaries taken from Cat 12.11 beta 8:

1) driver restart occured (just as was reported in initial post).

2)

E:\123>setiathome_6.99_windows_intelx86__opencl_ati_sah.exe

E:\123>rescmpv5.exe ref-setiathome_6.98_windows_intelx86.exe-PG0395_v7.wu.res result.sah

                ------------- R1:R2 ------------     ------------- R2:R1 ------------

                Exact  Super  Tight  Good    Bad     Exact  Super  Tight  Good    Bad

        Spike      0      0      0      0      0        0      0      0      0      0

     Autocorr      0      0      0      0      1        0      0      0      0      0

     Gaussian      0      0      0      0      1        0      0      0      0      0

        Pulse      0      0      0      0      0        0      0      0      0      0

      Triplet      0      0      0      0      0        0      0      0      0      0

   Best Spike      0      0      0      0      1        0      0      0      0      0

Best Autocorr      0      0      0      0      1        0      0      0      0      0

Best Gaussian      0      0      0      0      1        0      0      0      0      0

   Best Pulse      0      0      0      0      1        0      0      0      0      0

Best Triplet      0      0      0      0      0        0      0      0      0      0

                ----   ----   ----   ----   ----     ----   ----   ----   ----   ----

                   0      0      0      0      6        0      0      0      0      0

Unmatched signal(s) in R1 at line(s) 672 689 716 732 749 775

Result      : Different.

As one can see number of found results differs (of course, app was terminated after driver restart, computations not finished).

One will see similar table if computation will finish ok, but with wrong results.

Validation tool will show differencies as in this sample.

P.S.:

(As per you, i should get invalid results with 12.10 driver, how to verify that?).

Yes, expect wrong result (but no driver restart ) with Cat 12.10. Driver restarts appeared on later driver releases.

Tool for verification and how to use it described in this post, above.


0 Likes
Reply
himanshu_gautam
Grandmaster

Re: Problems with Cat 12.10 and up and HD7xxx (and not only) GPUs

Hi Raistmer,

Probably you were right about driver downgrading issue. Here are my observations:

1. I had installed 12.10 without a proper system clean and saw the driver crash there. Ran rescmpv5.exe and result were incorrect.

2. Then I had cleaned the system using AMD cleanup utility before installing any other driver:

3. Installed 12.8 driver: SETI.exe ran without a crash. Check correctness with rescmpv5.exe and it gave 99.9% correctness.

4. Installed 12.10 again, and surprisingly seti.exe again ran without crash. rescmpv5 also passed correctly.

5. Now installed 13.1 driver, SETI.exe crashed. rescmpv5.exe confirms incorrect result.

Attached are the result.sah and stderr file for all cases.

So our observations are differing for 12.10 driver as of now. But anyways it is a bug. Please provide any feedback you have on the results.

Claggy
Adept II

Re: Problems with Cat 12.10 and up and HD7xxx (and not only) GPUs

himanshu.gautam wrote:

Hi Raistmer,

Probably you were right about driver downgrading issue. Here are my observations:

1. I had installed 12.10 without a proper system clean and saw the driver crash there. Ran rescmpv5.exe and result were incorrect.

2. Then I had cleaned the system using AMD cleanup utility before installing any other driver:

3. Installed 12.8 driver: SETI.exe ran without a crash. Check correctness with rescmpv5.exe and it gave 99.9% correctness.

4. Installed 12.10 again, and surprisingly seti.exe again ran without crash. rescmpv5 also passed correctly.

5. Now installed 13.1 driver, SETI.exe crashed. rescmpv5.exe confirms incorrect result.

Attached are the result.sah and stderr file for all cases.

So our observations are differing for 12.10 driver as of now. But anyways it is a bug. Please provide any feedback you have on the results.

Looking at the result from:

4. Installed 12.10 again, and surprisingly seti.exe again ran without crash. rescmpv5 also passed correctly.

It looks as if Raistmer has supplied a workunit that doesn't show a weakily similiar result on Cat 12.10, it has:

'WU true angle range is :  0.394768'

The Workunits that that showed the weakily similar reult were the PG0009_v7.wu and the refquick_v7.wu workunits,

which have 'WU true angle range is :  0.008955' and 'WU true angle range is :  0.775000' respectively.

Here's a full bench of five different workunits (with 3 different apps) where those two workunits are weakily similar.

Claggy

Edit: added PG0009_v7.wu and refquick_v7.wu workunits along with ref files for said workunits.