Hello, Using the latest linux driver shows a much worse performance for float16 and double16. Is this expected?
Meassured with clpeak:
driver_version | 1642.5 (sse2) -> 1702.3 (sse2)
float16| 9.96633 -> 1.91836 Mflops
double16| 2.37198 -> 0.640011 Mflops
- Coarse grain buffer: Yes
- Fine grain buffer: Yes
- Fine grain system: Yes
- Atomics: Yes
+ Coarse grain buffer: No
+ Fine grain buffer: No
+ Fine grain system: No
+ Atomics: No
Any pointers will be appreciated. Thanks!