When you set the BRT_RUNTIME to CPU does it emulates the parallelization or it jsut runs the code serially.
Well i execute a serial code and execute it (i have a written a C program ) it takes less time while the BROOK+ code when executed with CPU backend take a very very long time . Input range being the same.
I would really like to know the answer for the same.
I read the hard architecture provided in brook+. It says stream processors have SIMD engine and SIMD engines have thread processors which have stream core. So how many threads are executed at time. I am using FireStream 9170. The document says wavefron size is 64. I checked the specification of 9170 on AMD website doesn have the full description as they have for 9270.
Would really want answers to this question