cancel
Showing results for 
Search instead for 
Did you mean: 

Archives Discussions

riza_guntur
Journeyman III

Why Brook+ vs CAL iteratively rerun 10 times perform better than Brook+

I loop brook+ optimized_matmult 10 times and CAL simple_matmult 10 times for 6400x6400

Why CAL perform much better, when in 1 run the brook+ optimized_matmult runs better if not the same?

What's happen actually?

0 Likes
5 Replies
gaurav_garg
Adept I

As I said earlier, system time reported by CAL sample doesn't include a lot of stuff. You should review the code and change the timer placement similar to Brook+.

0 Likes

Thanks gaurav,

I thought those actions are excluded after first kernel call in Brook+

0 Likes

Woops... Double post...

My connection bad lately at this site.

0 Likes

Originally posted by: gaurav.garg As I said earlier, system time reported by CAL sample doesn't include a lot of stuff. You should review the code and change the timer placement similar to Brook+.

Hm... After some thought. I ask because I got the average running-time for 10 iteration are about 50 percent of that one iteration, some perform 35 percent of that one iteration. Does CAL has some sort of caching algorithm too?

0 Likes

Bump, I really need help fast

0 Likes