rosario.c

Q: How to count cache misses, at a given level of cache, per iteration

Discussion created by rosario.c on Oct 7, 2010
Latest reply on Oct 7, 2010 by rosario.c
Q: How to count cache misses, at a given level of cache, per iteration

Hello,

my name is Rosario. Only recently I started using oProfile. I wonder if
there is any way to dump cache misses count per iteration with oProfile.

For example, let's assume I have a loop like the following:

for ( int i = 0 ; i < 1000 ; i ++) {
for ( int j = 0 ; j < 1000 ; j ++) {
float sum = 0 . 0 ;
for ( int k = 0 ; k < 1000 ; k++) {
sum = sum + a[k] * b[k][j];
}
r
[j]=sum ;
}
}

I am interested in the count of the L1D load misses per iteration of the
outermost loop. Therfore, the output of the profiling should be a list
containing 1000 pairs (iteration number, L1D load count).

Is there any support in oProfile to count misses per iteration? If not, can
you suggest any way to obtain profiling data in the form I explained above?

Thank you for your time,
- Ro

Outcomes