an anybody tell me if these profiling's results are good, pls?
Am I making too many texture fetches? Have the kernel good GPU occupancy? Do you see any evident problem apart from its a complex kernel and I am fetching like mad , pls?
Thanks.
bubu,
The important thing is that profiler stamps depends largely on the algorithm you are implementing.So to decide whether you are doing too many texture fetches or your occupancy is low can only be compared with profiler stamps of a standard implementation of the same algorithm.
we might be able to discuss it better if you post your code.
what i can say that you should try increase ALUpacking ratio. according to guide value above 70% is good. but it do not need brings performace as ALUBusy is pretty low too.
IMHO you should compare results relatively to you previous results.