The important thing is that profiler stamps depends largely on the algorithm you are implementing.So to decide whether you are doing too many texture fetches or your occupancy is low can only be compared with profiler stamps of a standard implementation of the same algorithm.
we might be able to discuss it better if you post your code.
what i can say that you should try increase ALUpacking ratio. according to guide value above 70% is good. but it do not need brings performace as ALUBusy is pretty low too.
IMHO you should compare results relatively to you previous results.