I have tested it on an AMD 5870 and an NV 9800: stable and correct results. The spec does not say that the result is going to be undefined, only that it is not allowed. However, in practice it is allowed and the result is correct.
I have no idea then. I will keep the current implementation for the moment, because copying memory from image to image each time, well, you know ...