I am not sure if I understand your question correctly. Are you using domainOffset and domainSize methods to change the domain? Or, you have just reduced the stream size becuase you are using uint4 instead of uint.
You can use both the methods in compute Shader too.
I just reduced the stream size because I am using uint4.
a = NxN (in uint4)
b = NxN (out uint4)
domain width = N/4 domain height = N