For a thread with tID % 4 == 0 to get the x values from the 4 threads, the neighboring threads must also execute lds_read_vec_neighborExch.
Is this true?