Understanding clause parallelism

The context is Chapter 2.3, "Control Flow and Clauses", of the R600_Instruction_Set_Architecture document.

I would like to understand the phrase "Multiple clauses of a program can execute in parallel if they contain instructions of different types and the clauses are independent of one another" from the point of view of the processors in a SIMD array running ALU clauses. Given that all 16 units follow the same code, it is unclear to me what exactly is parallelized. I am missing something. Could someone please provide an example?