Best GPU algorithm for solving a 4x4 system of linear equations

What is the best GPU algorithm for solving a system of linear equations Ax = b, where A is a 4x4 or 3x3 matrix?


Each system is to be solved entirely within a single work item.
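
To make the setup concrete, here is a minimal sketch of the computation one work item would perform, written as plain C so it can be checked on the host. It assumes Gaussian elimination with partial pivoting as a baseline. That choice, the function name solve_small, the macro N, and the singularity threshold are all illustrative assumptions, not a claim about which algorithm is actually best on a GPU.

```c
#include <math.h>
#include <stdbool.h>

#define N 4  /* hypothetical compile-time size; set to 3 for the 3x3 case */

/* Solve A x = b for one small dense system using Gaussian elimination
 * with partial pivoting. Works on a local copy, so the inputs are not
 * modified. Returns false if a pivot is (numerically) zero. */
static bool solve_small(const float A[N][N], const float b[N], float x[N])
{
    float a[N][N + 1];  /* augmented matrix [A | b], small enough for registers */

    for (int i = 0; i < N; ++i) {
        for (int j = 0; j < N; ++j)
            a[i][j] = A[i][j];
        a[i][N] = b[i];
    }

    /* Forward elimination */
    for (int k = 0; k < N; ++k) {
        /* Pick the row with the largest absolute value in column k */
        int p = k;
        for (int i = k + 1; i < N; ++i)
            if (fabsf(a[i][k]) > fabsf(a[p][k]))
                p = i;

        /* Assumed threshold for treating the matrix as singular */
        if (fabsf(a[p][k]) < 1e-12f)
            return false;

        /* Swap the pivot row into position k */
        if (p != k) {
            for (int j = k; j <= N; ++j) {
                float t = a[k][j];
                a[k][j] = a[p][j];
                a[p][j] = t;
            }
        }

        /* Eliminate column k from the rows below */
        for (int i = k + 1; i < N; ++i) {
            float f = a[i][k] / a[k][k];
            for (int j = k; j <= N; ++j)
                a[i][j] -= f * a[k][j];
        }
    }

    /* Back substitution */
    for (int i = N - 1; i >= 0; --i) {
        float s = a[i][N];
        for (int j = i + 1; j < N; ++j)
            s -= a[i][j] * x[j];
        x[i] = s / a[i][i];
    }
    return true;
}
```

All loop bounds are compile-time constants, so a compiler can unroll them fully. In a kernel, each work item would call such a routine on its own A and b.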


Thank you in advance!


Vis Cocoa