Where can I find BLAS/LAPACK implementations for OpenCL? The ATI SDK for OpenCL includes some operations, such as matrix multiplication and eigenvalue computation, but I need a more complete package, especially one that provides matrix inversion and singular value decomposition.