reading slides of 69xx compute features seems that still will not have all features supported by Fermi, as real function calls (with stack) which will allow recursion and function pointers, I was thinking that AMD could add to CPU backend while waiting for being implemented on hardware..
Also this would be similar idea as CUDA-x86 which will have to support such CUDA features..
Specifically I would add to CPU backend:
*Real Function calls (supporting recursion) and function pointer support: this can be implemented right now pretty efficiently seems as Ocelot does in translating CUDA 3.x PTX files to LLVM..
*Similar to printf add malloc and free.. This can be efficient too.. as Nvidia implements even on GPUs.. Your 69xx cards support calling malloc and free in GPU code as Fermi supports it?
*Add asm("") function being able to insert x86 assembly code in kernels
CUDA allows asm function inside CUDA device functions..
*Image support and autovectorization: intel supports both!