Hi @hotstone,
If you just need an OpenCL FFT library to use, I would point you to the open-source clFFT library on GitHub:
clMathLibraries/clFFT · GitHub
If your interest is in writing your own implementation for learning, then I can't help you with this article. You should be able to step through the source in the repository above and see how it constructs its kernels, but its development was not related to this article.
Kent