For constant memory performance, the access patterns, from highest to lowest performance, are as follows:
constant ptr w/ compile time constant
constant ptr w/ runtime constant for all threads
constant ptr w/ linear access from all threads
constant ptr w/ random access
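To make the four cases above concrete, here is a rough sketch of how each one might look in a kernel. The kernel name `access_patterns`, its arguments, and the `fake_gid` shim are made up for illustration; the shim block only exists so the snippet also compiles as plain C on the host, and in real OpenCL C the qualifiers and `get_global_id` are built in.

```c
#include <stddef.h>

/* Host-side shims (an assumption for illustration) so this sketch also
   compiles as plain C. A real OpenCL C compiler skips this block. */
#ifndef __OPENCL_VERSION__
#define __kernel
#define __constant const
#define __global
static size_t fake_gid = 0;  /* hypothetical stand-in for the work-item id */
static size_t get_global_id(unsigned dim) { (void)dim; return fake_gid; }
#endif

__kernel void access_patterns(__constant float *c,      /* constant buffer */
                              __global const int *idx,  /* per-thread indices */
                              int k,                    /* same value for every thread */
                              __global float *out)
{
    size_t gid = get_global_id(0);

    float a = c[3];         /* 1. index is a compile-time constant */
    float b = c[k];         /* 2. index is a runtime value shared by all threads */
    float d = c[gid];       /* 3. linear access: neighbouring threads read neighbouring elements */
    float e = c[idx[gid]];  /* 4. random access: index differs unpredictably per thread */

    out[gid] = a + b + d + e;
}
```

The caller is assumed to size `c` and `idx` so that every index used here is in range.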
Thanks, that was very helpful.
But is there a reasonable limit on the number of literals in a kernel? At what number will performance start to suffer?
Literals are embedded in the instruction itself, so there is no limit on the number of literals in theory. In practice it is limited to 16k unique literals in a compilation unit.
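As a small illustration of the difference between a literal and a value read through a constant table, consider the sketch below. The function names and the `coeff` table are hypothetical, and the code is written as plain C; in OpenCL C the table would carry the `__constant` qualifier. A compiler is also free to fold such a small table back into literals, so treat this as a picture of the source-level distinction only.

```c
/* Coefficients written as literals: the values appear directly in the code. */
static float poly_literal(float x)
{
    return (0.5f * x + 2.0f) * x + 1.0f;
}

/* Same coefficients stored in a table (would be __constant in OpenCL C)
   and read with compile-time constant indices. */
static const float coeff[3] = { 1.0f, 2.0f, 0.5f };

static float poly_table(float x)
{
    return (coeff[2] * x + coeff[1]) * x + coeff[0];
}
```

Both functions evaluate 0.5x^2 + 2x + 1; only the way the coefficients reach the arithmetic differs.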