I would like to suggest making hardware that instead of reading bytes linearly, also reads them vertically and diagonally, so a byte could be used more than once, improving the processor's performance.