I am working on an integer program that walks a bush (multiple-tree) topology and performs SIMD operations on each node along the way. Without pthreads, opencc produces code that is 5%-10% faster than gcc 4.2. With pthreads, however, even if only one thread is created and used, the opencc build is 40% *slower* than the gcc 4.2 build.
Are there any known issues that could explain this? If not, how should I go about tracking down the problem?
Also, does opencc accept the data type attribute for memory alignment?
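To make the alignment question concrete, here is a minimal sketch of the kind of attribute I mean, assuming the GCC-style `aligned` type attribute and 16-byte alignment for SSE. The `vec4i` type and both helper functions are made up for illustration and are not from my actual program; whether opencc treats the attribute the same way as gcc is exactly what I am asking.

```c
#include <assert.h>
#include <stdint.h>

/* GCC-style type attribute (assumed syntax): every vec4i object should be
   placed on a 16-byte boundary, as aligned SSE loads and stores require. */
typedef struct {
    int32_t lane[4];
} __attribute__((aligned(16))) vec4i;

/* Hypothetical helper: returns 1 if p sits on a 16-byte boundary. */
static int is_aligned16(const void *p)
{
    return ((uintptr_t)p & 15u) == 0;
}

/* Hypothetical self-check: verifies that static, stack, and array
   objects of the type all honor the requested alignment. */
static int check_vec4i_alignment(void)
{
    static vec4i in_static;
    vec4i on_stack;
    vec4i in_array[3];
    int i;

    if (!is_aligned16(&in_static) || !is_aligned16(&on_stack))
        return 0;
    for (i = 0; i < 3; i++) {
        if (!is_aligned16(&in_array[i]))
            return 0;
    }
    return 1;
}
```

Calling `check_vec4i_alignment()` once at startup in both the gcc and opencc builds would show whether the attribute is being honored by each compiler.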