Hey guys, I am stuck with a strange problem that is probably not even the fault of the cards themselves, but I hope someone here has an idea how to troubleshoot next.
I wanted to put 3 Pro Duo Polaris cards in a server so it would have 6 physical GPUs.
All the cards work individually, they all work as a pair with any one other card, and they also work in all the slots.
But as soon as I install all three, it won't boot.
The motherboard, an ASRock X399 Taichi with a 2950X, just keeps cycling boot codes in a continuous loop.
I already contacted their support but no response as of yet.
I can't get a screen or the BIOS, so it's stuck in the pre-boot check.
Power is not the issue, I swapped the PSU for a brand new 1300 W one.
Same problem.
If I install 2 Pro Duos and a WX 4100 it works, but if I swap in a third Pro Duo it goes haywire.
Hope someone has an idea what to try out next.
I have fixed the problem, so here is a quick recap for anyone running into the issue.
According to what I have read, it has to do with the PCIe address space, which by default is limited to 32-bit on this motherboard.
Each Pro Duo card has a PCIe switch plus the two GPUs themselves, so every card consumes a good chunk of address space.
Also, the board was crammed full of NVMe and other stuff, so the third card could not be addressed.
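If you want to see how much address space each device is actually holding, Linux lists the regions in /sys/bus/pci/devices/<address>/resource, one line per region with start, end and flags in hex. Here is a rough Python sketch that adds up the memory regions per device. The function names are my own, and it only sees regions that were actually assigned, so run it in a configuration that still boots (for example with two cards installed).

```python
import glob
import os

IORESOURCE_MEM = 0x00000200  # "memory region" flag bit from the Linux kernel


def mem_bytes(resource_text):
    """Sum the sizes of assigned memory regions in one sysfs 'resource' file."""
    total = 0
    for line in resource_text.splitlines():
        fields = line.split()
        if len(fields) != 3:
            continue  # skip anything that is not "start end flags"
        start, end, flags = (int(f, 16) for f in fields)
        # Unassigned regions show up as all zeros, so end > start filters them.
        if flags & IORESOURCE_MEM and end > start:
            total += end - start + 1
    return total


def report(root="/sys/bus/pci/devices"):
    """Print memory address space per PCI device in MiB, largest first."""
    rows = []
    for path in glob.glob(os.path.join(root, "*", "resource")):
        with open(path) as fh:
            size = mem_bytes(fh.read())
        rows.append((size, os.path.basename(os.path.dirname(path))))
    for size, addr in sorted(rows, reverse=True):
        if size:
            print(f"{addr}  {size / 2**20:10.1f} MiB")


if __name__ == "__main__":
    report()
```

Don't add the numbers together across devices: as far as I understand it, bridges and the switch on the card also list the windows that cover whatever sits behind them, so the same space would be counted twice.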
You have to tweak the following settings in the BIOS if you run into this:
Disable CSM
IOMMU on
Above 4G decoding <------ most important one to turn on.
After that it worked like a charm.
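To double check that the setting really took effect, you can look for PCI memory regions that start at or above the 4 GiB mark. Below is a small Python sketch that reads the same sysfs resource files Linux exposes under /sys/bus/pci/devices. The names are made up by me, and it assumes the firmware actually moves some regions above 4 GiB once the option is on, which may not be true on every board.

```python
import glob
import os

FOUR_GIB = 1 << 32
IORESOURCE_MEM = 0x00000200  # "memory region" flag bit from the Linux kernel


def regions_above_4g(resource_text):
    """Return (start, end) pairs of memory regions starting at or above 4 GiB."""
    found = []
    for line in resource_text.splitlines():
        fields = line.split()
        if len(fields) != 3:
            continue  # skip anything that is not "start end flags"
        start, end, flags = (int(f, 16) for f in fields)
        if flags & IORESOURCE_MEM and end > start and start >= FOUR_GIB:
            found.append((start, end))
    return found


def main(root="/sys/bus/pci/devices"):
    """Print every device region that is mapped above 4 GiB."""
    for path in sorted(glob.glob(os.path.join(root, "*", "resource"))):
        with open(path) as fh:
            hits = regions_above_4g(fh.read())
        addr = os.path.basename(os.path.dirname(path))
        for start, end in hits:
            print(f"{addr}  {start:#x}-{end:#x}")


if __name__ == "__main__":
    main()
```

If it prints nothing, either the option did not stick or nothing needed to be placed up there, so treat it as a hint rather than proof.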
How cool is it to see the following line in the software:
running the test data set on up to 32 CPU cores and on up to 6 GPUs