I have seen many of these posts...and tried so many things.
Some Details and the Initial Problem
Things I have tried
Things I find noteworthy
I tried the testing in 2 ways, one with CBP and XMP on, the other without. No change was found in any of the testing.
With all defaults and only removing ZeroRPM and Increasing the fan curve dramatically, the card was able to function much longer before getting this error. However, "much" in this case means it would sometimes get through the 60 second stress test, but never a second time. It still, in most cases would fail and green screen during the first test but with ZeroRPM and no fan curve set, it would fail in the first 20 seconds, without it would usually get passed 30.
Lowering the voltage had no effect. I think I could go down to 980 or somewhere in that range. Tried many different values.
Lowering the GPU had no effect. I tried at 2100, 2000,...,1500. I tried all these values and found nothing remarkable.
The memory had no lowering, and was always at 2000 mhz, the only tuning I could do would be to increase the VRAM, which seemed would not help in this issues so I left the values alone here.
I tried the power settings at different percent increases, up to like 15 I think (which was max) and no change could be seen.
I feel that running the same test and playing the same games all this is happening on while using the 5700 xt removes most of the possibility this is coming from something on my system. For instance, if the wattage draw when running the 6800 xt at a much lower GPU is the same or lower as my 5700, the power supply should not be the issue since the crashing still occurred with no change. Also, since the GPU is not pushing more than the 5700 xt when the 6800 xt is lowered, it should be something the CPU or RAM cannot handle for some reason, and in the same regard, the board as well if the GPU is actively doing less, that is less for the board to handle. Basically, in the testing I am reducing the 6800 xt to something below the 5700 xt and the issues still occur at the same rate as having the 6800 xt on defaults. Of course, that is a little different when turning off ZeroRPM and increasing the fan curve...but only slightly.
Lastly, I decided to make a partition on the WD 1tb drive and try installing windows there and testing this out on a new install. My thoughts are that it would not solve the issue since even when looking through bios settings while the 6800 xt is installed, I am getting resets (though it does not green screen, but I do see some vertical artifacts before it just resets). In the process of installing Windows, the computer reset (I install windows all the time, this was not a typical installation reset, this was not a planned action). Eventually, I got windows installed, and the green screen occurred before I could even install the chipset drivers. Still, pressed on, got everything installed, still could not pass a stress test. So the issue is not windows, and I do not believe the issue is the drivers either. The crashing was so bad, I had to put the 5700 xt back in just to remove the partition and and fix my boot.
Things I know about, but haven't tried
I have a ticket with Gigabyte over this...but it sucks. I had to over spend by hundreds to get this card and if they RMA it and the same crap happens....ungghh....don't even want to think of that.
So does anyone have anything else they think may help in figuring this out? Its a common issue it seems getting the Cache Hierarchy WHEA Logger Event 18 Processor Core APID [n] error. It seems many people have tried many things and there is never a concrete answer. I am of the thinking this card is truly defective. But, at the same time, more minds on a problem are better than one.
Removed my 6800 xt. With the 5700 xt installed, I ran DDU in safe mode (first option) and then restarted. I used the next to latest WHQL Radeon drivers and let it extract. If I let Radeon continue and choose install driver only, it does not help when I get teh 6800 xt back in. But, if I close the installer, then update the display adapter in the device manager with the path that the Radeon extractor saved the drivers, then turn the machine off and put the 6800 xt back in, then once started do the same with the newet WHQL drivers and then do to device manager and updated the 6800 xt to the newest driver, I am able to sparingly use the card with afterburner loading and setting a much higher fan curve. Without after burner and the much higher fan curve, I am not able to pass any test or bench marks.
I say sparingly because I can pass a 1080p Heaven run for 15 minutes, but as soon as I put it to 1440p (on a card that should be 4k ) it green screens. If I run RDR2 at 1080 p, I was able to run around for a while without a crash, but move that up to 1440, its green screen time. Any VR I try crashes within minutes.
I have removed my m2 NVM drives, let everything cool down, and then installed a Sata HD (yes, HD, still have a bunch of those), popped the CMOS, waited a bit and put it back, started the machine with the 6800xt, and I could not make it through an installed of windows with the machine rebooting. Put the 5700xt, installed windows with no issues.
My hope here was that somehow the m2 NVM drives were someone causing a problem...apparently they do not.
The card still causes reboots even when I am in bios, which having the 5700 xt installed, I do not experience at all.
I am guessing my next test is either to try this other x570 Gigabyte Auros Elite board, which I would rather not open...I would rather sale it. Also, I could take my power supply off this machine and put it in a different one of my computers (which are all intel but older) and see if the crashing occurs. But, if it does not, that does not really tell me a whole lot except that by default the card is not exactly faulty....but it still may be regarding AMD CPUs I guess.
I hate this. If I RMA the card with gigabyte, its either bad or not and they either return a fixed, new, or the same card that they found to be working but still will not work for me. Then if I RMA with newegg, its either to get the full amount I paid if it is defective per their tests, or get what I paid minus restocking which is outrageous at $250 per item. So newegg = money or same card back that is not working for me, gigabyte = fixed card, or same card that is not working for me. If I send it to gigabyte, it would not return in time for me to send it back to newegg.
Anyone else? I see other posts about with the same info, but they do not seem to be doing as much testing to figure this out. I also do not see anyone saying anything about the system restarting even in bios.
On all these restarts when I am in bios or trying to install windows (I have tried in more ways than I wrote so far), the 6800 xt fans spin at startup, then stop. The back of the card is hot to the touch...not burning, but still hot, and yet the fans never come on.
This is interesting in relation to some other things:
When using adrenaline, if I could get to the manual settings before the green screen happened and turn of ZeroRPM and set the curve higher, the fans start spinning, I do not think I got a green screen until I ran tests that further stressed the card.
If I start with afterburner and have it apply the higher fan curve, I do not get green screen and fans are spinning.
After a green screen, if I touch the card, its hot. Which to me may mean that when I am getting reboots in bios and installing windows, the card's vbios is telling it not to turn the fans on, which after burner overrides if I can get that far and adrenaline overrides if I can get that far.
There is no vbios update for this card per the gigabyte tech support agent I spoke to.
So...if the card is hot, I imagine the junction temps are pretty high. But, without software telling it do so, the fans do not kick on.
I keep leaning more and more to this being a specifically bad card. But if I send it off to gigabyte, I am not sure how thorough they will test it and if they just throw it in a machine, open some crap game, and it works, then they are sending the same card back and its outside my windows for return. I guess I could send a note with it explaining the issue more than there small few sentence RMA textbox for the issue would allow.
I think this is sealing it as a bad card, or bad advice from AMD...not sure which.
I just installed the 6800 xt to a bit older machine, but still one that should be powerful enough:
Intel Core i7 4790 @ 3.6gh
32gb of ram (older DDR3)
Gigabyte h97m-d3h Board (same brand as the card)
Corsair CX 750
So I put the card in. Started up the machine. I had it connected via Display port since I did not have an HDMI cable here at work. I have typically be using HDMI at home.
I went to download the drivers from AMD, black screen. The machine did not reboot....but it did not do anything else either. I turned off the machine, and this card was super hot. The fans never spun. I cannot risk my server at work anymore than what I did already. But I think two machines, both in the realm of being capable of using this card, crashing without even actually stressing the card is a sure fire indication this card is bad. The fans not spinning...maybe that is part of their bios, but I cannot see why they would not spin when the back of the card is so hot.
I will be RMA'ing this thing with Gigabyte and including all of my test. If they come back with, "It works for us", then I am pretty sure we are all being scammed by someone, be that AMD or the card manufactures. I am only semi-serious there.
However, the one thing that is the same among these tests is the power supply wattage being 750 watts. Mine being a corsair hx 750 Platinum and this one at work being a lesser corsair cx 750 bronze I think. But, per everything I have read from official documentation and pcpartpicker (not anecdotal or opinions on forums) 750, especially gold or above, should be enough. And remember, this card is crashing without even doing anything so no where near 300 watts are even being used and when I can get it running for a while on my machine, it went at least 10 minutes on 1080 p. So, are these cards capable of running on 750 watts? I would love to hear from someone actively doing that without any issues. If being generous and I say 80 for the board, 10 for 2 rams sticks, 125 for the CPU, 10 for cpu fan, 10 for 3 case fans, 50 for 2 NVM drives, that is 285. I also tried with one HD instead of NVM which takes less watts. That leaves at min 465 watts. I cannot believe that a non stressed card would fail because of a lack of wattage, and remember those numbers above are a system in stress, and the 285 is likely quite high.
I guess that is it for testing. I will be sending the card to gigabyte and post back when I hear back from them.
Your issue and experience is so much like mine, that I had to steal your formatting for my initial post. Hopefully someone gets to my ticket with Gigabyte some time soon.
Gigabyte tested my card and found 2 issues, one was the memory module and the other I could not understand what the guy said but it sounding like it was related to power. But, apparently the issue was the card itself was bad and they are sending a new one. I anticipate some type of document or explanation at some point that better explains the issues and I will post that here.
Who knows how long that will take, but I am hopeful the issue will be resolved when I get the new one.
I will post back when I get the new one.
The RMA came via email in about 2 days. I did call and speak with a tech before hand and explained the issue and they said to RMA it...maybe that sped it up some?
They did say that RMA's can take 3-5 days I think and then shipping can take 2 weeks with 1-2 weeks out from being inspected and tested. Mine seemed to move through the system quite a bit faster, but maybe work within those timelines? They also said those were typical times right now and that depending on the workload it could take longer.
It has been about 18 days since they tested the card and found it had issues. They said they were about 2 weeks out, but I called today and they said they are not sure as it could take longer.
Also, they do not give more detail to the problems they found, so the best detail I am getting is "a bad memory module" and "a power issue on a chip" which is literally all the technicians wrote.
Somewhat irritated that A) I am not going to get more information about what went wrong to help others and B) That I am sure they have sold more of this same card in the time I am waiting to receive mine.
If I owed the bank $1000 and said instead I wanted to buy $1000 worth of peanuts, the bank would not care because I have a debt to them and I would be fined for not honoring my obligation of that debt before doing something else. I feel the same logic applies here, gigabyte should be obligated to withhold a sale in order to remedy their debt to me. I have already paid for their product, and yet, I have no product. What is an acceptable time frame here? Each day or week or month is one day or week or month I do not get enjoy the product I paid for within the applicable time that product is meant to be used. What if takes 6 months...a year... I paid for a card that would be high end for the immediate foreseeable future and if it takes to long, then I am effectively paying for a used-to-be high end card or the least a high end card with a limited time remaining of being high end. I realize that is a bit drastic and probably unlikely, its more likely I will get the card in the next weeks, but I needed to vent on my disappointment.
I found the same, on MSI,ASUS,ASROCk,ZOTAC, ETC
i had issues just like the 1000s of posts I've seen so far.. just re paste and it all goes away.. when u have parts of the chip that HASNT HAD THERMAL PASTE ON IT FROM FACTORY U will have black screens weird power issues based on parts of chip exposed...
here is my XFX RX6800 qICk XFX already SANDED the GPU DIECHIP showing its a MCM already before I sanded the ram chip surface and VRM surface allowing for better thermal contact to metal under the plastic... cut the thermal pads u can save some heat wash by separating the VRM and RAM thermal pads not sharing one BIG piece of thermal pad and put some MX5 on there...
DONT BE SCARED GO SLOW MAKE ROOM AND TAKE UR TIME...