My system is an Asus G513QY with an AMD 5980HX CPU and a 6800M GPU, running GPU drivers 22.11.2.
To start with, my temperatures are fine, and running memtest showed no RAM hardware problems. The vBIOS is up to date from Asus.
After testing various applications and games and watching temperatures, I finally found what was causing my GPU drivers to crash so hard that my computer reboots. If OBS Studio is recording with hardware encoding (it happens with both H.264 and H.265), my computer will occasionally hard crash and reboot. So far this has only happened while I am playing Escape from Tarkov, but I play that game the most by far, so that skews things a bit. It only happens while OBS Studio is recording; I can play without any crashes if no recording is running.
Below is a minidump from one of the crashes. The debug info says I am running Windows 10, but I am actually running Windows 11. Not sure why it is displaying that. At the bottom of the output you can see the lovely 0x9F_3_amdkmdag_IMAGE_pci.sys item that points to AMD.
*******************************************************************************
*                                                                             *
*                        Bugcheck Analysis                                    *
*                                                                             *
*******************************************************************************

DRIVER_POWER_STATE_FAILURE (9f)
A driver has failed to complete a power IRP within a specific time.
Arguments:
Arg1: 0000000000000003, A device object has been blocking an IRP for too long a time
Arg2: ffffab0e2cee8300, Physical Device Object of the stack
Arg3: ffff8002308ef7d8, nt!TRIAGE_9F_POWER on Win7 and higher, otherwise the Functional Device Object of the stack
Arg4: ffffab0e439eb750, The blocked IRP

Debugging Details:
------------------

KEY_VALUES_STRING: 1

Key  : Analysis.CPU.mSec
Value: 2406

Key  : Analysis.DebugAnalysisManager
Value: Create

Key  : Analysis.Elapsed.mSec
Value: 2873

Key  : Analysis.IO.Other.Mb
Value: 0

Key  : Analysis.IO.Read.Mb
Value: 0

Key  : Analysis.IO.Write.Mb
Value: 0

Key  : Analysis.Init.CPU.mSec
Value: 234

Key  : Analysis.Init.Elapsed.mSec
Value: 11729

Key  : Analysis.Memory.CommitPeak.Mb
Value: 111

Key  : Bugcheck.Code.DumpHeader
Value: 0x9f

Key  : Bugcheck.Code.Register
Value: 0x9f

Key  : Dump.Attributes.AsUlong
Value: 1008

Key  : Dump.Attributes.DiagDataWrittenToHeader
Value: 1

Key  : Dump.Attributes.ErrorCode
Value: 0

Key  : Dump.Attributes.KernelGeneratedTriageDump
Value: 1

Key  : Dump.Attributes.LastLine
Value: Dump completed successfully.

Key  : Dump.Attributes.ProgressPercentage
Value: 0

FILE_IN_CAB: 011723-14750-01.dmp

DUMP_FILE_ATTRIBUTES: 0x1008
Kernel Generated Triage Dump

BUGCHECK_CODE: 9f
BUGCHECK_P1: 3
BUGCHECK_P2: ffffab0e2cee8300
BUGCHECK_P3: ffff8002308ef7d8
BUGCHECK_P4: ffffab0e439eb750

DRVPOWERSTATE_SUBCODE: 3
IMAGE_NAME: pci.sys
MODULE_NAME: pci
FAULTING_MODULE: fffff8021bfc0000 pci

BLACKBOXBSD: 1 (!blackboxbsd)
BLACKBOXNTFS: 1 (!blackboxntfs)
BLACKBOXPNP: 1 (!blackboxpnp)
BLACKBOXWINLOGON: 1

CUSTOMER_CRASH_COUNT: 1
PROCESS_NAME: System

STACK_TEXT:
ffff8002`308ef788 fffff802`19f72056 : 00000000`0000009f 00000000`00000003 ffffab0e`2cee8300 ffff8002`308ef7d8 : nt!KeBugCheckEx
ffff8002`308ef790 fffff802`19f71f2c : 00000000`00000004 ffff8381`a52c5180 ffffab0e`38914b48 ffff8002`308ef909 : nt!PopIrpWatchdogBugcheck+0x122
ffff8002`308ef810 fffff802`19cbb97b : ffffab0e`00000009 ffffab0e`00000001 ffff8002`00000000 00000000`00000002 : nt!PopIrpWatchdog+0xc
ffff8002`308ef840 fffff802`19cbd0f6 : 00000000`00000000 00000000`00000000 00000000`00000000 00000000`0099c28e : nt!KiProcessExpiredTimerList+0x1eb
ffff8002`308ef970 fffff802`19e2dd1e : ffff8381`a52c5180 ffff8381`a52c5180 ffffab0e`2b564080 ffffab0e`447d1080 : nt!KiRetireDpcList+0xed6
ffff8002`308efc00 00000000`00000000 : ffff8002`308f0000 ffff8002`308e9000 00000000`00000000 00000000`00000000 : nt!KiIdleLoop+0x9e

IMAGE_VERSION: 10.0.22621.900
STACK_COMMAND: .cxr; .ecxr ; kb
FAILURE_BUCKET_ID: 0x9F_3_amdkmdag_IMAGE_pci.sys
OSPLATFORM_TYPE: x64
OSNAME: Windows 10
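A side note on the Windows 10 label in the output above: Windows 11 still reports its version as 10.0, and client builds from 22000 upward are Windows 11, so build 22621 in IMAGE_VERSION would be consistent with a Windows 11 install. The sketch below pulls the headline fields out of analyzer text like this and infers the OS from the build number. It is only an illustration: the `summarize` helper and the `SAMPLE` string are hypothetical, and it assumes IMAGE_VERSION for an in-box driver such as pci.sys tracks the OS build.

```python
import re

# A few fields copied from the analysis output above (hypothetical sample string).
SAMPLE = """\
BUGCHECK_CODE: 9f
IMAGE_VERSION: 10.0.22621.900
FAILURE_BUCKET_ID: 0x9F_3_amdkmdag_IMAGE_pci.sys
OSNAME: Windows 10
"""


def summarize(dump_text):
    """Return the bugcheck code, build number, inferred OS, and bucket ID."""

    def field(name):
        # Grab the first whitespace-delimited token after "NAME:".
        match = re.search(rf"{name}:\s*(\S+)", dump_text)
        return match.group(1) if match else None

    version = field("IMAGE_VERSION")
    build = int(version.split(".")[2]) if version else None

    # Assumption: client builds 22000 and up are Windows 11,
    # even though the reported major.minor version stays 10.0.
    os_name = None
    if build is not None:
        os_name = "Windows 11" if build >= 22000 else "Windows 10"

    return {
        "bugcheck": field("BUGCHECK_CODE"),
        "build": build,
        "os": os_name,
        "bucket": field("FAILURE_BUCKET_ID"),
    }


if __name__ == "__main__":
    for key, value in summarize(SAMPLE).items():
        print(f"{key}: {value}")
```

The bucket ID is the field worth quoting to support, since it names both amdkmdag (the AMD kernel-mode display driver) and pci.sys.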
I contacted support, who gave the typical advice:
- Run the AMD driver cleanup utility
- Reinstall the drivers
- Reinstall your OS
I had already done all that. They told me to use Asus-specific drivers or contact Asus for support. In other words, the first-level tech support guy doesn't know and doesn't want to actually send this to the engineering team.
Anyone else know of a potential fix?
Have you tried the latest (23.2.1) drivers? They specifically mention OBS in the release notes, so it's probably worth a look, unless there is some particular reason you need to stay on the older version.