Here's the last bit of kernel log. Usually nothing is captured - at least here - when this happens.
The box is running four Einstein@Home work units in parallel. GPU temps never get much above 140-150F, and the CPU cores are similar, typically 120-130F tops. I'm not sure about the other
'drm' messages. I get them all the time, but they don't seem to crash the box. Is the gfxhub page fault the culprit?
Or is it a memory error?
---------------------------------------------------------------------------------------------
Dec 27 00:07:12 kernel: [107576.628702] [drm] Unknown EDID CEA parser results
Dec 27 00:07:12 kernel: [107576.642108] [drm] Unknown EDID CEA parser results
Dec 27 00:23:50 kernel: [108574.671480] [drm] Unknown EDID CEA parser results
Dec 27 00:41:01 kernel: [109605.667960] amdgpu 0000:03:00.0: amdgpu: [gfxhub] page fault (src_id:0 ring:88 vmid:8 pasid:32771, for process einstein_O3AS_1 pid 9689 thread einstein_O3AS_1 pid 9689)
Dec 27 00:41:01 kernel: [109605.667967] amdgpu 0000:03:00.0: amdgpu: in page starting at address 0x00007f87e6ccc000 from client 0x1b (UTCL2)
Dec 27 00:41:01 kernel: [109605.667970] amdgpu 0000:03:00.0: amdgpu: GCVM_L2_PROTECTION_FAULT_STATUS:0x008008B0
Dec 27 00:41:01 kernel: [109605.667971] amdgpu 0000:03:00.0: amdgpu: Faulty UTCL2 client ID: CPF (0x4)
Dec 27 00:41:01 kernel: [109605.667972] amdgpu 0000:03:00.0: amdgpu: MORE_FAULTS: 0x0
Dec 27 00:41:01 kernel: [109605.667973] amdgpu 0000:03:00.0: amdgpu: WALKER_ERROR: 0x0
Dec 27 00:41:01 kernel: [109605.667974] amdgpu 0000:03:00.0: amdgpu: PERMISSION_FAULTS: 0xb
Dec 27 00:41:01 kernel: [109605.667975] amdgpu 0000:03:00.0: amdgpu: MAPPING_ERROR: 0x0
Dec 27 00:41:01 kernel: [109605.667976] amdgpu 0000:03:00.0: amdgpu: RW: 0x0
Dec 27 00:47:22 kernel: [109985.919646] [drm] Unknown EDID CEA parser results
Dec 27 00:47:22 kernel: [109985.932499] [drm] Unknown EDID CEA parser results
---------------------------------------------------------------
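In case it helps anyone reading the fault lines, here is a minimal Python sketch that splits the GCVM_L2_PROTECTION_FAULT_STATUS value into the fields the driver printed. The bit positions are my assumption, inferred only from the fact that they reproduce the decoded values in the log above (PERMISSION_FAULTS 0xb, client ID 0x4, vmid 8); they are not taken from AMD documentation, and the function name is made up for illustration.

```python
# Assumed (shift, width) layout of GCVM_L2_PROTECTION_FAULT_STATUS.
# Inferred from the decoded lines in the log, not from official docs.
FIELDS = {
    "MORE_FAULTS": (0, 1),
    "WALKER_ERROR": (1, 3),
    "PERMISSION_FAULTS": (4, 4),
    "MAPPING_ERROR": (8, 1),
    "CID": (9, 9),       # the "Faulty UTCL2 client ID" in the log
    "RW": (18, 1),
    "VMID": (20, 4),
}


def decode_fault_status(status: int) -> dict:
    """Extract each assumed bitfield from the raw status word."""
    return {
        name: (status >> shift) & ((1 << width) - 1)
        for name, (shift, width) in FIELDS.items()
    }


if __name__ == "__main__":
    # Status value copied from the log line at 00:41:01.
    for name, value in decode_fault_status(0x008008B0).items():
        print(f"{name}: {value:#x}")
```

This only restates what the kernel already printed, so it says nothing about the cause. It is just a quick way to compare status words if more of these faults show up.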
As far as I know, everything is running near stock: no overclock, etc. Plug and pray...
Any ideas welcome, TIA and Happy New Year.
Mike