1 Reply Latest reply on Nov 27, 2017 9:06 AM by amdmatt

    Kernel Crash in X11 session with ROCm kernel

    tuxontour

      Hello,

       

      I've got a kernel crash in the ROCm kernel 4.11.0-kfd-compute-rocm-rel-1.6-180 .

      To reproduce it I just start Kubuntu linux 17.10 64 bit (17.04 had the same error too) and open a few windows. Suddenly after a few seconds everything freezes. I can still log in from another computer via SSH.

       

      With the stock kernel the system runs fine.

       

      I get the following trace in the syslog:

       

      [   33.165198]  CPU: 15 PID: 1183 Comm: Xorg Tainted: G     W     4.11.0-kfd-compute-rocm-rel-1.6-180 #1

      [   33.165199] Hardware name: MSI MS-7A37/B350M MORTAR (MS-7A37), BIOS 1.70 09/19/2017

      [   33.165200] Call Trace:

      [   33.165205]  dump_stack+0x63/0x90

      [   33.165208]  __warn+0xcb/0xf0

      [   33.165212]  warn_slowpath_null+0x1d/0x20

      [   33.165271]  dc_surface_retain+0x34/0x50 [amdgpu]

      [   33.165330]  resource_attach_surfaces_to_context+0xb1/0x3e0 [amdgpu]

      [   33.165388]  resource_validate_attach_surfaces+0xad/0x160 [amdgpu]

      [   33.165449]  dce112_validate_with_context+0x15e/0x1c0 [amdgpu]

      [   33.165507]  dc_get_validate_context+0x75/0xe0 [amdgpu]

      [   33.165573]  amdgpu_dm_atomic_check+0x48e/0xb80 [amdgpu]

      [   33.165576]  ? __alloc_skb+0x10/0x1a0

      [   33.165596]  drm_atomic_check_only+0x468/0x590 [drm]

      [   33.165614]  drm_atomic_nonblocking_commit+0x18/0x60 [drm]

      [   33.165677]  amdgpu_atomic_helper_page_flip+0x13f/0x170 [amdgpu]

      [   33.165692]  drm_mode_page_flip_ioctl+0x3ae/0x410 [drm]

      [   33.165706]  drm_ioctl+0x1fc/0x450 [drm]

      [   33.165721]  ? drm_mode_cursor2_ioctl+0x10/0x10 [drm]

      [   33.165724]  ? do_readv_writev+0x8c/0xb0

      [   33.165766]  amdgpu_drm_ioctl+0x4c/0x80 [amdgpu]

      [   33.165770]  do_vfs_ioctl+0x92/0x5a0

      [   33.165772]  ? vfs_writev+0x3c/0x50

      [   33.165775]  SyS_ioctl+0x79/0x90

      [   33.165778]  entry_SYSCALL_64_fastpath+0x1e/0xad

      [   33.165780] RIP: 0033:0x7f4e7fb8eea7

      [   33.165781] RSP: 002b:00007fffa72fb618 EFLAGS: 00003246 ORIG_RAX: 0000000000000010

      [   33.165784] RAX: ffffffffffffffda RBX: 0000000000000002 RCX: 00007f4e7fb8eea7

      [   33.165785] RDX: 00007fffa72fb650 RSI: 00000000c01864b0 RDI: 0000000000000017

      [   33.165786] RBP: 00007fffa72fc650 R08: 0000000000001208 R09: 000000000000072f

      [   33.165787] R10: 0000000000000007 R11: 0000000000003246 R12: 00000000c01864b0

      [   33.165788] R13: 0000000000000017 R14: 0000562802a3f600 R15: 0000562802b67120

      [   33.165791] ---[ end trace 474e1b968676584c ]---

      [   33.167211] ------------[ cut here ]------------

       

      Relevant Hardware:

      Mainboard: MSI Mortar B350M with Ryzen 1700 and newest BIOS

      Graphics Card: Gigabyte Radeon RX 580 Gaming 8G

       

      Thanks