cancel
Showing results for 
Search instead for 
Did you mean: 

Drivers & Software

hydrian
Adept I

Stability issues with 17.50 on Ubuntu 16.04

I'm currently running Ubuntu 16.04.3 with the linux-image-generic-hwe-16.04 a version 4.13.0.32.52 (current). I'm getting some hardware lockups. It doesn't matter if I'm in a hardware accelerated game or just on my desktop environemnt (mate). When it locks up, the whole machine freeze and the only way to get out of it is a hard reboot. I haven't seen anything in the logs to help me determine the issue.

This wasn't the case when I was previously using the A8-7600 's integrated Radeon R7 GPU in the same computer. 

I disabled the APU's GPU and installed an RX 570. I switched over the the Ubuntu HWE kernel and graphics stack and installed the amdgpu-pro 17.50-511655 driver. After fighting with removing some old fglrx drivers from the previous GPU, I was able to compile and load the amdgpu(-pro) driver successfully. Now I'm getting the random freezes that require the hard reboot to recover from.

Any ideas?

Other system info:

System:    Host: balor Kernel: 4.13.0-32-generic x86_64 (64 bit gcc: 5.4.0)

           Desktop: MATE 1.18.0 (Gtk 3.18.9-1ubuntu3.3) Distro: Linux Mint 18.3 Sylvia

Machine:   Mobo: ASRock model: FM2A88X Extreme6+ Bios: American Megatrends v: P4.20 date: 01/13/2016

CPU:       Quad core AMD A8-7600 Radeon R7 10 Compute Cores 4C+6G (-MCP-) cache: 8192 KB

           flags: (lm nx sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm) bmips: 24752

           clock speeds: max: 3100 MHz 1: 1900 MHz 2: 1400 MHz 3: 1400 MHz 4: 1400 MHz

Graphics:  Card: Advanced Micro Devices [AMD/ATI] Device 67df bus-ID: 01:00.0

           Display Server: X.Org 1.19.5 drivers: ati,amdgpu (unloaded: fbdev,vesa,radeon)

           Resolution: 1280x1024@60.02hz

           GLX Renderer: Radeon RX 570 Series GLX Version: 4.5.13505 - CPC 17.50.2.13 Direct Rendering: Yes

Audio:     Card-1 Advanced Micro Devices [AMD] FCH Azalia Controller driver: snd_hda_intel bus-ID: 00:14.2

           Card-2 Advanced Micro Devices [AMD/ATI] Device aaf0 driver: snd_hda_intel bus-ID: 01:00.1

           Sound: Advanced Linux Sound Architecture v: k4.13.0-32-generic

Network:   Card-1: Intel Wireless 7260 driver: iwlwifi bus-ID: 03:00.0

           IF: wlan2 state: up mac: 7c:5c:f8:17:1b:e1

           Card-2: Qualcomm Atheros QCA8171 Gigabit Ethernet driver: alx port: c000 bus-ID: 05:00.0

           IF: eth0 state: up speed: 1000 Mbps duplex: full mac: d0:50:99:61:40:41

Drives:    HDD Total Size: 4821.0GB (23.6% used) ID-1: /dev/sda model: TOSHIBA_MD04ACA4 size: 4000.8GB

           ID-2: /dev/sdb model: WDC_WD5000BPKT size: 500.1GB ID-3: /dev/sdc model: ST3320620AS size: 320.1GB

Partition: ID-1: / size: 286G used: 238G (88%) fs: ext4 dev: /dev/sdc1

           ID-2: swap-1 size: 7.97GB used: 0.00GB (0%) fs: swap dev: /dev/sdc5

RAID:      Device-1: /dev/md0 - active components: online: sda1[0]

           Info: raid: 1 report: 2/1 blocks: 3906885632 chunk size: N/A bitmap: true

Sensors:   System Temperatures: cpu: 14.9C mobo: N/A gpu: 36.0

           Fan Speeds (in rpm): cpu: N/A

Info:      Processes: 255 Uptime: 1:05 Memory: 4507.5/24106.6MB Init: systemd runlevel: 5 Gcc sys: 5.4.0

           Client: Shell (bash 4.3.481) inxi: 2.2.35

0 Likes
2 Solutions
poggs
Adept I

I'm having this exact problem right now - same amdgpu-pro driver, and same kernel version.  After logging in to the GUI, I see the following in my dmesg, before my desktop has completely loaded:

[   27.971530] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c780402

[   27.971533] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971534] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E004002

[   27.971536] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC3' (0x54433300) (4)

[   27.971714] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c780402

[   27.971715] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971716] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E004002

[   27.971718] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC3' (0x54433300) (4)

[   27.971772] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c780402

[   27.971773] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971774] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E004002

[   27.971775] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC3' (0x54433300) (4)

[   27.971801] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c780402

[   27.971802] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971803] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E004002

[   27.971805] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC3' (0x54433300) (4)

[   27.971830] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c780402

[   27.971831] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971833] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E004002

[   27.971834] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC3' (0x54433300) (4)

[   27.971859] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c780402

[   27.971861] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971862] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E004002

[   27.971863] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC3' (0x54433300) (4)

[   27.971888] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c780402

[   27.971890] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971891] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E004002

[   27.971893] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC3' (0x54433300) (4)

[   27.971918] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c780402

[   27.971920] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971921] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E004002

[   27.971922] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC3' (0x54433300) (4)

[   27.971950] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c784802

[   27.971951] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971953] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E048002

[   27.971954] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC0' (0x54433000) (72)

[   27.971983] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c784802

[   27.971984] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971985] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E048002

[   27.971987] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC0' (0x54433000) (72)

The screen froze, the mouse pointer worked, but I couldn't move the mouse out of the screen it was within at the time of the crash.  Even if I ssh'd in, trying to reboot via the command line failed.

Booting in to the previous 4.13.0-31-generic kernel from GRUB fixed this for me.

View solution in original post

I'm running 4.13.0-36-generic here and it's so stable I forgot previous kernels were an issue.

View solution in original post

9 Replies
poggs
Adept I

I'm having this exact problem right now - same amdgpu-pro driver, and same kernel version.  After logging in to the GUI, I see the following in my dmesg, before my desktop has completely loaded:

[   27.971530] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c780402

[   27.971533] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971534] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E004002

[   27.971536] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC3' (0x54433300) (4)

[   27.971714] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c780402

[   27.971715] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971716] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E004002

[   27.971718] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC3' (0x54433300) (4)

[   27.971772] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c780402

[   27.971773] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971774] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E004002

[   27.971775] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC3' (0x54433300) (4)

[   27.971801] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c780402

[   27.971802] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971803] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E004002

[   27.971805] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC3' (0x54433300) (4)

[   27.971830] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c780402

[   27.971831] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971833] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E004002

[   27.971834] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC3' (0x54433300) (4)

[   27.971859] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c780402

[   27.971861] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971862] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E004002

[   27.971863] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC3' (0x54433300) (4)

[   27.971888] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c780402

[   27.971890] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971891] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E004002

[   27.971893] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC3' (0x54433300) (4)

[   27.971918] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c780402

[   27.971920] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971921] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E004002

[   27.971922] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC3' (0x54433300) (4)

[   27.971950] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c784802

[   27.971951] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971953] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E048002

[   27.971954] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC0' (0x54433000) (72)

[   27.971983] amdgpu 0000:65:00.0: GPU fault detected: 147 0x0c784802

[   27.971984] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_ADDR   0x00A0198F

[   27.971985] amdgpu 0000:65:00.0:   VM_CONTEXT1_PROTECTION_FAULT_STATUS 0x0E048002

[   27.971987] amdgpu 0000:65:00.0: VM fault (0x02, vmid 7) at page 10492303, read from 'TC0' (0x54433000) (72)

The screen froze, the mouse pointer worked, but I couldn't move the mouse out of the screen it was within at the time of the crash.  Even if I ssh'd in, trying to reboot via the command line failed.

Booting in to the previous 4.13.0-31-generic kernel from GRUB fixed this for me.

Hello,

i've the same problem.

System crashes after upgrade to 17.50 after some time as excactly described here before.

Ubuntu 16.04 LTS

Kernel: 4.13.0-32-generic 4.13.0-32.35~16.04.1

Best Regards

0 Likes

Hi,

here are some more information:

i downgraded my kernel to 4.13.0.31.34~16.04.1 as descirbed before, but this did not helped.

linux-image-extra-4.13.0-31-generic                         4.13.0-31.34~16.04.1

cat /etc/lsb-release

DISTRIB_ID=Ubuntu

DISTRIB_RELEASE=16.04

DISTRIB_CODENAME=xenial

DISTRIB_DESCRIPTION="Ubuntu 16.04.3 LTS"

Graphic Card RX480.

AMDGPU-PRO 17.50

Here is my last log i found on the system and i've been seen some time before:

Feb 11 16:10:10 thomas-desktop kernel: [ 1098.661844] mmap: qtdemux0:sink (6972): VmData 272039936 exceed data ulimit 271973775. Update limits or use boot option ignore_rlimit_data.

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.886658] [drm] Atomic commit: RESET. crtc id 0:[ffff91d94f240000]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.886663] [drm] dc_commit_context: 1 streams

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.886665] [drm] core_stream 0x5513a800: src: 0, 0, 2560, 1440; dst: 0, 0, 2560, 1440, colorSpace:1

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.886667] [drm]     pix_clk_khz: 241500, h_total: 2720, v_total: 1481, pixelencoder:1, displaycolorDepth:2

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.886669] [drm]     sink name: ASUS PB278, serial: 171750

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.886670] [drm]     link: 1

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.886681] BUG: unable to handle kernel NULL pointer dereference at 00000000000002e0

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.886811] IP: dce110_disable_stream+0x20/0x100 [amdgpu]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.886837] PGD 0

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.886838] P4D 0

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.886849]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.886870] Oops: 0000 [#1] SMP NOPTI

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.886889] Modules linked in: nls_iso8859_1 xt_CHECKSUM iptable_mangle ipt_REJECT nf_reject_ipv4 xt_tcpudp ebtable_filter ebtables ip6table_filter ip6_tables rpcsec_gss_krb5 auth_rpcgss nfsv4 nfs lockd grace fscache ipt_MASQUERADE nf_nat_masquerade_ipv4 xfrm_user xfrm_algo iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype iptable_filter ip_tables xt_conntrack x_tables nf_nat nf_conntrack libcrc32c br_netfilter bridge stp llc aufs uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_core videodev media joydev snd_usb_audio snd_usbmidi_lib usblp binfmt_misc arc4 edac_mce_amd crct10dif_pclmul crc32_pclmul ath5k ghash_clmulni_intel ath pcbc mac80211 snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi input_leds aesni_intel snd_hda_intel aes_x86_64 crypto_simd

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.887263]  snd_seq_midi snd_hda_codec glue_helper cfg80211 snd_seq_midi_event cryptd snd_hda_core snd_rawmidi wmi_bmof snd_seq snd_hwdep snd_pcm k10temp i2c_piix4 fam15h_power snd_seq_device snd_timer snd soundcore nuvoton_cir shpchp rc_core mac_hid kvm_amd kvm irqbypass parport_pc sunrpc ppdev lp parport autofs4 hid_generic usbhid hid uas usb_storage amdkfd(OE) amd_iommu_v2 amdgpu(OE) amdttm(OE) mxm_wmi amdkcl(OE) i2c_algo_bit drm_kms_helper r8169 syscopyarea mii sysfillrect sysimgblt fb_sys_fops e1000e ahci drm libahci ptp pps_core wmi

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.887500] CPU: 1 PID: 2275 Comm: Xorg Tainted: G           OE   4.13.0-31-generic #34~16.04.1-Ubuntu

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.887541] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./970 Extreme4, BIOS P2.80 08/05/2015

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.887585] task: ffff91d953725d00 task.stack: ffffb50f8466c000

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.887663] RIP: 0010:dce110_disable_stream+0x20/0x100 [amdgpu]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.887690] RSP: 0018:ffffb50f8466f830 EFLAGS: 00010286

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.887713] RAX: ffff91d9570a7c00 RBX: ffff91d90c190158 RCX: 0000000000000001

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.887744] RDX: 0000000000000000 RSI: ffff91d5ebc6c000 RDI: ffff91d94f3b3dc0

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.887776] RBP: ffffb50f8466f840 R08: ffffb50f8466f9ac R09: 000000000000053e

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.887807] R10: ffffb50f8466fa38 R11: 000000000000053e R12: ffff91d90c190158

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.887838] R13: ffff91d5ebc6c158 R14: ffff91d9570a7c00 R15: ffff91d94f3d0000

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.887870] FS:  00007f76d66d1a00(0000) GS:ffff91d97ec40000(0000) knlGS:0000000000000000

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.887905] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.887931] CR2: 00000000000002e0 CR3: 000000081213c000 CR4: 00000000000406e0

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.887962] Call Trace:

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888031]  core_link_disable_stream+0x51/0x230 [amdgpu]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888136]  dce110_reset_hw_ctx_wrap+0xa1/0x190 [amdgpu]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888228]  dce110_apply_ctx_to_hw+0x4f/0x8e0 [amdgpu]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888258]  ? sched_clock+0x9/0x10

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888276]  ? up+0x32/0x50

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888336]  ? amdgpu_cgs_read_register+0x14/0x20 [amdgpu]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888407]  ? generic_reg_get+0x24/0x60 [amdgpu]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888478]  dc_commit_context_no_check+0xcf/0x2e0 [amdgpu]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888553]  dc_commit_context+0x97/0xf0 [amdgpu]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888625]  amdgpu_dm_atomic_commit_tail+0x1e2/0xa40 [amdgpu]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888653]  ? ww_mutex_unlock+0x26/0x30

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888721]  ? dm_plane_helper_prepare_fb+0xc9/0x220 [amdgpu]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888757]  commit_tail+0x3f/0x70 [drm_kms_helper]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888786]  drm_atomic_helper_commit+0x9c/0xe0 [drm_kms_helper]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888863]  amdgpu_dm_atomic_commit+0x9b/0xb0 [amdgpu]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888905]  drm_atomic_commit+0x4b/0x50 [drm]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888930]  drm_atomic_helper_connector_dpms+0xff/0x170 [drm_kms_helper]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.888972]  drm_mode_connector_set_obj_prop+0x62/0x70 [drm]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889028]  drm_mode_obj_set_property_ioctl+0x11b/0x160 [drm]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889066]  ? drm_mode_connector_set_obj_prop+0x70/0x70 [drm]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889103]  drm_mode_connector_property_set_ioctl+0x3f/0x60 [drm]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889140]  drm_ioctl_kernel+0x69/0xb0 [drm]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889169]  drm_ioctl+0x3e4/0x450 [drm]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889198]  ? drm_mode_connector_set_obj_prop+0x70/0x70 [drm]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889225]  ? ep_ptable_queue_proc+0xa0/0xa0

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889245]  ? timerqueue_add+0x59/0x90

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889299]  amdgpu_drm_ioctl+0x4c/0x80 [amdgpu]

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889321]  do_vfs_ioctl+0xa1/0x5f0

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889338]  ? entry_SYSCALL_64_after_hwframe+0x118/0x168

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889363]  ? entry_SYSCALL_64_after_hwframe+0x111/0x168

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889388]  ? entry_SYSCALL_64_after_hwframe+0x10a/0x168

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889413]  ? entry_SYSCALL_64_after_hwframe+0x103/0x168

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889437]  ? entry_SYSCALL_64_after_hwframe+0xfc/0x168

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889461]  ? entry_SYSCALL_64_after_hwframe+0xf5/0x168

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889486]  ? entry_SYSCALL_64_after_hwframe+0xee/0x168

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889510]  ? entry_SYSCALL_64_after_hwframe+0xe7/0x168

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889534]  ? entry_SYSCALL_64_after_hwframe+0xe0/0x168

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889558]  SyS_ioctl+0x79/0x90

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889574]  ? entry_SYSCALL_64_after_hwframe+0xa1/0x168

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889598]  entry_SYSCALL_64_fastpath+0x33/0xa3

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889619] RIP: 0033:0x7f76d40bff47

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889635] RSP: 002b:00007ffc4c6631c8 EFLAGS: 00003202 ORIG_RAX: 0000000000000010

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889672] RAX: ffffffffffffffda RBX: 000055e25566ace0 RCX: 00007f76d40bff47

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889703] RDX: 00007ffc4c663270 RSI: 00000000c01064ab RDI: 000000000000000d

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889736] RBP: 00007ffc4c663200 R08: 00007ffc4c663260 R09: 0000000000000001

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889767] R10: 00007ffc4c663170 R11: 0000000000003202 R12: 000055e255677880

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889798] R13: 000055e255d99f70 R14: 000055e255669ac0 R15: 000055e255677801

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.889831] Code: d1 e8 89 43 1c eb 86 0f 1f 40 00 0f 1f 44 00 00 55 48 89 e5 41 54 53 48 8b 47 08 48 89 fb 48 8b bf f0 00 00 00 48 8b 10 48 85 ff <4c> 8b a2 e0 02 00 00 74 39 48 8b 07 ff 50 18 48 8b 43 08 8b b8

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.890009] RIP: dce110_disable_stream+0x20/0x100 [amdgpu] RSP: ffffb50f8466f830

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.890041] CR2: 00000000000002e0

Feb 11 16:12:33 thomas-desktop kernel: [ 1240.913204] ---[ end trace afd2762d802dfab3 ]---

0 Likes

Thanks poggs​​. That seems to stabilize my issues. Your description of your issues matched mine exactly too. I never used any kernel before 4.13.0-32 because this was my first install of the GPU. I didn't know if this was a hardware issue or software stack issue.

hydrian
Adept I

Now we need to figure out if any thing past 4.13.0-32 will works. 4.13.0-31 is still venerable to all of the Specter and Meldown related kernel exploits.

0 Likes

Hi, i got only one failure with the 4.13.0-31, but after this everything runs normally.

With an update to linux kernel 4.14.20-041420 the screen freezed again directly.

0 Likes

tbludau​ I wouldn't expect that kernel 4.14.20-041420 work. The AMD provided drivers explicitly say that their amdgpu drivers only support the linux-generic-hwe kernels on ubuntu. I'm using mint-18.3 so use linux-generic-hwe-16.04 drivers stack. You also have to make sure your using the XOrg drivers that are linked to the hwe version. 

My concern is the Meltdown / Spectra fixes are aimed for 4.13.0-34. I'm hoping 4.13.0-32 is just a blip and not the new standard of driver quality.

0 Likes
hydrian
Adept I

Now that 4.13.0.36.55 is our with all of the spectra/meltdown patches, has anybody upgrade to that kernel and what were their experiences with the 17.50 AMD driver?.

0 Likes

I'm running 4.13.0-36-generic here and it's so stable I forgot previous kernels were an issue.