
amdgpu-pro 17.30 Linux "radeon_setup_kernel_mem failed"

Question asked by fermulator on Sep 25, 2017
Latest reply on Jan 2, 2018 by fermulator

Posted this in IRC as well (freenode, #amdgpu)

 

intro MANDATORY information:

----

  • AMD Graphics Card
    • Hawaii XT AMD Radeon R9 290X
  • Desktop or Laptop System
    • desktop
  • Operating System
    • Ubuntu 16.04.3 LTS 64-bit - Linux fermmy 4.4.0-96-generic #119-Ubuntu SMP Tue Sep 12 14:59:54 UTC 2017 x86_64 x86_64 x86_64 GNU/Linux
  • Driver version installed
    • see attached (17.30)
  • Display Devices
    • dual monitor, 1680x1050 each
  • Motherboard + Bios Revision
    • SABERTOOTH 990FX R2.0
  • CPU/APU
    • AMD FX(tm)-8350 Eight-Core Processor

 

Description of problem:

---

Yesterday I upgraded the amdgpu-pro Linux driver from v17.10 to v17.30 (followed "How-To Install/Uninstall AMD Radeon™ Software AMDGPU-PRO Driver for Linux® on an Ubuntu System").

 

I use two primary local user accounts. My PRIMARY one worked fine (I have been using it), but this morning I tried to log in to my other account and Xorg init fails with "radeon_setup_kernel_mem failed". Snippet of info: https://pastebin.com/f0SinVDR (see session ID 17873)

 

A "symptomatic" description of the problem (this procedure worked fine yesterday, before the driver upgrade):

1. From the active PRIMARY user account running gnome3, choose "switch user".

2. Select the other SECONDARY user account, enter the password, GO.

3. GDM flashes for a moment, then bails back to the login screen listing all the users.

  (Rinse and repeat.) I analyzed the logs and found the errors in the snippet that follows.

 

Also attached: [amdgpu-pro_xorg_user_log.txt]

 

SNIPPET:

{{{

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (II) Loading sub module "ramdac"

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (II) LoadModule: "ramdac"

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (II) Module "ramdac" already built-in

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (II) UnloadModule: "modesetting"

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (II) Unloading modesetting

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (II) UnloadModule: "fbdev"

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (II) Unloading fbdev

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (II) UnloadSubModule: "fbdevhw"

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (II) Unloading fbdevhw

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (II) UnloadModule: "vesa"

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (II) Unloading vesa

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (--) Depth 24 pixmap format is 32 bpp

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (II) RADEON(0): [DRI2] Setup complete

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (II) RADEON(0): [DRI2]   DRI driver: radeonsi

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (II) RADEON(0): [DRI2]   VDPAU driver: radeonsi

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (EE) RADEON(0): failed to initialise surface manager

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (EE) RADEON(0): radeon_setup_kernel_mem failed

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (EE)

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: Fatal server error:

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (EE) AddScreen/ScreenInit failed for driver 0

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (EE)

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (EE)

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: Please consult the The X.Org Foundation support

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: #011 at http://wiki.x.org

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]:  for help.

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (EE) Please also check the log file at "/home/<SECONDARY_USER>/.local/share/xorg/Xorg.2.log" for additional information.

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (EE)

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: (EE) Server terminated with error (1). Closing log file.

Sep 25 08:38:04 fermmy /usr/lib/gdm3/gdm-x-session[17873]: Unable to run X server

}}}
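
For anyone retracing this, here is a minimal sketch of pulling only the error lines for one session out of the log. It assumes the messages land in /var/log/syslog (the Ubuntu default, which may need sudo or membership in the adm group to read); the function name xsession_errors is made up for illustration.

```shell
#!/bin/sh
# Print the X server error lines logged by one gdm-x-session process:
# "(EE)" lines that carry a message, plus the "Fatal server error" line.
#   $1: PID shown in the log tag, e.g. 17873 for gdm-x-session[17873]
#   $2: log file to read (assumption: defaults to /var/log/syslog)
xsession_errors() {
    pid="$1"
    logfile="${2:-/var/log/syslog}"
    # First keep only this session's lines, then drop everything that is
    # not an error; bare "(EE)" separator lines are skipped as well.
    grep -F "gdm-x-session[${pid}]:" "$logfile" \
        | grep -E '\(EE\) .|Fatal server error'
}
```

Usage would be something like `xsession_errors 17873`, or `xsession_errors 12260 /var/log/syslog` for the session logged after the reboot.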

 

I then proceeded to initiate a call with AMD technical support (CANADA) and acquired a ticket number.

 

Proceeding with the next steps of debugging (which I wanted to do anyway, just hadn't done yet)...

 

---

Logged out of my PRIMARY account, and tried to log back in.

--> FAIL (same symptoms/logs as above)

 

SO, this is not related to user accounts at all. (makes much more sense now)

 

---

Rebooted

--> FAIL - system stuck (black screen). I dug into the syslog from that time and, indeed, found the same thing:

{{{

Sep 25 09:25:01 fermmy /usr/lib/gdm3/gdm-x-session[12260]: (II) RADEON(0): [DRI2] Setup complete

Sep 25 09:25:01 fermmy /usr/lib/gdm3/gdm-x-session[12260]: (II) RADEON(0): [DRI2]   DRI driver: radeonsi

Sep 25 09:25:01 fermmy /usr/lib/gdm3/gdm-x-session[12260]: (II) RADEON(0): [DRI2]   VDPAU driver: radeonsi

Sep 25 09:25:01 fermmy /usr/lib/gdm3/gdm-x-session[12260]: (EE) RADEON(0): failed to initialise surface manager

Sep 25 09:25:01 fermmy /usr/lib/gdm3/gdm-x-session[12260]: (EE) RADEON(0): radeon_setup_kernel_mem failed

Sep 25 09:25:01 fermmy /usr/lib/gdm3/gdm-x-session[12260]: (EE)

Sep 25 09:25:01 fermmy /usr/lib/gdm3/gdm-x-session[12260]: Fatal server error:

Sep 25 09:25:01 fermmy /usr/lib/gdm3/gdm-x-session[12260]: (EE) AddScreen/ScreenInit failed for driver 0

}}}

 

---

Then I dropped into recovery mode, enabled networking + rw on the OS partition, and ran amdgpu-pro-uninstall, which removed all the bits.

After a reboot, the system is now on the open source driver (but functional):

{{{

$ sudo lshw -C video | grep driver

       configuration: driver=radeon latency=0

}}}
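
The same check can be wrapped so it prints only the driver name, which is handy for comparing before and after a reinstall. This is a rough sketch that assumes lshw prints a "configuration: driver=..." line as in the output above; the function name video_driver is hypothetical.

```shell
#!/bin/sh
# Read `lshw -C video` output on stdin and print just the kernel driver
# name taken from the "configuration: driver=<name> ..." line.
# Prints nothing if no driver= field is present.
video_driver() {
    sed -n 's/.*driver=\([^ ]*\).*/\1/p'
}
```

Usage: `sudo lshw -C video | video_driver`. On the system as described above this should print `radeon`, given the lshw line shown.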

---

 

As per instructions from support, I will attempt to re-install the driver next and update.

 

Message was edited by: Fermulator (updated with more debug/troubleshooting information)
