Server Gurus Discussions

bmayer
Journeyman III

Amdgpu Segmentation fault

I'm struggling to get my fresh Ubuntu installation (16.04.3 LTS) working with amdgpu-pro. I have an NVIDIA GPU installed in PCI slot 1, for which I haven't installed any drivers; my screen is connected to that card. I've now added an AMD RX 470 card in PCI slot 2 and installed the amdgpu package following the official installation instructions: Installation Instructions for amdgpu Pro / amdgpu All Open Graphics Stacks.

Sadly, after a reboot Xorg no longer starts. I get the following error message:

X.Org X Server 1.19.3

Release Date: 2017-03-15

X Protocol Version 11, Revision 0

Build Operating System: Linux 4.4.0-97-generic x86_64 Ubuntu

Current Operating System: Linux minion01 4.10.0-42-generic #46~16.04.1-Ubuntu SMP Mon Dec 4 15:57:59 UTC 2017 x86_64

Kernel command line: BOOT_IMAGE=/boot/vmlinuz-4.10.0-42-generic.efi.signed root=UUID=68b3a5ef-9413-47ac-99d7-ad9486493389 ro quiet splash vt.handoff=7

Build Date: 13 October 2017  02:11:50PM

xorg-server 2:1.19.3-1ubuntu1~16.04.4 (For technical support please see http://www.ubuntu.com/support)

Current version of pixman: 0.33.6

        Before reporting problems, check http://wiki.x.org

        to make sure that you have the latest version.

Markers: (--) probed, (**) from config file, (==) default setting,

        (++) from command line, (!!) notice, (II) informational,

        (WW) warning, (EE) error, (NI) not implemented, (??) unknown.

(==) Log file: "/var/log/Xorg.0.log", Time: Tue Jan  2 22:38:01 2018

(==) Using system config directory "/usr/share/X11/xorg.conf.d"

(II) [KMS] Kernel modesetting enabled.

amdgpu_device_initialize: DRM version is 1.3.1 but this driver is only compatible with 3.x.x.

amdgpu_device_initialize: DRM version is 1.3.1 but this driver is only compatible with 3.x.x.

(EE)

(EE) Backtrace:

(EE) 0: /usr/lib/xorg/Xorg (xorg_backtrace+0x4e) [0x5633bb503a9e]

(EE) 1: /usr/lib/xorg/Xorg (0x5633bb352000+0x1b57f9) [0x5633bb5077f9]

(EE) 2: /lib/x86_64-linux-gnu/libpthread.so.0 (0x7fd058926000+0x11390) [0x7fd058937390]

(EE) 3: /opt/amdgpu/lib/x86_64-linux-gnu/libdrm_amdgpu.so.1 (amdgpu_get_marketing_name+0xc) [0x7fd05480a994]

(EE) 4: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x25b181d) [0x7fd051f1081d]

(EE) 5: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x25b1f61) [0x7fd051f10f61]

(EE) 6: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x25b62a0) [0x7fd051f152a0]

(EE) 7: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x192d698) [0x7fd05128c698]

(EE) 8: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x18d8af2) [0x7fd051237af2]

(EE) 9: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x540a71) [0x7fd04fe9fa71]

(EE) 10: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x1463a67) [0x7fd050dc2a67]

(EE) 11: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x1463d37) [0x7fd050dc2d37]

(EE) 12: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x1478a2c) [0x7fd050dd7a2c]

(EE) 13: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x1478fa6) [0x7fd050dd7fa6]

(EE) 14: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x2574340) [0x7fd051ed3340]

(EE) 15: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x25744e4) [0x7fd051ed34e4]

(EE) 16: /opt/amdgpu-pro/lib/xorg/modules/extensions/libglx.so (0x7fd056d2b000+0x45a7f) [0x7fd056d70a7f]

(EE) 17: /opt/amdgpu-pro/lib/xorg/modules/extensions/libglx.so (GlxExtensionInit+0x138) [0x7fd056d6e8f1]

(EE) 18: /usr/lib/xorg/Xorg (InitExtensions+0x43) [0x5633bb418313]

(EE) 19: /usr/lib/xorg/Xorg (0x5633bb352000+0x581cc) [0x5633bb3aa1cc]

(EE) 20: /lib/x86_64-linux-gnu/libc.so.6 (__libc_start_main+0xf0) [0x7fd05857c830]

(EE) 21: /usr/lib/xorg/Xorg (_start+0x29) [0x5633bb394329]

(EE)

(EE) Segmentation fault at address 0x18

(EE)

Fatal server error:

(EE) Caught signal 11 (Segmentation fault). Server aborting

(EE)

(EE)

Please consult the The X.Org Foundation support

        at http://wiki.x.org

for help.

(EE) Please also check the log file at "/var/log/Xorg.0.log" for additional information.

(EE)

(EE) Server terminated with error (1). Closing log file.

I've googled a lot and found many similar issues, but none of the threads explained how to fix this. Can someone help me out? Thank you!
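
The two `DRM version is 1.3.1 but this driver is only compatible with 3.x.x` lines suggest that the amdgpu userspace library opened a DRM device owned by a kernel driver other than amdgpu. As a minimal diagnostic sketch, assuming the usual Linux sysfs layout under `/sys/class/drm` (the function name `drm_card_drivers` is made up for illustration), the following lists which kernel driver each card node is bound to:

```python
import os
import re


def drm_card_drivers(sysfs_root="/sys/class/drm"):
    """Map each DRM card node (card0, card1, ...) to its bound kernel driver.

    A card with no driver symlink maps to None.
    """
    drivers = {}
    for entry in sorted(os.listdir(sysfs_root)):
        # Skip connector entries such as card0-HDMI-A-1 and render nodes.
        if not re.fullmatch(r"card\d+", entry):
            continue
        link = os.path.join(sysfs_root, entry, "device", "driver")
        if os.path.islink(link):
            # The symlink target's last path component is the driver name.
            drivers[entry] = os.path.basename(os.path.realpath(link))
        else:
            drivers[entry] = None
    return drivers
```

Calling `drm_card_drivers()` on the affected machine and printing the result shows whether the RX 470 is actually bound to `amdgpu`. A card bound to a different driver, or to none, would be consistent with the version mismatch in the log, though it does not by itself explain the segfault.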

3 Replies
Anonymous
Not applicable

Hi bmayer,

Are you using an EPYC platform? Try installing it with these additional parameters:

amdgpu-install --opencl=legacy --headless

Please let me know if that works for you.

Hello,

As of today I am hitting a segfault in the exact same function, though under a different use case.

It reproduces with both of the most recent AMD Linux driver releases:

amdgpu-pro-18.30-641594.tar.xz

amdgpu-pro-18.40-676022-ubuntu-18.04.tar.xz

I grabbed 18.40 with the profound hope that the problem had been fixed. NOPE! 😞

Program terminated with signal SIGSEGV, Segmentation fault.

#0  0x00007fedd729eb4d in amdgpu_get_marketing_name ()

   from /opt/amdgpu/lib/x86_64-linux-gnu/libdrm_amdgpu.so.1

(gdb) bt

#0  0x00007fedd729eb4d in amdgpu_get_marketing_name ()

    at /opt/amdgpu/lib/x86_64-linux-gnu/libdrm_amdgpu.so.1

#1  0x00007fedda58acf4 in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

#2  0x00007fedda58b861 in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

#3  0x00007fedda3694b6 in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

#4  0x00007fedda38a187 in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

#5  0x00007fedd974c1b5 in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

#6  0x00007fedda216dec in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

#7  0x00007fedda58d785 in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

#8  0x00007fedda1dbafe in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

#9  0x00007fedda1dbeba in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

#10 0x00007fedda1dfbf9 in eglInitialize ()

    at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

#11 0x00007feddbe44fbd in gst_egl_adaptation_init_display ()

    at /usr/lib/x86_64-linux-gnu/gstreamer-1.0/libgstnveglglessink.so

#12 0x00007feddbe39d15 in egl_init ()

    at /usr/lib/x86_64-linux-gnu/gstreamer-1.0/libgstnveglglessink.so

#13 0x00007feddbe417e3 in gst_eglglessink_open ()

    at /usr/lib/x86_64-linux-gnu/gstreamer-1.0/libgstnveglglessink.so

#14 0x00007feddbe41aad in gst_eglglessink_change_state ()

    at /usr/lib/x86_64-linux-gnu/gstreamer-1.0/libgstnveglglessink.so

#15 0x00007fee11e72bde in gst_element_change_state ()

    at /usr/lib/x86_64-linux-gnu/libgstreamer-1.0.so.0

#16 0x00007fee11e73319 in  ()

    at /usr/lib/x86_64-linux-gnu/libgstreamer-1.0.so.0

#17 0x00007fee11e50942 in  ()

    at /usr/lib/x86_64-linux-gnu/libgstreamer-1.0.so.0

#18 0x00007fee11e72bde in gst_element_change_state ()

    at /usr/lib/x86_64-linux-gnu/libgstreamer-1.0.so.0

#19 0x00007fee11e73319 in  ()

    at /usr/lib/x86_64-linux-gnu/libgstreamer-1.0.so.0

#20 0x000055797ac6d530 in main (argc=<optimized out>, argv=<optimized out>)

    at SrcFilVdo.c:386

Please let me know what data I can provide to help troubleshoot this.

Thanks!

johnu

johnutz-pnsr
Journeyman III

Further data points:

1. This is what my app reports on startup:

     Xlib:  extension "DRI2" missing on display ":0".

     screen 0 does not appear to be DRI2 capable

     Xlib:  extension "DRI2" missing on display ":0".

     screen 0 does not appear to be DRI2 capable

     amdgpu_device_initialize: amdgpu_get_auth (1) failed (-1)

     Segmentation fault (core dumped) <- this is the crash whose gdb stack trace is in my previous post.

2. I have two GPUs in this HP z40 workstation:

    AMD RX 580                                <- selected as the boot graphics card in the workstation's BIOS settings

    NVIDIA GK104GL Quadro K5000 <- the card I am doing compute work on
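
The tell-tale messages quoted in this thread can be searched for mechanically. The following is a rough sketch, not an official tool: the name `scan_log`, the symptom labels, and the pattern list are assumptions for illustration, built only from the lines quoted in the posts above.

```python
import re

# Patterns derived from messages quoted in this thread; the labels are informal.
SYMPTOMS = [
    ("drm_version_mismatch",
     re.compile(r"amdgpu_device_initialize: DRM version is \d+\.\d+\.\d+ "
                r"but this driver is only compatible with")),
    ("auth_failed",
     re.compile(r"amdgpu_device_initialize: amdgpu_get_auth \(\d+\) failed")),
    ("dri2_missing",
     re.compile(r'extension "DRI2" missing on display')),
    ("crash_in_marketing_name",
     re.compile(r"amdgpu_get_marketing_name")),
]


def scan_log(text):
    """Return a dict mapping each symptom label to the log lines that matched."""
    hits = {}
    for raw in text.splitlines():
        line = raw.strip()
        for label, pattern in SYMPTOMS:
            if pattern.search(line):
                hits.setdefault(label, []).append(line)
    return hits
```

Feeding it the contents of `/var/log/Xorg.0.log` or an application's captured stderr gives a quick summary of which of these symptoms are present, which makes it easier to compare reports from different machines.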
