3 Replies Latest reply on Oct 31, 2018 12:34 PM by johnutz-pnsr

    Amdgpu Segmentation fault

    bmayer

      I'm struggling to get my fresh Ubuntu installation (16.04.3 LTS) working with amdgpu-pro. I have a NVIDIA gpu installed at PCI port 1 for which I haven't installed any drivers. My screen is connected to that card. Now I've installed an AMD RX 470 card to PCI port 2 and installed the amdgpu package following the installation instructions provided here: Installation Instructions for amdgpu Pro / amdgpu All Open Graphics Stacks .

       

      Sadly after a reboot Xorg doesn't start anymore. I'm getting the following error message:

       

       

       

      X.Org X Server 1.19.3

      Release Date: 2017-03-15

      X Protocol Version 11, Revision 0

      Build Operating System: Linux 4.4.0-97-generic x86_64 Ubuntu

      Current Operating System: Linux minion01 4.10.0-42-generic #46~16.04.1-Ubuntu SMP Mon Dec 4 15:57:59 UTC 2017 x86_64

      Kernel command line: BOOT_IMAGE=/boot/vmlinuz-4.10.0-42-generic.efi.signed root=UUID=68b3a5ef-9413-47ac-99d7-ad9486493389 ro quiet splash vt.handoff=7

      Build Date: 13 October 2017  02:11:50PM

      xorg-server 2:1.19.3-1ubuntu1~16.04.4 (For technical support please see http://www.ubuntu.com/support)

      Current version of pixman: 0.33.6

              Before reporting problems, check http://wiki.x.org

              to make sure that you have the latest version.

      Markers: (--) probed, (**) from config file, (==) default setting,

              (++) from command line, (!!) notice, (II) informational,

              (WW) warning, (EE) error, (NI) not implemented, (??) unknown.

      (==) Log file: "/var/log/Xorg.0.log", Time: Tue Jan  2 22:38:01 2018

      (==) Using system config directory "/usr/share/X11/xorg.conf.d"

      (II) [KMS] Kernel modesetting enabled.

      amdgpu_device_initialize: DRM version is 1.3.1 but this driver is only compatible with 3.x.x.

      amdgpu_device_initialize: DRM version is 1.3.1 but this driver is only compatible with 3.x.x.

      (EE)

      (EE) Backtrace:

      (EE) 0: /usr/lib/xorg/Xorg (xorg_backtrace+0x4e) [0x5633bb503a9e]

      (EE) 1: /usr/lib/xorg/Xorg (0x5633bb352000+0x1b57f9) [0x5633bb5077f9]

      (EE) 2: /lib/x86_64-linux-gnu/libpthread.so.0 (0x7fd058926000+0x11390) [0x7fd058937390]

      (EE) 3: /opt/amdgpu/lib/x86_64-linux-gnu/libdrm_amdgpu.so.1 (amdgpu_get_marketing_name+0xc) [0x7fd05480a994]

      (EE) 4: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x25b181d) [0x7fd051f1081d]

      (EE) 5: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x25b1f61) [0x7fd051f10f61]

      (EE) 6: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x25b62a0) [0x7fd051f152a0]

      (EE) 7: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x192d698) [0x7fd05128c698]

      (EE) 8: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x18d8af2) [0x7fd051237af2]

      (EE) 9: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x540a71) [0x7fd04fe9fa71]

      (EE) 10: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x1463a67) [0x7fd050dc2a67]

      (EE) 11: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x1463d37) [0x7fd050dc2d37]

      (EE) 12: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x1478a2c) [0x7fd050dd7a2c]

      (EE) 13: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x1478fa6) [0x7fd050dd7fa6]

      (EE) 14: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x2574340) [0x7fd051ed3340]

      (EE) 15: /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so (0x7fd04f95f000+0x25744e4) [0x7fd051ed34e4]

      (EE) 16: /opt/amdgpu-pro/lib/xorg/modules/extensions/libglx.so (0x7fd056d2b000+0x45a7f) [0x7fd056d70a7f]

      (EE) 17: /opt/amdgpu-pro/lib/xorg/modules/extensions/libglx.so (GlxExtensionInit+0x138) [0x7fd056d6e8f1]

      (EE) 18: /usr/lib/xorg/Xorg (InitExtensions+0x43) [0x5633bb418313]

      (EE) 19: /usr/lib/xorg/Xorg (0x5633bb352000+0x581cc) [0x5633bb3aa1cc]

      (EE) 20: /lib/x86_64-linux-gnu/libc.so.6 (__libc_start_main+0xf0) [0x7fd05857c830]

      (EE) 21: /usr/lib/xorg/Xorg (_start+0x29) [0x5633bb394329]

      (EE)

      (EE) Segmentation fault at address 0x18

      (EE)

      Fatal server error:

      (EE) Caught signal 11 (Segmentation fault). Server aborting

      (EE)

      (EE)

      Please consult the The X.Org Foundation support

              at http://wiki.x.org

      for help.

      (EE) Please also check the log file at "/var/log/Xorg.0.log" for additional information.

      (EE)

      (EE) Server terminated with error (1). Closing log file.

      I've googled a lot and found many similar issues but none of the threads provided an answer on how to fix this. Can someone help me out? Thank you!!!

        • Re: Amdgpu Segmentation fault
          jesse_amd

          Hi bmayer,

           

          Are you using an EPYC platform? Try installing it with these additional parameters:

           

          amdgpu-install --opencl=legacy --headless

           

          Please let me know if that works for you.

            • Re: Amdgpu Segmentation fault
              johnutz-pnsr

              Hello;

               

              As of today I am experiencing a segfault in the exact same function tho under a different use case.

               

              It repros with both the most recent amd linux distributions

               

              amdgpu-pro-18.30-641594.tar.xz

              amdgpu-pro-18.40-676022-ubuntu-18.04.tar.xz

               

              I grabbed 18.40 with the profound hope that the problem was fixed. NOPE! :-(

               

              Program terminated with signal SIGSEGV, Segmentation fault.

              #0  0x00007fedd729eb4d in amdgpu_get_marketing_name ()

                 from /opt/amdgpu/lib/x86_64-linux-gnu/libdrm_amdgpu.so.1

              (gdb) bt

              #0  0x00007fedd729eb4d in amdgpu_get_marketing_name ()

                  at /opt/amdgpu/lib/x86_64-linux-gnu/libdrm_amdgpu.so.1

              #1  0x00007fedda58acf4 in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

              #2  0x00007fedda58b861 in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

              #3  0x00007fedda3694b6 in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

              #4  0x00007fedda38a187 in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

              #5  0x00007fedd974c1b5 in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

              #6  0x00007fedda216dec in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

              #7  0x00007fedda58d785 in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

              #8  0x00007fedda1dbafe in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

              #9  0x00007fedda1dbeba in  () at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

              #10 0x00007fedda1dfbf9 in eglInitialize ()

                  at /usr/lib/x86_64-linux-gnu/dri/amdgpu_dri.so

              #11 0x00007feddbe44fbd in gst_egl_adaptation_init_display ()

                  at /usr/lib/x86_64-linux-gnu/gstreamer-1.0/libgstnveglglessink.so

              #12 0x00007feddbe39d15 in egl_init ()

                  at /usr/lib/x86_64-linux-gnu/gstreamer-1.0/libgstnveglglessink.so

              #13 0x00007feddbe417e3 in gst_eglglessink_open ()

                  at /usr/lib/x86_64-linux-gnu/gstreamer-1.0/libgstnveglglessink.so

              #14 0x00007feddbe41aad in gst_eglglessink_change_state ()

                  at /usr/lib/x86_64-linux-gnu/gstreamer-1.0/libgstnveglglessink.so

              #15 0x00007fee11e72bde in gst_element_change_state ()

                  at /usr/lib/x86_64-linux-gnu/libgstreamer-1.0.so.0

              #16 0x00007fee11e73319 in  ()

                  at /usr/lib/x86_64-linux-gnu/libgstreamer-1.0.so.0

              #17 0x00007fee11e50942 in  ()

                  at /usr/lib/x86_64-linux-gnu/libgstreamer-1.0.so.0

              #18 0x00007fee11e72bde in gst_element_change_state ()

                  at /usr/lib/x86_64-linux-gnu/libgstreamer-1.0.so.0

              #19 0x00007fee11e73319 in  ()

                  at /usr/lib/x86_64-linux-gnu/libgstreamer-1.0.so.0

              #20 0x000055797ac6d530 in main (argc=<optimized out>, argv=<optimized out>)

                  at SrcFilVdo.c:386

               

              please let me know what data i can provide to you to help troubleshoot this.

               

              Tnx!

               

              johnu

            • Re: Amdgpu Segmentation fault
              johnutz-pnsr

              Further data points:

               

              1. This is what my app reports on startup:

               

                   Xlib:  extension "DRI2" missing on display ":0".

                   screen 0 does not appear to be DRI2 capable

                   Xlib:  extension "DRI2" missing on display ":0".

                   screen 0 does not appear to be DRI2 capable

                   amdgpu_device_initialize: amdgpu_get_auth (1) failed (-1)

                   Segmentation fault (core dumped) <- this is the stacktrace in gdb in my previous post.

               

              2. I have 2 GPUs on this hp z40 workstation:

               

                  AMD RX 580                                <- which is selected as my bootup graphics card on my hp z40 workstation's bios settings menu

                  NVidia GK104GL Quadro K5000 <- which is what i am doing compute work on.