Bug #12965

starting Lightdm triggers reboot when graphics card is Nvidia GTX 1650

Added by r a 4 months ago. Updated 14 days ago.

Status: New
Priority: Normal
Assignee: -
Category: -
Target version:
Start date:
Due date:
% Done: 0%
Estimated time:
Difficulty: Medium
Tags:

Description

I purchased a new Gigabyte GeForce GTX 1650 Mini ITX OC 4G card, which is identified as an Nvidia TU117 0x10de,0x1f82 (using scanpci).
As I was already running the NVIDIA-Solaris-x86-450.51.run driver, I swapped out my 750 Ti and installed the 1650. The system booted normally; it was only when it tried to start lightdm and present the login screen that the system went quiet and rebooted.
I downloaded and installed the NVIDIA-Solaris-x86-450.57.run driver and got the same problem.

Wanting to eliminate a hardware issue, I powered off my system, unplugged my six-disk ZFS pool and mirror boot disk, and swapped my primary boot disk for a 500GB drive.
I then booted Linux Mint 20 and installed it onto the 500GB drive. Everything was recognised correctly, the installation was fine, and I was able to boot Linux Mint.

[ Jul 16 08:05:50 Method "start" exited with status 0. ]

** (lightdm:788): WARNING **: 08:05:50.795: Failed to get list of logind seats: GDBus.Error:org.freedesktop.DBus.Error.ServiceUnknown: The name org.freedesktop.login1 was not provided by any .service files
** (lightdm:788): WARNING **: 08:05:50.808: Error getting user list from org.freedesktop.Accounts: GDBus.Error:org.freedesktop.DBus.Error.ServiceUnknown: The name org.freedesktop.Accounts was not provided by any .service files
lightdm: Got tty: '/dev/vt/7'
** (process:813): WARNING **: 08:05:55.798: Error getting user list from org.freedesktop.Accounts: GDBus.Error:org.freedesktop.DBus.Error.ServiceUnknown: The name org.freedesktop.Accounts was not provided by any .service files
** (lightdm:788): WARNING **: 08:05:56.328: Error activating ConsoleKit session: GDBus.Error:org.freedesktop.DBus.GLib.UnmappedError.CkSeatError.Code0: Activation is not supported for this kind of seat
lightdm: Got tty: '/dev/vt/7'lightdm: Got tty: '/dev/vt/7'
** (process:830): WARNING **: 08:07:35.219: Error getting user list from org.freedesktop.Accounts: GDBus.Error:org.freedesktop.DBus.Error.ServiceUnknown: The name org.freedesktop.Accounts was not provided by any .service files
** (process:830): WARNING **: 08:07:36.238: Error getting XDG_RUNTIME_DIR from ConsoleKit: GDBus.Error:org.freedesktop.DBus.Error.UnknownMethod: Method "GetXDGRuntimeDir" with signature "" on interface "org.freedesktop.ConsoleKit.Session" doesn't exist
** (lightdm:788): WARNING **: 08:07:36.239: Error activating ConsoleKit session: GDBus.Error:org.freedesktop.DBus.GLib.UnmappedError.CkSeatError.Code0: Activation is not supported for this kind of seat
Bad enum index 10 for control 'record-source'
lightdm: Got tty: '/dev/vt/7'
** (process:956): WARNING **: 08:08:51.769: Error getting user list from org.freedesktop.Accounts: GDBus.Error:org.freedesktop.DBus.Error.ServiceUnknown: The name org.freedesktop.Accounts was not provided by any .service files
** (lightdm:788): WARNING **: 08:08:52.273: Error activating ConsoleKit session: GDBus.Error:org.freedesktop.DBus.GLib.UnmappedError.CkSeatError.Code0: Activation is not supported for this kind of seat
lightdm: Got tty: '/dev/vt/7'lightdm: Got tty: '/dev/vt/7'
** (process:973): WARNING **: 08:09:59.824: Error getting user list from org.freedesktop.Accounts: GDBus.Error:org.freedesktop.DBus.Error.ServiceUnknown: The name org.freedesktop.Accounts was not provided by any .service files
** (process:973): WARNING **: 08:10:00.852: Error getting XDG_RUNTIME_DIR from ConsoleKit: GDBus.Error:org.freedesktop.DBus.Error.UnknownMethod: Method "GetXDGRuntimeDir" with signature "" on interface "org.freedesktop.ConsoleKit.Session" doesn't exist
** (lightdm:788): WARNING **: 08:10:00.853: Error activating ConsoleKit session: GDBus.Error:org.freedesktop.DBus.GLib.UnmappedError.CkSeatError.Code0: Activation is not supported for this kind of seat
end from FAM server connection
end from FAM server connection
Failed to get D-Bus connection
Bad enum index 10 for control 'record-source'
** (process:973): WARNING **: 08:11:01.991: Error ending ConsoleKit session: The connection is closed
[ Jul 16 08:12:38 Enabled. ]
[ Jul 16 08:12:46 Executing start method ("/lib/svc/method/svc-lightdm start"). ]
[ Jul 16 08:12:46 Method "start" exited with status 0. ]
** (lightdm:760): WARNING **: 08:12:46.924: Failed to get list of logind seats: GDBus.Error:org.freedesktop.DBus.Error.ServiceUnknown: The name org.freedesktop.login1 was not provided by any .service files
** (lightdm:760): WARNING **: 08:12:46.952: Error getting user list from org.freedesktop.Accounts: GDBus.Error:org.freedesktop.DBus.Error.ServiceUnknown: The name org.freedesktop.Accounts was not provided by any .service files
[ Jul 16 08:13:17 Stopping because all processes in service exited. ]
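
The log above repeats a small number of distinct warnings many times. As a rough aid for reading it, the following Python sketch groups the warnings by their leading message text. The regular expression and the function name summarize_warnings are assumptions made for illustration; they are not part of lightdm or of any illumos tool. Only the "(name:pid): WARNING **: time: message" portion of each line is matched, and other lines are ignored.

```python
import re
from collections import Counter

# Matches the "(name:pid): WARNING **: HH:MM:SS.mmm: message" part of a line.
WARNING_RE = re.compile(
    r"\((?P<prog>[\w-]+):(?P<pid>\d+)\): WARNING \*\*: "
    r"(?P<time>[\d:.]+): (?P<msg>.*)"
)


def summarize_warnings(lines):
    """Count warnings, keyed by the message text before its first colon."""
    counts = Counter()
    for line in lines:
        match = WARNING_RE.search(line)
        if match:
            key = match.group("msg").split(":", 1)[0].strip()
            counts[key] += 1
    return counts


# Three lines copied from the log in this report.
sample = [
    "(lightdm:788): WARNING **: 08:05:50.795: Failed to get list of logind seats: "
    "GDBus.Error:org.freedesktop.DBus.Error.ServiceUnknown: The name "
    "org.freedesktop.login1 was not provided by any .service files",
    "(lightdm:788): WARNING **: 08:05:50.808: Error getting user list from "
    "org.freedesktop.Accounts: GDBus.Error:org.freedesktop.DBus.Error.ServiceUnknown: "
    "The name org.freedesktop.Accounts was not provided by any .service files",
    "(process:813): WARNING **: 08:05:55.798: Error getting user list from "
    "org.freedesktop.Accounts: GDBus.Error:org.freedesktop.DBus.Error.ServiceUnknown: "
    "The name org.freedesktop.Accounts was not provided by any .service files",
]

for message, count in summarize_warnings(sample).most_common():
    print(count, message)
```

Run against the whole log, this kind of summary makes it easier to see whether the warnings before a failed start differ from those before a successful one.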

Swapped back the original boot disk, plugged the drives back in, then switched back to the 750 Ti in order to access the GUI.


Files

xid.txt (1.55 MB), gpu driver errors, Rick V, 2020-07-21 08:25 AM

Related issues

Related to OpenIndiana Distribution - Bug #12964: Continuously reboot after installing the latest update (Resolved)

#1

Updated by gh origin 3 months ago

  • Related to Bug #12964: Continuously reboot after installing the latest update added
#2

Updated by Rick V 3 months ago

Same problem here, but I can at least get to X about 50% of the time.

The output of cat /var/adm/messages | grep Xid is attached.

#3

Updated by r a about 2 months ago

Updated to illumos-8515d72326 and Nvidia Solaris driver 450.66. The same problem occurs: the system reboots when attempting to start lightdm.

Using mdb to look at unix.0 and vmcore.0 gives the following:

Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms 450.66 Wed Aug 12 19:38:08 UTC 2020
NOTICE: SUNW-MSG-ID: SUNOS-8000-0G, TYPE: Error, VER: 1, SEVERITY: Major

panic[cpu5]/thread=fffffe0017c2dc20:
pcieb-2: PCI Express Fatal Error. (0x41)

fffffe0017c2db60 pcieb:pcieb_intr_handler+1dc ()
fffffe0017c2dbc0 unix:av_dispatch_autovect+83 ()
fffffe0017c2dc00 unix:dispatch_hardint+36 ()
fffffe0017bf7a50 unix:switch_sp_and_call+15 ()
fffffe0017bf7aa0 unix:do_interrupt+aa ()
fffffe0017bf7ab0 unix:cmnint+c3 ()
fffffe0017bf7ba0 unix:mach_cpu_idle+b ()
fffffe0017bf7bd0 unix:cpu_idle+10f ()
fffffe0017bf7be0 unix:cpu_idle_adaptive+19 ()
fffffe0017bf7c00 unix:idle+ae ()
fffffe0017bf7c10 unix:thread_start+b ()

dumping to /dev/zvol/dsk/rpool1/dump, offset 65536, content: kernel

#4

Updated by r a 26 days ago

After downloading Nvidia driver 455.23.04 and running it for a couple of weeks on my GeForce 750 Ti card, I swapped the Nvidia GeForce 1650 OC card back in. I booted to single user and removed the Nvidia NVDAgraphicsr and NVDAgraphics packages. I touched /etc/reconfigure and also performed a "reboot -- -r".
Upon reboot I tried to enter multi-user state, but lightdm did not start. I moved /etc/X11/xorg.conf aside, took the system to single-user mode and then returned, but lightdm still did not start. I installed the Nvidia 455.23.04 driver and rebooted; lightdm attempted to start, and a minute or so later the computer rebooted.
Reverted to the GeForce 750 Ti.

#5

Updated by r a 15 days ago

Installed the NVIDIA-Solaris-x86-455.28.run driver, which was working fine with the GeForce 750 Ti. Swapped the card, booted to single user, and used:

  1. touch /reconfigure
  2. reboot

POST and boot completed, and I got messages about reconfiguring devices. Xorg.0.log recorded:

[ 105.466]
X.Org X Server 1.19.7
Release Date: 2019-03-02
[ 105.467] X Protocol Version 11, Revision 0
[ 105.467] Build Operating System: SunOS 5.11 i86pc
[ 105.467] Current Operating System: SunOS telsa 5.11 illumos-aefb332f56 i86pc
[ 105.467] Build Date: 24 September 2020 02:36:42PM
[ 105.467] Solaris ABI: 64-bit
[ 105.467] Current version of pixman: 0.38.0
[ 105.467] Before reporting problems, check http://openindiana.org
to make sure that you have the latest version.
[ 105.467] Markers: (--) probed, (**) from config file, (==) default setting,
(++) from command line, (!!) notice, (II) informational,
(WW) warning, (EE) error, (NI) not implemented, (??) unknown.
[ 105.467] (==) Log file: "/var/log/Xorg.0.log", Time: Fri Oct 16 13:27:30 2020
[ 105.468] (==) Using config file: "/etc/X11/xorg.conf"
[ 105.468] (==) Using system config directory "/usr/share/X11/xorg.conf.d"
[ 105.470] (==) ServerLayout "Layout0"
[ 105.470] (**) |-->Screen "Screen0" (0)
[ 105.470] (**) | |-->Monitor "Monitor0"
[ 105.472] (**) | |-->Device "Device0"
[ 105.472] (**) |-->Input Device "Keyboard0"
[ 105.472] (**) |-->Input Device "Mouse0"
[ 105.472] (==) Automatically adding devices
[ 105.472] (==) Automatically enabling devices
[ 105.473] (==) Not automatically adding GPU devices
[ 105.473] (==) Max clients allowed: 256, resource mask: 0x1fffff
[ 105.482] (**) FontPath set to:
/usr/X11R6/lib/X11/fonts/misc/:unscaled,
/usr/X11R6/lib/X11/fonts/100dpi/:unscaled,
/usr/X11R6/lib/X11/fonts/75dpi/:unscaled,
/usr/X11R6/lib/X11/fonts/misc/,
/usr/X11R6/lib/X11/fonts/100dpi/,
/usr/X11R6/lib/X11/fonts/75dpi/,
catalogue:/etc/X11/fontpath.d
[ 105.482] (==) ModulePath set to "/usr/lib/xorg/modules/amd64,/usr/X11/lib/modules/"
[ 105.482] (WW) Hotplugging is on, devices using drivers 'kbd', 'mouse' or 'vmmouse' will be disabled.
[ 105.482] (WW) Disabling Keyboard0
[ 105.482] (WW) Disabling Mouse0
[ 105.482] (II) Loader magic: 6f9020
[ 105.483] (II) Module ABI versions:
[ 105.483] X.Org ANSI C Emulation: 0.4
[ 105.483] X.Org Video Driver: 23.0
[ 105.483] X.Org XInput driver : 24.1
[ 105.483] X.Org Server Extension : 10.0
[ 105.491] (--) PCI:*(0:4:0:0) 10de:1f82:1458:3fca rev 161, Mem 0xdf000000/16777216, 0xc0000000/268435456, 0xdc000000/33554432, I/O 0x0000cc00/128
[ 105.491] (II) LoadModule: "glx"
[ 105.493] (II) Loading /usr/lib/xorg/modules/extensions/amd64/libglx.so
[ 105.690] (II) Module glx: vendor="NVIDIA Corporation"
[ 105.690] compiled for 1.6.99.901, module version = 1.0.0
[ 105.691] Module class: X.Org Server Extension
[ 105.691] (II) NVIDIA GLX Module 455.28 Wed Sep 30 01:00:57 UTC 2020
[ 105.691] (II) LoadModule: "nvidia"
[ 105.706] (II) Loading /usr/X11/lib/modules/drivers/amd64/nvidia_drv.so
[ 105.720] (II) Module nvidia: vendor="NVIDIA Corporation"
[ 105.720] compiled for 1.6.99.901, module version = 1.0.0
[ 105.720] Module class: X.Org Video Driver
[ 105.721] (II) NVIDIA dlloader X Driver 455.28 Wed Sep 30 01:02:14 UTC 2020
[ 105.721] (II) NVIDIA Unified Driver for all Supported NVIDIA GPUs
[ 105.722] (++) using VT number 7
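
For readers who want to check the PCI line in the log above against the scanpci result (0x10de,0x1f82), here is a small Python sketch that splits it into vendor, device and subsystem IDs and converts the memory region sizes to MiB. The function name parse_pci_line and the returned dictionary layout are assumptions for illustration only; the input string is copied from the log.

```python
import re

# The device description copied from the Xorg.0.log excerpt in this report.
PCI_LINE = (
    "(0:4:0:0) 10de:1f82:1458:3fca rev 161, Mem 0xdf000000/16777216, "
    "0xc0000000/268435456, 0xdc000000/33554432, I/O 0x0000cc00/128"
)


def parse_pci_line(line):
    """Extract the four 16-bit IDs and the memory regions (base, size in bytes)."""
    ids = re.search(
        r"([0-9a-f]{4}):([0-9a-f]{4}):([0-9a-f]{4}):([0-9a-f]{4})", line
    )
    vendor, device, subvendor, subdevice = (int(x, 16) for x in ids.groups())
    # Only look at the text between "Mem" and "I/O" for memory regions.
    mem_part = line.split("Mem", 1)[1].split("I/O", 1)[0]
    regions = [
        (int(base, 16), int(size))
        for base, size in re.findall(r"(0x[0-9a-f]+)/(\d+)", mem_part)
    ]
    return {
        "vendor": vendor,
        "device": device,
        "subvendor": subvendor,
        "subdevice": subdevice,
        "mem_regions": regions,
    }


info = parse_pci_line(PCI_LINE)
print("vendor 0x%04x device 0x%04x" % (info["vendor"], info["device"]))
for base, size in info["mem_regions"]:
    print("region at 0x%08x: %d MiB" % (base, size // (1024 * 1024)))
```

The vendor and device IDs match what scanpci reported, so the X server is at least enumerating the same card before the reboot happens.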

However, when lightdm attempts to start, the system reboots, and /var/adm/messages reports:

"reboot after panic: pcieb-2: PCI Express Fatal Error. (0x41)"

Examining vmcore.3
root@tesla:/var/crash/tesla# mdb vmcore.3
Loading modules: [ unix genunix specfs dtrace mac cpu.generic uppc apix scsi_vhci zfs sata sd ip hook neti sockfs arp usba xhci mm s1394 smbios fctl stmf stmf_sbd lofs mr_sas random idm cpc crypto fcip fcp ufs logindmux nsmb ptm smbsrv nfs sppp ]

::stack

vpanic()
pcieb_intr_handler+0x1dc(fffffe1156a17728, 1)
apix_dispatch_by_vector+0x8c(21)
apix_dispatch_lowlevel+0x1c(21, 0)
switch_sp_and_call+0x15()
apix_do_interrupt+0xec(fffffe0017b4eab0, 5)
_interrupt+0xc3()
mach_cpu_idle+0xb()
cpu_idle+0x10f()
cpu_idle_adaptive+0x19()
idle+0xae()
thread_start+0xb()
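
The two stack traces in this report (the panic message in comment #3 and the mdb ::stack output above) use slightly different formats and come from different boots. A minimal Python sketch for comparing them follows; the regular expression and the helper name parse_frames are assumptions for illustration, and frames without an offset (such as vpanic()) are skipped.

```python
import re

# Matches "module:function+off (" (panic format) or "function+0xoff(" (mdb format).
FRAME_RE = re.compile(
    r"(?:(?P<mod>\w+):)?(?P<func>\w+)\+(?:0x)?(?P<off>[0-9a-f]+)\s*\("
)


def parse_frames(text):
    """Return a list of (module or None, function, offset) for each frame."""
    frames = []
    for line in text.splitlines():
        match = FRAME_RE.search(line)
        if match:
            frames.append(
                (match.group("mod"), match.group("func"), int(match.group("off"), 16))
            )
    return frames


panic_stack = """\
fffffe0017c2db60 pcieb:pcieb_intr_handler+1dc ()
fffffe0017c2dbc0 unix:av_dispatch_autovect+83 ()
fffffe0017c2dc00 unix:dispatch_hardint+36 ()
fffffe0017bf7a50 unix:switch_sp_and_call+15 ()
"""

mdb_stack = """\
vpanic()
pcieb_intr_handler+0x1dc(fffffe1156a17728, 1)
apix_dispatch_by_vector+0x8c(21)
apix_dispatch_lowlevel+0x1c(21, 0)
switch_sp_and_call+0x15()
"""

first = parse_frames(panic_stack)
second = parse_frames(mdb_stack)
# Compare on (function, offset) only, since mdb omits the module name.
common = {(f, o) for _, f, o in first} & {(f, o) for _, f, o in second}
for func, off in sorted(common):
    print("%s+0x%x" % (func, off))
```

Both traces share the same pcieb_intr_handler frame at the same offset even though the interrupt dispatch path below it differs, which suggests the same failure is being hit each time.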

The problem appears to be related to PCI Express access, which did not occur during my initial testing with Linux, so it appears to be driver related rather than a hardware issue. Updating the Nvidia driver does not appear to help, so it looks like something in the OpenIndiana code.

#6

Updated by r a 14 days ago

Going through the mdb vmcore.3 again

::status

debugging crash dump vmcore.3 (64-bit) from ts
operating system: 5.11 illumos-aefb332f56 (i86pc)
build version: heads/master-0-gaefb332f56-dirty

image uuid: 99a3c93c-b13c-686f-b573-f60ee3f5e4ed
panic message: pcieb-2: PCI Express Fatal Error. (0x41)
dump content: kernel pages only

::vars

cr3 = 9800000
bx = 1
ch = 0
sil = 80
cl = 41
r8d = 17b84918
r15w = 1
cs = 30
r8l = 18
cx = 41
dh = 0
gsbase = fffffe1158248000
di = 67e0
dl = 2
kgsbase = 0
savfp = 0
r9d = 17b84918
r8w = 4918
ds = 4b
trapno = 0
r9l = 18
dx = 2
r9w = 4918
es = 4b
si = 4a80
. = c
0 = 0
1 = 0
rax = fffffe0017b84aa0
2 = 0
sp = 4a68
r10 = fffffffffb875f98
r11 = 64
ss = 38
r12 = fffffe114d647880
r13 = 2
9 = 0
r14 = 41
rbp = fffffe0017b84ae0
fs = 0
r15 = 1
rbx = 1
gs = 0
rcx = 41
rdi = fffffffff7cd67e0
_ = 0
eax = 17b84aa0
rdx = 2
b = fffffffffbc00000
d = 120978
e = c00000
eflags = 246
ebp = 17b84ae0
m = 464c457f
spl = 68
rsi = fffffe0017b84a80
ebx = 1
t = 389aa8
rsp = fffffe0017b84a68
r10d = fb875f98
r10l = 98
ecx = 41
hits = 0
edi = f7cd67e0
r11d = 64
r10w = 5f98
r11l = 64
edx = 2
rflags = 246
r12d = 4d647880
err = 0
r11w = 64
r12l = 80
esi = 17b84a80
esp = 17b84a68
r13d = 2
savpc = 0
dil = e0
thread = 0
rip = fffffffffb8814c0
r12w = 7880
r13l = 2
ah = 4a
al = a0
r14d = 41
r13w = 2
bpl = e0
r14l = 41
r8 = fffffe0017b84918
r9 = 1
bh = 0
ax = 4aa0
bl = 1
r15d = 1
r14w = 41
bp = 4ae0
r15l = 1
cr2 = 80b31c8
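
Many of the entries in the ::vars listing above are not independent values: the shorter register names are the usual x86-64 views onto the low bits of the full 64-bit registers. The following Python sketch, with a hypothetical helper named sub_registers, shows that relationship for rax and rcx using the values from the listing, which may help when deciding which entries are worth reading.

```python
def sub_registers(value):
    """Return the 32-, 16- and 8-bit views of a 64-bit register value."""
    return {
        "32-bit": value & 0xFFFFFFFF,      # e.g. eax
        "16-bit": value & 0xFFFF,          # e.g. ax
        "low byte": value & 0xFF,          # e.g. al
        "high byte": (value >> 8) & 0xFF,  # e.g. ah
    }


# Values copied from the ::vars output in this report.
rax = 0xFFFFFE0017B84AA0
rcx = 0x41

for name, value in (("rax", rax), ("rcx", rcx)):
    views = sub_registers(value)
    print(name, {k: hex(v) for k, v in views.items()})
```

The derived values agree with the eax, ax, al and ah entries (and ecx, cx, cl and ch) in the listing. The value 0x41 in rcx and r14 is the same number as in the panic message, although the listing alone does not show how it got there.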
