-
Notifications
You must be signed in to change notification settings - Fork 60
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Failure after suspend/resume? #253
Comments
For reference here's a session using the direct backend. The first query was before a suspend/resume, the latter two were after.
I also have the same two dmesg lines as before. |
I'm seeing something related to this, but in my case Firefox crashes upon resuming. I've just disabled nvidia-vaapi-driver completely and will see if the crashes continue. |
Unfortunately this is an issue with the NVIDIA driver, and there's not much I can do about it. The driver really doesn't like having any sort of NVDEC context that's left active over the suspend/resume causes it to break the driver until a reboot is done. |
Hmm. If firefox is closed before I suspend, then is there anything else I can do to prevent NVDEC context from being left active? Is there something else I need to explicitly kill, or is it really just that I've ever used it at all? |
Know issue of the nvidia driver. After suspend/resume, the nvidia-uvm module is defunct even if not used. The workaround being unloading/reloading it. |
Can confirm this. I wrote up a specific "how to" for Pop!_OS users just yesterday, but after resume from suspend HW acceleration in Firefox is broken. Only a reboot fixes it. I haven't tried unloading/reloading but that's not really a solution for the average user. Question, it's a "known issue" with the NVIDIA driver, but is there any actual confirmation or bug tracking within NVIDIA as a company? Does this bug affect Wayland or only X11 windowing systems? I ask that because, and I'm only moderately knowledgeable about Linux with nearly ZERO experience with Wayland, so I don't know if Wayland even requires a vaapi layer for hardware acceleration of video codecs. |
I'm not sure if there's an actual NVIDIA bug for it. I've bumped the issue[1] in the NVIDIA forums and we'll see if we get a response. [1] https://forums.developer.nvidia.com/t/xid-31-after-wakeup-from-sleep/139870/6 |
Having the same issue under laptop in secondary nvidia card in PRIME configuration.
Doing nvidia-uvm reloading solves the issue:
|
|
Last time I tried some 535 driver, it refused to decrease cooler speed after some video playback. My laptop sounded like a jet-plane & never stopped unless rebooted. I'll try 545 this time. Thanks for suggestion. |
I checked 545.23.08 version and looks like they've fixed both cooler speed & hw acceleration after suspend/resume issues. I think the issue might be closed now. |
Looks like I was too quick. |
This bug is still there in driver 550.78 |
I am using Archlinux, the instructions here solved my problem, I hope it will be useful to you. |
My NVIDIA driver is |
NVidia driver 535.183.06-1 Uncommenting
... in /etc/modprobe.d/nvidia-options.conf results in errors in syslog and my laptop just stops suspending at all :) Commenting that line back brings back Firefox crashes, but suspend starts working. |
The instructions clearly mentions that you have to enable the services too. |
It's Arch wiki and I am under Devuan. So I don't have any of those *.service daemons. |
The Nvidia suspend/resume mechanism relies on systemd hacks. Since Devuan promises to stay systemd-free they should get in contact with the Gentoo devs who maintain elogind for the same pupose incorporating the needed Nvidia hooks. Please support your Distro. |
|
this works! :) |
Devuan can be fixed quite easily. |
@elFarto it would be really nice if you could mention in the readme this detail about suspending in the readme |
You could do a PR for this :) |
In fedora 40 I found a workaround method: shutdown any process using nvidia decoding function before suspend or hibernate.
|
I have acceleration working fine on my media PC, as long as I try it soon after boot. However I suspend this PC in between uses, and acceleration never works following such a cycle until I reboot. Every other GPU function I've tested continues working after the failure: OpenGL, VDPAU, etc are all fine. Hardware is a GeForce GT 1030, OS is Ubuntu 22.04, nvidia driver version is 535.113.01, and nvidia-vaapi-driver version is git 0a924c.
The first time I try running
vainfo
after a resume, I get:Also, the following lines appear in dmesg during that first
vainfo
query:Subsequent calls to
vainfo
produce no more dmesg output, and the console output changes somewhat:I have tried direct backend instead of egl, and get no different results, aside from some slightly different error text.
I'm not 100% certain the suspend and resume is the cause. I have attempted a quick suspend/resume cycle in order to troubleshoot this problem and been unable to reproduce; but it always happens if I leave it suspended for a normal amount of time (hours). So possibly something else about the elapsed time is involved.
I also have tried to leave firefox running during a suspend/resume, thinking that acceleration might continue to function if I just didn't have to repeat the initialization process, however firefox seems to explode immediately upon resume, so this is not an option.
The text was updated successfully, but these errors were encountered: