-
Notifications
You must be signed in to change notification settings - Fork 19
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
tar: *.ko: Cannot stat: No such file or directory #1
Comments
I get something similar:
|
Looks like we're exiting the container after |
Sorry, I missed this. I now have a filter to highlight these issues. Which systemd version are you running on the machine building the driver? I saw similar problems with older releases. That's why the README says "tested on version 229, there might be issues with <= 225". Another thing worth checking... are you running low on disk? |
One measure that I could take right away is stopping execution right away. There is no point in continuing, it only obfuscates the real issue. When I last debugged this, I traced it to the main event loop in nspawn exiting in an unexpected fashion (the KILL signal came from nspawn itself). Upgrading systemd made the problem go away, so I didn't investigate further. |
Version 231, running under 4.7.3-coreos-r2:
I do agree that this issue is a bug in systemd, and not your scripts.
No, I went to the trouble of expanding the image first with
Good idea! |
Speaking of -coreos-r2... I found and fixed a silly bug that affected 1185.2.0 and up. Can you retry with the latest commit? |
I think I found the culprit. Running systemd-nspawn with --share-system increases the chances of a SIGKILL. Please try the latest version, I have just pushed a number of fixes. With the latest CoreOS security fix, you might need to add the --emerge-sources flag, since CoreOS engineers seem to not have uploaded portage binary packages for the those versions. To compensate for the larger of packages built now, the scripts build 4 of them at a time. |
I tried the latest version, got the same error: $ ./build.sh 367.57 stable 1298.5.0
|
1298.5.0 is never going to work with 367.57. Newer versions of the driver (375, etc.) support Linux 4.9.9 (which is what the latest CoreOS uses), but 367.57 dates back to last October and does not support it (get_user_pages() has a different signature). Why are you using it instead of a more recent version? Is it because you have old GRID cards and Nvidia docs tell you to use v367, as that's the last version to support them? E.g. from http://us.download.nvidia.com/XFree86/Linux-x86_64/378.13/README/supportedchips.html "Below are the legacy GPUs that are no longer supported in the unified driver. These GPUs will continue to be maintained through the special legacy NVIDIA GPU driver releases." If that's the case, you need to ask Nvidia to make one of the special legacy releases they have promised. An example here: https://devtalk.nvidia.com/default/topic/997603/linux/newer-367-driver-for-grid-k520-/ |
Yes, also looked at the logs and it was "error: too many arguments to function 'get_user_pages'" No particular reason! Will try v375 now, thanks. |
$ ./build.sh 375.20 stable 1298.5.0 succeeded. Thanks. |
I had another problem when I used Otherwise this issue is resolved. |
apt install systemd-container |
$./build.sh 367.27 alpha 1097.0.0
I am following your instructions, but during the ./build.sh step I get the following message:
++ basename -a 'pkg/run_files/1097.0.0/NVIDIA-Linux-x86_64-367.27/kernel/*.ko'
tar -C pkg/run_files/1097.0.0/NVIDIA-Linux-x86_64-367.27/kernel -cvj '*.ko'
tar: *.ko: Cannot stat: No such file or directory
tar: Exiting with failure status due to previous errors
Did I miss something?
The text was updated successfully, but these errors were encountered: