Compose hangs at [uWSGI] getting INI configuration from /opt/mailman-web/uwsgi.ini #730
The next step is the container waiting for the database to be up and running. Is your database container up and running fine? |
Yes, they are. I can see that the database and tables have been initialized and loaded:
`Nov 02 08:25:05 RHEL01.(mydomain) (mycontainer)[479382]: MySQL is up - continuing`
`Nov 02 08:25:54 RHEL01.(mydomain) (mycontainer)[479382]: Applying postorius.0011_auto_20191109_1219... OK`
...and that's where everything hangs. There are no more messages after that. |
Perusing the logs after I persisted them, I found a message in uwsgi-error.log:
I don't understand how that could be, because the other components of mailmanweb that ran ahead of this were clearly able to access and update the database. |
No ideas I guess? |
I don't have any great ideas why something like this would happen. Can you try it out with some other versions of mysql to see if this is an isolated issue? |
I was running 0.4 on Fedora 40. I upgraded to Fedora 41 and, not realizing it, the CPU was being pegged at nearly 100% for a while by the process `uwsgi`. I went searching for a reason, but I could not find one.
If I do a `docker compose up`, I get this as the last line of output:
`mailman-web | [uWSGI] getting INI configuration from /opt/mailman-web/uwsgi.ini`
Mailman does appear to be working, but I wonder if the process is getting stuck at this point and thrashing trying to do something, while other processes are doing what they should do? I'm using MariaDB, but I did not see any errors in the logs about database access timeouts or failures. And, it's definitely able to access the database.
I tried moving to 0.5.2 hoping that might resolve it, but I get the same result.
I'm not sure if this is related. @BearGFR, were you also seeing high CPU usage from `uwsgi`?
EDIT: I should note that my database is outside of docker, running natively on the host.
EDIT2: I ran "strace" to see what system calls the stuck process was running. It spewed these lines faster than I think the display could render:
I am not sure what it's doing, but definitely one of those I attached to the offending process and I could see the call stack. Best I can tell, it might be this loop that is closing sockets: https://github.com/unbit/uwsgi/blob/master/core/utils.c#L2542. If that
EDIT 3: Indeed, that line of code is the issue. I'm not sure where "max_fd" is set, but I built the code from the current source repository and printed the value using the logging function in the code. The value produced was 1073741816. So, it's in that loop trying to close a ton of file descriptors. Next... to find where that is being set.
EDIT 4: It appears to be set in uwsgi_start(). There is also a log corresponding to it:
Specifically, this line of code: https://github.com/unbit/uwsgi/blob/master/core/uwsgi.c#L2776
I added this to the docker-compose file for the web service:
    ulimits:
      nofile:
        soft: 1024
        hard: 2048
This seems to have tamed it. I know it's still trying to close maybe 1024 or 2048 sockets, but it doesn't seem to do it constantly. I did not investigate further, but this at least keeps the CPU from pegging at 100%. |
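One way to check that a `nofile` limit like the one above is actually in effect is to read it from inside the container. The following is a minimal sketch, not uWSGI's code. It assumes Python is available in the image, that uWSGI's `max_fd` follows the soft `RLIMIT_NOFILE` value, and that the close loop starts at descriptor 3. Both of those last points are assumptions, since the thread only establishes that `max_fd` is set in `uwsgi_start()`. The helper names `nofile_limits` and `naive_close_calls` are hypothetical.

```python
import resource


def nofile_limits():
    """Return the (soft, hard) open-file limits for the current process."""
    return resource.getrlimit(resource.RLIMIT_NOFILE)


def naive_close_calls(max_fd, first_fd=3):
    """Count the close() calls made by a loop over every fd in [first_fd, max_fd).

    first_fd=3 is an assumption (skip stdin, stdout, stderr).
    """
    return max(0, max_fd - first_fd)


if __name__ == "__main__":
    soft, hard = nofile_limits()
    if soft == resource.RLIM_INFINITY:
        print("soft nofile limit is unlimited")
    else:
        print(f"soft={soft} hard={hard}")
        print(f"a naive close loop would issue {naive_close_calls(soft)} close() calls")
```

With the value reported in this thread (1073741816), such a loop would issue over a billion `close()` calls each time it runs; with a soft limit of 1024 it would issue about a thousand.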
Hey Paul, I can't say if it was soaking the cpu or not. I finally just gave up trying to run the container and instead installed everything "the old way" directly on the system. I've got that working.
On Jan 2, 2025, at 12:01, "Paul E. Jones" <***@***.***> wrote:
I was running 0.4 on Fedora 40. I upgraded to Fedora 41 and, not realizing it, the CPU was being pegged at nearly 100% for a while by the process uwsgi. I went searching for a reason, but I could not find one.
If I do a docker compose up, I get this as the last line of output:
mailman-web | [uWSGI] getting INI configuration from /opt/mailman-web/uwsgi.ini
Mailman does appear to be working, but I wonder if the process is getting stuck at this point and thrashing trying to do something, while other processes are doing what they should do? I'm using MariaDB, but I did not see any errors in the logs about database access timeouts or failures. And, it's definitely able to access the database.
I tried moving to 0.5.2 hoping that might resolve it, but I get the same result.
I'm not sure if this is related. @BearGFR<https://github.com/BearGFR>, were you also seeing high CPU usage from uwsgi?
|
ACK. Sounds like the issue I uncovered and yours may not be the same. It still might have been, as high CPU usage can cause weird things to happen. But, I felt it was useful to document what I found since I figured others were quite likely to encounter the same thing. In my case, there's nothing wrong with the Docker config kindly provided by @maxking. Rather, it has something to do with the combination of Docker on Fedora 41 and how that uwsgi process is handling sockets. I did open an issue at the uwsgi repository, as I think they ought not attempt to close 1,073,741,816 sockets -- even if the OS tells them they can :) |
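For illustration only, here is a sketch of a different technique from the loop described in this thread: closing only the descriptors that are actually open, by listing `/proc/self/fd`, so the work scales with the number of open files rather than with the limit. This is not what uWSGI does and is not a proposed patch. It is Linux-only, and the helper names `list_open_fds` and `close_fds_from` are hypothetical.

```python
import os


def list_open_fds(fd_dir="/proc/self/fd"):
    """Return the descriptor numbers listed for this process (Linux-only).

    The listing can include the transient descriptor used to read the
    directory itself, which is already closed by the time this returns.
    """
    fds = []
    for name in os.listdir(fd_dir):
        try:
            fds.append(int(name))
        except ValueError:
            continue  # ignore anything that is not a descriptor number
    return sorted(fds)


def close_fds_from(first_fd, keep=()):
    """Close every open descriptor >= first_fd that is not in keep.

    Returns the descriptors that were actually closed.
    """
    closed = []
    for fd in list_open_fds():
        if fd < first_fd or fd in keep:
            continue
        try:
            os.close(fd)
        except OSError:
            continue  # already closed, e.g. the transient directory descriptor
        closed.append(fd)
    return closed
```

A caller would pass `first_fd=3` to leave stdin, stdout, and stderr alone, and list in `keep` any descriptors that must stay open.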
Attempting to bring up containerized mailman/mailman-web following the instructions at https://asynchronous.in/docker-mailman/
Everything seems to progress more or less normally, then everything stops/hangs after the message:
`[uWSGI] getting INI configuration from /opt/mailman-web/uwsgi.ini`
No matter how long I let it sit, it never gets past that.