-
Notifications
You must be signed in to change notification settings - Fork 101
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Deadlock #1081
Comments
Thank you for your reporting a bug. The issue has been placed in triage, the MobilityData team will follow-up on it. |
Update : we were able to reproduce every time by forcing a memory limit in a cgroup in linux. Using a limit of 500MB and this feed reproduces the bug every time. Using 100MB, the problem is not reproduced and the process is killed by the kernel. Using cgroups and a memory limit that's big enough to start the process but big enough to make it run all the rules is enough to reproduce the problem. |
Hi Guillaume, Thanks for reporting this issue! From a brief look, it seems the validator is failing with OutOfMemoryError, I can reproduce it by adding
I can see that the feed you provided has ~600MB in size. We didn't do heavy memory optimizations and on some bigger feeds 3-6GB should be enough. We would recommend to reserve at least 3GB of memory, this could be done by passing Could you please provide a bit more context to make sure that we understand the problem correctly?
|
@asvechnikov2 yes, the validator deadlocks. It rans out of memory but the problem is actually the deadlock : it hits a point where it's not running anything but never returns or crash. An OutOfMemory crash is actually the expected behaviour.
More context : we use cgroups in our data pipeline to make sure a process doesn't use more memory than it's expected to use. We use historical running data to know how much memory to allocate, so if a feed X used 100MB the last time it ran, we'll use a limit of 100MB + a buffer to avoid slowdown on the server. To reproduce the bug you really need to use cgroups memory limits of the same size of the |
Hi @gcamp ! Thanks for the details. Unfortunately, we were not able to reproduce the exact same behavior using That being said, like you, we noticed that using
Here are our observations. Our scripts and results can be found in this archive.
It seems pretty clear that using As the figure above suggests, I suggest you increase the memory limit of your Our understanding is that the problem may be caused by |
Bug report
Describe the bug
The validator seems to deadlock sometimes. It seems to be related to out of memory errors.
How we reproduce the bug
We were not able to reproduce it in a reliable way.
Screenshots:
If I run
strace -s 99
on the process while it's deadlock this is what I getThe error log is :
Environment versions
openjdk 11.0.13 2021-10-19
OpenJDK Runtime Environment (build 11.0.13+8-Ubuntu-0ubuntu1.20.04)
OpenJDK 64-Bit Server VM (build 11.0.13+8-Ubuntu-0ubuntu1.20.04, mixed mode, sharing)
The text was updated successfully, but these errors were encountered: