-
Hello! I am trying to simulate a very large model, and I am using cloud to run the forward and inverse simulations. I have seen the above error when using JUDI package. Are there any solutions?
|
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 18 replies
-
This looks like a worker died, this error message is just julia saying it lost communication with that worker, would need a complete error trace that shows where JUDI errored. side note: Also it's easier if you use code blocks rather than screenshots (i.e triple quotes) |
Beta Was this translation helpful? Give feedback.
-
I have seem some warnings during forward modeling, I will post it here.
```
From worker 16: /home/hp/.local/lib/python3.9/site-packages/numpy/core/getlimits.py:500: UserWarning: The value of the smallest subnormal for <class 'numpy.float32'> type is zero.
From worker 16: setattr(self, word, getattr(machar, word).flat[0])
From worker 16: /home/hp/.local/lib/python3.9/site-packages/numpy/core/getlimits.py:89: UserWarning: The value of the smallest subnormal for <class 'numpy.float32'> type is zero.
From worker 16: return self._float_to_str(self.smallest_subnormal)
From worker 18: Building forward operator
From worker 50: Building forward operator
From worker 3: /home/hp/.local/lib/python3.9/site-packages/numpy/core/getlimits.py:500: UserWarning: The value of the smallest subnormal for <class 'numpy.float32'> type is zero.
From worker 3: setattr(self, word, getattr(machar, word).flat[0])
From worker 3: /home/hp/.local/lib/python3.9/site-packages/numpy/core/getlimits.py:89: UserWarning: The value of the smallest subnormal for <class 'numpy.float32'> type is zero.
From worker 3: return self._float_to_str(self.smallest_subnormal)
```
|
Beta Was this translation helpful? Give feedback.
addprocs
add julia workers so it will process shots in parallel. However, If you are on a single node now, the resources are shared by all these workers, including memory and CPU cores, and will compete for it. So it will get quite slower because now the process spends time waiting for an available core, and you will run out of memory much quicker than if you were using x32 or x64 more memory.So if you use
addprocs
you need to make sure that:GOMP_CPU_AFFINITY
, you can see an example here in section 3Hope this…