You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
See analysis in test_sanity fails. The root cause is the same: #36
The root cause seems to be interaction between UCP and UCX in docker environment in the test machine.
Default behavior of Worker class is to not pass any data plane hostname and port.
UCP data plane defaults are used, which uses ucp.get_address(), which returns 0.0.0.0 at this host and random port.
The UCP server and client actually use 172.17.0.1 what is a docker0 interface and it doesn't work with error message wireup_cm.c:597 UCX DIAG client ep 0x733c91e49080 connect to 172.17.0.1:38467 failed: device docker0 is not enabled, enable it in UCX_NET_DEVICES or use corresponding ip address
JIRA: DLIS-7830
The example fails
single_file.py
with unreachable UCX error:Default example configuration of network interface used for UCX doesn't work at this machine with running docker.
Reproduction
Use branch https://github.com/triton-inference-server/triton-distributed/tree/piotrm-add-nats-hosts
Start NATS.io server
Start example with default host and port passed:
Expected result
The example sends requests and processes them in workers and return with no error.
Results
Log indicates no request was processed:
Error logs analysis
The most important output logs are not printed at output but pushed into several log files:
All logs zip: logs.zip
It is necessary to inspect all other them to identify root cause of failure.
Log
encoder_decoder.0.ab22e895-d51e-11ef-9355-88a4c2b6c3a5.2511.stdout.log
:Log
encoder_decoder.0.ab22e895-d51e-11ef-9355-88a4c2b6c3a5.2511.stderr.log
:The above exception was the direct cause of other exceptions.
Network configuration of the docker instance
Network configuration in docker:
Python workers analysis
All python thread stopped at idle:
The text was updated successfully, but these errors were encountered: