task_rust.sh and task_cpp_unittest.sh fail with updated Docker images when USE_VITIS_AI ON
#10696
Comments
Skipping |
@leandron I am looking at this. Not sure what is going wrong yet.
The CI update for I've hit this error twice today.
This seems to happen when tensorflow is loaded inside pyxir. I don't know exactly why it started with tensorflow 2.6, but the issue looks very similar to triton-inference-server/server#3777, with tensorflow in place of pytorch. I have a workaround that loads tensorflow only when it is actually needed. I am currently verifying it locally with the ci-cpu docker image and will create a PR once that succeeds.
Based on results at https://ci.tlcpack.ai/job/docker-images-ci/job/docker-image-run-tests/82/, which include #10858 (up to fcdf463 in the repo), we can update the Docker images now, as the bug reported here is fixed.
It looks like there is an error happening in ./tests/scripts/task_rust.sh when we use updated images that include Python 3.7, TensorFlow 2.6 and h5py 3.1.0. The error only happens when USE_VITIS_AI ON is set in ./tests/scripts/task_config_build_cpu.sh.
This is causing me issues when testing TensorFlow 2.6 images. I tried downgrading h5py and repeating the steps, without success. In any case, TensorFlow 2.6 requires h5py>3, so downgrading is not a viable solution.
Steps to reproduce:
Then you should see an error similar to this:
By setting USE_VITIS_AI OFF in ./tests/scripts/task_config_build_cpu.sh and repeating the steps, the issue doesn't reproduce.
Full output in Jenkins: https://ci.tlcpack.ai/blue/organizations/jenkins/docker-images-ci%2Fdocker-image-run-tests/detail/docker-image-run-tests/69/pipeline/
cc @jtuyls @anilmartha can you help me troubleshoot this?