-
Notifications
You must be signed in to change notification settings - Fork 636
libcudart.so.10.1 (and others) are not in the built docker image #497
Comments
@reedwm Toby is the person who added Docker support for PerfZero. I don't have experience with docker and I don't know which part of service in our infra uses docker. I don't know who is able to maintain this feature now that Toby has left this project. |
@lindong28 I can look into the code and create a PR with the necessary changes |
Thank you @gaborvecsei for offering to fix this issue! If the PR is easy to review (e.g. it just changed a version), it will be great and I can just approve it. If the PR involves something that requires docker expertise, I will ask around and see who can help with this. |
I have the same problem.
Then i run the benchmark the error is similar:
Then i seraching for the libcuadart* i get:
@gaborvecsei Any other way to solve the problem? |
Having the same issue |
I found out that for me the dockerfile with Ubuntu 18.04 and Cuda 11.0 works, when using tensorflow version 2.4. The command to build the docker looks like this: |
Steps to reproduce the error:
python perfzero/lib/setup.py --tensorflow_pip_spec=tensorflow==2.3.0
docker run -it --gpus all --rm -v $(pwd):/workspace perfzero/tensorflow bash
python3 /workspace/perfzero/lib/benchmark.py --git_repos="https://github.com/tensorflow/models.git;benchmark" --python_path=models --gcloud_key_file_url="" --benchmark_methods=official.benchmark.keras _cifar_benchmark.Resnet56KerasBenchmarkSynth.benchmark_1_gpu_no_dist_strat
The benchmark starts but only on CPUs because of the error:
When I am searching for the "
libcudart.so.*
" (find / -name "libcudart.so.*"
) the results are the following:So only the wrong version is installed.
The text was updated successfully, but these errors were encountered: