Hi,
I am at the final stage of the C3W3 assignment “Distributed MultiWorker Tensorflow Training…”.
At this point, the following command is expected to show “Running” in the STATUS column for every worker pod, but that is not what I get.
kubectl get pods
The expected output is:
NAME                    READY   STATUS    RESTARTS   AGE
multi-worker-worker-0   0/1     Running   0          21m
multi-worker-worker-1   0/1     Running   0          21m
multi-worker-worker-2   0/1     Running   0          21m
Instead, I get the output below, where the STATUS column shows ErrImagePull or ImagePullBackOff rather than “Running”:
NAME                    READY   STATUS             RESTARTS   AGE
multi-worker-worker-0   0/1     ErrImagePull       0          21m
multi-worker-worker-1   0/1     ImagePullBackOff   0          21m
multi-worker-worker-2   0/1     ErrImagePull       0          21m
When I run the following command to retrieve the logs:
kubectl logs --follow ${JOB_NAME}-worker-0
The output is:
Error from server (BadRequest): container "tensorflow" in pod "multi-worker-worker-0" is waiting to start: trying and failing to pull image
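
For reference, here is a minimal shell sketch for picking out the pods that are not in the Running state from the `kubectl get pods` listing. The helper name `not_running` is hypothetical (it is not part of the assignment), and it assumes the default column layout NAME READY STATUS RESTARTS AGE.

```shell
# Hypothetical helper: reads `kubectl get pods` output on stdin and prints
# the name and status of every pod whose STATUS is not "Running".
# Assumes the default columns: NAME READY STATUS RESTARTS AGE.
not_running() {
  # Skip the header row (NR > 1); column 3 is STATUS, column 1 is NAME.
  awk 'NR > 1 && $3 != "Running" { print $1, $3 }'
}

# Usage (requires access to the cluster):
#   kubectl get pods | not_running
#   kubectl describe pod multi-worker-worker-0   # the Events section usually shows why the pull failed
```

The `kubectl describe pod` line is only a suggestion for where the pull error details are likely to appear. It does not change anything on the cluster.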