I am receiving a grade of zero for the second programming assignment in Week 1, despite all tests passing and getting an accuracy greater than 80% on the final image classifier.
When I look at the output from the grader, there seems to be errors with loading libraries and kernel communication:
[ValidateApp | INFO] Validating '/home/jovyan/work/submitted/courseraLearner/W1A2/Convolution_model_Application.ipynb' [ValidateApp | INFO] Executing notebook with kernel: python3 2021-05-28 00:40:47.761136: W tensorflow/stream_executor/platform/default/dso_loader.cc:59] Could not load dynamic library 'libcudart.so.10.1'; dlerror: libcudart.so.10.1: cannot open shared object file: No such file or directory 2021-05-28 00:40:47.761180: I tensorflow/stream_executor/cuda/cudart_stub.cc:29] Ignore above cudart dlerror if you do not have a GPU set up on your machine. 2021-05-28 00:40:49.250465: W tensorflow/stream_executor/platform/default/dso_loader.cc:59] Could not load dynamic library 'libcuda.so.1'; dlerror: libcuda.so.1: cannot open shared object file: No such file or directory 2021-05-28 00:40:49.250497: W tensorflow/stream_executor/cuda/cuda_driver.cc:312] failed call to cuInit: UNKNOWN ERROR (303) 2021-05-28 00:40:49.250515: I tensorflow/stream_executor/cuda/cuda_diagnostics.cc:156] kernel driver does not appear to be running on this host (ip-10-3-0-31.ec2.internal): /proc/driver/nvidia/version does not exist 2021-05-28 00:40:49.250684: I tensorflow/core/platform/cpu_feature_guard.cc:142] This TensorFlow binary is optimized with oneAPI Deep Neural Network Library (oneDNN)to use the following CPU instructions in performance-critical operations: AVX2 AVX512F FMA To enable them in other operations, rebuild TensorFlow with the appropriate compiler flags. 2021-05-28 00:40:49.277433: I tensorflow/core/platform/profile_utils/cpu_utils.cc:104] CPU Frequency: 2999995000 Hz 2021-05-28 00:40:49.279334: I tensorflow/compiler/xla/service/service.cc:168] XLA service 0x558281cf3880 initialized for platform Host (this does not guarantee that XLA will be used). Devices: 2021-05-28 00:40:49.279359: I tensorflow/compiler/xla/service/service.cc:176] StreamExecutor device (0): Host, Default Version [ValidateApp | ERROR] Timeout waiting for execute reply (30s). [ValidateApp | ERROR] Interrupting kernel [ValidateApp | ERROR] Timeout waiting for execute reply (30s). [ValidateApp | ERROR] Interrupting kernel Success! Your notebook passes all the tests.
Is it possible that something broke on the grading backend? Or is it more likely that I am doing something wrong?