Capstone Project Part 1 - Extract Glue Jobs failing

Both the de-c4w4a1-api-sessions-extract-job and the de-c4w4a1-api-users-extract-job jobs I have created are failing.

The only debug information I get through the AWS Glue console is an AnalysisException (thats the entirety of the stacktrace that is returned to the console). No other logging/debugging information seems to be available - the AWS account seems to be configured without permissions for me to view log output in Cloudwatch or the PySpark logs.

This makes it almost impossible to debug what is wrong with the jobs - but surely this is the approach one would take if one was doing this for real?

I don’t expect anyone to be able to diagnose the error (!) from the information I’ve got, but if anyone can provide guidance on where I might find more informative stack traces/output that would be much appreciated. There must be a better way than looking over terraform scripts with a microscope for typos.

FWIW, myde-c4w4a1-rds-extract-job is working correctly.

I managed to get the spark logs written to S3, downloaded them and opened them in the Spark history server.

I can’t see any stages or tasks that failed, and it appears that the job did write some output to s3: