UPDATE: I was reading other posts
- Maybe the problem is originated from the glue.tf date format
'''terraform/modules/transform_job/glue.tf'''
# Set "--ingest_date" to the server's current date in Pacific Time (UTC-7), in "yyyy-mm-dd" format.
# (replace the placeholder `<PACIFIC-TIME-CURRENT-DATE>`)
"--ingest_date" = "yyyy-mm-dd"
# Review the users source path
- Perhaps the solution is to declare something into the terraform/assets/transform_job/
*
.py files to correctly declare the datetime format or something else (JUST a crazy hipothesis)
- Or mayble use a crazy method from the terraform API
I will update later or you can update this post too
Hello @Hvnt3rK3ys
In the part "--ingest_date" = "yyyy-mm-dd"
of the glue.tf
files, you are supposed to replace yyyy-mm-dd
with the current pacific date. Did you do so?
If you want to see the status of your glue jobs in the UI, you can search for AWS Glue in the AWS Console, select ETL jobs from the left hand-side menu, choose a job, and choose a run for that job. There you can see the status of all the glue jobs’ runs.
1 Like
I was into the AWS gui for Glue looking for the Monitoring section:
ETL jobs
- Visual ETL
- Notebooks
- Job run monitoring <--- I think this option is the correct
Then, aws told me that:
Job runs summary
Error encountered
AccessDeniedException (status: 403): Unauthorized access for account id 719******
So in the IAM section appears:
Access denied
You do not have permission to iam:GetAccountSummary . To request access, copy the following text and send it to your AWS administrator. Learn more about troubleshooting access denied issues.
Now what ??
FIXED: Its easier than it seems
Into the datetime options
'''terraform/modules/transform_job/glue.tf'''
# Set "--ingest_date" to the server's current date in Pacific Time (UTC-7), in "yyyy-mm-dd" format.
# (replace the placeholder `<PACIFIC-TIME-CURRENT-DATE>`)
"--ingest_date" = "<PUT Todays date into the format>"
# Review the users source path
To debug using de CLI:
## GET the current ID
aws glue start-job-run --job-name <JOB-NAME> | jq -r '.JobRunId'
## EXECUTE the order
aws glue get-job-run --job-name <JOB-NAME> --run-id <JobRunID> --output text --query "JobRun.JobRunState"
## CHECK if something explode
aws glue get-job-run --job-name <JOB-NAME> --run-id <job_run_id>
Wait until you hace SUCCED test
❯ aws glue get-job-run --job-name de-c4w4a1-songs-transform-job --run-id jr_6981b5a4b5557140147044ed18c93da1f89a00c36d102c5426fd042b4acdfb9f
{
"JobRun": {
"Id": "jr_6981b5a4b5557140147044ed18c93da1f89a00c36d102c5426fd042b4acdfb9f",
"Attempt": 0,
"JobName": "de-c4w4a1-songs-transform-job",
"JobMode": "SCRIPT",
"JobRunQueuingEnabled": false,
"StartedOn": 1735594749.885,
"LastModifiedOn": 1735594873.402,
"CompletedOn": 1735594873.402,
"JobRunState": "SUCCEEDED",
"PredecessorRuns": [],
"AllocatedCapacity": 2,
"ExecutionTime": 107,
"Timeout": 5,
"MaxCapacity": 2.0,
"WorkerType": "G.1X",
"NumberOfWorkers": 2,
"LogGroupName": "/aws-glue/jobs",
"GlueVersion": "4.0"
}
}
@Hvnt3rK3ys happy to hear that the issue has been fixed. In order to see the runs status, you don’t need to go to the monitoring section in the UI. You can simply select the job, and there is a table including the job’s runs and their status.
1 Like