C4W4 Capstone Project Part 1 - ETL and Data Modeling: Landing Zone Job run failed

Dear Support Team,

I was able to successfully execute all three Landing Zone jobs. However, due to other priorities, I couldn’t continue with the next part of the notebook on the same day.

When I later tried to restart the process from the beginning, two jobs consistently failed:

  • de-c4w4a1-api-users-extract-job
  • de-c4w4a1-api-sessions-extract-job

I made sure to use the refreshed API endpoint in both the notebook and glue.tf, but the issue persists despite multiple attempts. I’ve also attached a screenshot of the job run status for your reference.

With only one week left in my subscription and the final capstone project remaining, I’m hoping to resolve this issue quickly so I can complete the course without needing to extend my subscription.

I would greatly appreciate your help in identifying and fixing the problem.

Thank you very much in advance.
Namuna

Hello @Namuna,

I could reproduce your issue. It seems you left out the http:// prefix and the /users or /sessions path when you replaced the API endpoint in the --api_url of the users and sessions extract jobs:

Use this --api_url for /users:

    # Replace the placeholder <API-ENDPOINT> with the value from the CloudFormation outputs
    "--api_url"             = "http://ec2***.compute-1.amazonaws.com/users"
    # Notice the target path. This line of code is complete - no changes are required
    "--target_path"         = "s3://${var.data_lake_bucket}/landing_zone/api/users"

Use this --api_url for /sessions:

    # Replace the placeholder <API-ENDPOINT> with the value from the CloudFormation outputs
    "--api_url"             = "http://ec2***.compute-1.amazonaws.com/sessions"
    # Notice the target path. This line of code is complete - no changes are required
    "--target_path"         = "s3://${var.data_lake_bucket}/landing_zone/api/sessions"
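
If you want to double-check the values before re-running the jobs, here is a minimal sketch of a sanity check you could run in a notebook cell. It is not part of the course materials: the function name `check_api_url` and the host `ec2-host.example.com` are hypothetical placeholders, and it assumes the endpoint is served over plain http as in the snippets above.

```python
from urllib.parse import urlsplit


def check_api_url(api_url, expected_path):
    """Return a list of problems found in api_url (empty list if none).

    Hypothetical helper: checks for the http:// prefix and the
    expected resource path (for example "/users" or "/sessions").
    """
    problems = []
    parts = urlsplit(api_url)
    # Without a scheme, urlsplit leaves scheme empty and puts everything in path.
    if parts.scheme != "http":
        problems.append("missing http:// prefix")
    # Ignore a trailing slash when comparing the resource path.
    if not parts.path.rstrip("/").endswith(expected_path):
        problems.append(f"missing {expected_path} path")
    return problems


# Placeholder host; substitute the value from your CloudFormation outputs.
print(check_api_url("ec2-host.example.com", "/users"))
print(check_api_url("http://ec2-host.example.com/users", "/users"))
```

An empty list means the value has both the prefix and the path; anything else names the part that is missing.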