Capstone Project Part 1 - ETL and Data Modeling

4.1.7. ETL and Data Modeling
I got failed on extract-job I do know what is the problem


I have try run job on AWS Glue studio. It successfully started job but why my terminal is Failed

Can someone please explain this to me, I don’t understand how it works and for what reason it failed.and how can I trouble shooting

  • Where should I start checking first?
  • and how can I reset the lab in case my code is getting worse

Hi,

When an AWS Glue Job fails, a good place to start is the AWS Glue Console:

  1. Go to AWS Glue > Jobs.
  2. Click on the job name.
  3. Under the Runs tab, select the failed run.
  4. You’ll find a Logs link and an error message summary—these often point to what went wrong.

Hi @benjamintabaresjr could you please help me?
this is the error I got. I not understand what is the problem. is it because the Script or what?



Is it possible that it has something to do with API URL?

Hi,

In step 2.1:

Go to the AWS console and search for **CloudFormation**. Click on the alphanumeric stack name and then open the **Outputs** tab. You will see the key APIEndpoint, copy the corresponding **Value**. Open the glue file terraform/modules/extract_job/glue.tf, replace the <API_ENDPOINT> placeholders with the API Endpoint value (in two places).

Each time you start the lab, you’ll need to repeat this step since the API endpoint changes.

Hi, @benjamintabaresjr
I tried what you suggested but it still doesn’t work.I got the same error

Hi,

Does it return the same error?

Could you go to Run details tab and share a screenshot of the error message? Thanks

Hi, @benjamintabaresjr

on 4.2.6. I have run this step


In Terminal, it should give me a message to enter Yes, but it gives me a message to enter the db value name.

How can I solve this problem

Thank you in advance

Hi,

Please run source scripts/setup.sh the very first thing.

This step defines the environment variables needed for Terraform. If Terraform ever prompts you to enter the values for these variables, it means they are not set up.

Running source scripts/setup.sh defines these variables.

Hi @benjamintabaresjr

I just don’t quite understand this question. I’m not sure what I need to fill in schema.yml?
Do I have to copy and paste it from the example file?


Hi,

You can use the schema.yml from the example folder as a template.

This file contains the schema definition for the fact and dimension tables.