I am working on the Airflow’s last grading exercise. In all of the Airflow’s implementation it is having the instruction like this:
For educational purposes, the proposed DAG uses pandas , SciPy , Great Expectations , and NumPy to handle the data within the Airflow instance. This is not desirable in real-life Airflow pipelines.
I understand that for the 1 or 2 labs it is good to keep it simple and provide all the implementation and use-cases in the single file to make it easy to run and execute.
However, this makes it difficult and confusing to learners on how the real production implementation/workflow will work using this operators in projects.
In the last exercise, at least it should demonstrate the real world use-case implementation or at least provide a code reference for the interested persons.
If any mentor or support person can help with the real-life implementation structure and template. It is a great help moving forward.