(TBD)
terraform
: Creates project infrastructure (GCS & BigQuery)docker
config to containerize Postgres, Airflow & Sparkairflow
: Workflows (DAGs) for ingestion (extraction) of raw data to Data Lake (GCS) & DWH (BigQuery)dbt
: Workflows to transform DWH data to queryable viewsspark
: Transformation of Raw Data (GCS) to DWH (BigQuery), orchestrated by Airflowkafka
: ingesting streaming data