This project extracts, transforms, and loads (ETL) log data into BigQuery, using dbt for the transformations and Terraform for the infrastructure.

Requirements:

- Google Cloud Platform (GCP) account
- BigQuery
- dbt
- Terraform

Setup:

- Create a GCP project and enable the BigQuery API
- Create a BigQuery dataset to store the log data
- Use Terraform to provision the required resources, such as a GCS bucket for storing log files
- Use dbt to create the required tables and define the transformations applied to the log data
- Run the ETL pipeline to load the log data into BigQuery
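
The setup steps above can be sketched as a short Python helper that assembles the corresponding CLI calls. This is only an illustration under stated assumptions, not the project's own tooling: it assumes the GCP project already exists, that the `gcloud`, `bq`, `terraform`, and `dbt` CLIs are installed and authenticated, and that the Terraform configuration lives in a directory named `terraform`. The function names, the directory name, and the example project and dataset IDs are all hypothetical.

```python
import shlex
import subprocess


def setup_commands(project_id, dataset, terraform_dir="terraform", dbt_dir="."):
    """Return the setup steps as argv lists, in the order listed above.

    project_id, dataset, terraform_dir and dbt_dir are placeholders;
    substitute the values used by your own checkout.
    """
    return [
        # Enable the BigQuery API for an existing project.
        ["gcloud", "services", "enable", "bigquery.googleapis.com",
         "--project", project_id],
        # Create the dataset that will hold the log data.
        ["bq", "mk", "--dataset", f"{project_id}:{dataset}"],
        # Provision the remaining resources (for example the GCS bucket).
        ["terraform", f"-chdir={terraform_dir}", "init"],
        ["terraform", f"-chdir={terraform_dir}", "apply"],
        # Build the dbt models that create and transform the tables.
        ["dbt", "run", "--project-dir", dbt_dir],
    ]


def run_all(commands, dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for argv in commands:
        print(shlex.join(argv))
        if not dry_run:
            # check=True stops at the first failing step.
            subprocess.run(argv, check=True)
```

Calling `run_all(setup_commands("my-project", "logs"))` only prints the commands, which makes it easy to review them before running with `dry_run=False`.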

Usage:

- Upload log files to the designated GCS bucket
- Run the dbt commands to transform the data
- Run the ETL pipeline to load the data into BigQuery
- Run the command: bash-etl.sh
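
As a rough sketch of the usage steps above, the following Python function builds the commands for one run in the listed order. Several details are assumptions rather than facts about this project: that log files end in `.log`, that uploads go through `gcloud storage cp`, that `dbt run` is the dbt command in question, and that the ETL script is executable and sits in the current directory as `./bash-etl.sh`. The function name and its parameters are hypothetical.

```python
from pathlib import Path


def usage_commands(log_dir, bucket, etl_script="./bash-etl.sh"):
    """Return argv lists for one pipeline run, in the order listed above.

    log_dir, bucket and etl_script are placeholders; the ".log" suffix
    and the script location are assumptions.
    """
    # Sort for a stable, reproducible upload order.
    log_files = sorted(Path(log_dir).glob("*.log"))

    # One upload command per log file, keeping the original file name.
    commands = [
        ["gcloud", "storage", "cp", str(path), f"gs://{bucket}/{path.name}"]
        for path in log_files
    ]
    # Run the dbt transformations.
    commands.append(["dbt", "run"])
    # Run the ETL script that loads the data into BigQuery.
    commands.append([etl_script])
    return commands
```

Each returned list can be passed to `subprocess.run(argv, check=True)` so that a failed upload or dbt run stops the remaining steps.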

The pipeline is designed to be customized to fit your needs. If you have questions or run into issues, contact the project maintainers.