Based on the LinkedIn Learning course End-to-End Data Engineering Project.
Note: this repo contains an altered dump of the original dataset, with invalid foreign keys (FKs) added on purpose, for educational purposes only. The original data and the repo's licence can be found at: https://github.com/LinkedInLearning/end-to-end-data-engineering-project-4413618
The current implementation follows a classic star schema–based data warehouse architecture. We may consider migrating to a Medallion-style approach in future iterations.
- Python Poetry
- GNU Make
- Podman or Docker
- (Optional) Any SQL client of your choice
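As a quick sanity check before running the setup targets, the prerequisites above can be verified from a shell. This is only a sketch, not part of the repo: the helper name `missing_tools` is hypothetical, and it assumes the tools are exposed on `PATH` as `poetry`, `make`, and either `podman` or `docker`. It only checks that a `make` binary exists, not that it is GNU Make.

```shell
#!/usr/bin/env sh
# Hypothetical helper: print each given command that is not on PATH, one per line.
missing_tools() {
    for tool in "$@"; do
        command -v "$tool" >/dev/null 2>&1 || printf '%s\n' "$tool"
    done
}

# Poetry and Make are both required.
missing=$(missing_tools poetry make)

# Either container engine is enough, so only flag it when both are absent.
if [ -n "$(missing_tools podman)" ] && [ -n "$(missing_tools docker)" ]; then
    missing="$missing podman-or-docker"
fi

if [ -n "$missing" ]; then
    # Left unquoted on purpose so the names collapse onto one line.
    echo "Missing prerequisites:" $missing
else
    echo "All prerequisites found."
fi
```

The optional SQL client is left out of the check because any client will do.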
make env-setup
make db-run db-reset
Note: The `db-reset` command will drop and recreate the database, so use it with caution.
make dbt-debug
Note: If you encounter connection issues, ensure that the database container is running and that your connection settings in `.env` are correct.
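The two checks in the note above can be done by hand along these lines. This is a rough sketch under assumptions: the helper name `pick_engine` is hypothetical, the container's name is not known here (so all running containers are listed), and `.env` is assumed to sit in the directory the script is run from. The file's contents are deliberately not printed, since it may hold credentials.

```shell
#!/usr/bin/env sh
# Hypothetical helper: print the first container engine found on PATH, if any.
pick_engine() {
    for engine in podman docker; do
        if command -v "$engine" >/dev/null 2>&1; then
            printf '%s\n' "$engine"
            return 0
        fi
    done
    return 0
}

engine=$(pick_engine)
if [ -n "$engine" ]; then
    # List running containers; the database container should appear here.
    "$engine" ps || echo "Could not list containers with $engine."
else
    echo "Neither podman nor docker was found on PATH."
fi

# Only confirm that the settings file exists; do not echo its contents.
if [ -f .env ]; then
    echo ".env found."
else
    echo ".env not found in $(pwd)."
fi
```

If the database container is missing from the list, starting it again with `make db-run` and then re-running `make dbt-debug` is the natural next step.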
make dbt-build