Similar Articles

Articles similar to the selected content.

Domain: www.startdataengineering.com Added: 2025-08-13 Status: ✓ Success
Docker Fundamentals for Data Engineers 1. Introduction Docker can be overwhelming to start with. Most data projects use Docker to set up the data infra locally (and often in production). Setting up da...
Similar Articles (10 found)
🔍 77.9% similar
Setting up a local development environment for python data projects using Docker
https://www.startdataengineering.com/post/local-dev/
Setting up a local development environment for python data projects using Docker - 1. Introduction - 2. Set up - 3. Reproducibility - 4. Developer erg...
🔍 77.1% similar
How to quickly set up a local Spark development environment?
https://www.startdataengineering.com/post/spark-local-setup/
How to quickly set up a local Spark development environment? - 1. Introduction - 2. Setup - 3. Use VSCode devcontainers to set up Spark environment - ...
🔍 69.6% similar
Build Data Engineering Projects, with Free Template
https://www.startdataengineering.com/post/data-engineering-projects-with-free-template/
Build Data Engineering Projects, with Free Template - 1. Introduction - 2. Run Data Pipeline - 3. Architecture and services in this template - 4. CI/C...
🔍 67.5% similar
Data Engineering Projects
https://www.startdataengineering.com/post/data-engineering-projects/
Data Engineering Projects 1. Introduction Whether you are new to data engineering or have been in the data field for a few years, one of the most chal...
🔍 64.2% similar
Where I Use Docker Containers
http://mcottondesign3.appspot.com/post/ahRzfm1jb3R0b25kZXNpZ24zLWhyZHIRCxIEQmxvZxiAgICM8OGNCQw
Where I Use Docker Containers Skipping the hype around Docker, Kubernetes, and containers in general; I wanted to talk through how I use them and wher...
🔍 64.2% similar
End-to-end data engineering project - batch edition
https://www.startdataengineering.com/post/data-engineering-project-e2e/
End-to-end data engineering project - batch edition - Objective - Setup - Components - Choosing tools & frameworks - Future work & improvements - Conc...
🔍 64.0% similar
How to test PySpark code with pytest
https://www.startdataengineering.com/post/test-pyspark/
How to test PySpark code with pytest - 1. Introduction - 2. Ensure the code’s logic is working as expected with tests - 3. Conclusion - 4. Further Rea...
🔍 63.7% similar
Data Engineering Project for Beginners - Batch edition
https://www.startdataengineering.com/post/data-engineering-project-for-beginners-batch-edition/
Data Engineering Project for Beginners - Batch edition - 1. Introduction - 2. Objective - 3. Run Data Pipeline - 4. Architecture - 5. Code walkthrough...
🔍 63.5% similar
Ask HN: What is the best source to learn Docker in 2023?
https://news.ycombinator.com/item?id=34563353
Ask HN: What is the best source to learn Docker in 2023? 172 points by lukasfischer on Jan 29, 2023 | 78 comments...
🔍 62.9% similar
Why use Apache Airflow (or any orchestrator)?
https://www.startdataengineering.com/post/why-to-use-orchestrators/
Why use Apache Airflow (or any orchestrator)? - 1. Introduction - 2. Features crucial to building and maintaining data pipelines - 3. Conclusion - 4. ...