Similar Articles

Articles similar to the selected content.

Domain: www.startdataengineering.com Added: 2025-08-28 Status: βœ“ Success
www.startdataengineering.com
How to quickly set up a local Spark development environment? - 1. Introduction - 2. Setup - 3. Use VSCode devcontainers to set up Spark environment - 4. Conclusion - 5. Read these 1. Introduction Sett...
Similar Articles (10 found)
πŸ” 77.1% similar
Docker Fundamentals for Data Engineers
https://www.startdataengineering.com/post/docker-for-de/
Docker Fundamentals for Data Engineers 1. Introduction Docker can be overwhelming to start with. Most data projects use Docker to set up the data infr...
πŸ” View Similar Articles
πŸ” 76.1% similar
Setting up a local development environment for python data projects using Docker
https://www.startdataengineering.com/post/local-dev/
Setting up a local development environment for python data projects using Docker - 1. Introduction - 2. Set up - 3. Reproducibility - 4. Developer erg...
πŸ” View Similar Articles
πŸ” 70.0% similar
Build Data Engineering Projects, with Free Template
https://www.startdataengineering.com/post/data-engineering-projects-with-free-template/
Build Data Engineering Projects, with Free Template - 1. Introduction - 2. Run Data Pipeline - 3. Architecture and services in this template - 4. CI/C...
πŸ” View Similar Articles
πŸ” 68.2% similar
How to test PySpark code with pytest
https://www.startdataengineering.com/post/test-pyspark/
How to test PySpark code with pytest - 1. Introduction - 2. Ensure the code’s logic is working as expected with tests - 3. Conclusion - 4. Further Rea...
πŸ” View Similar Articles
πŸ” 66.5% similar
Visual Studio Code (VSCode) extensions for data engineers
https://www.startdataengineering.com/post/vscode-extensions-for-data-engineers/
Visual Studio Code (VSCode) extensions for data engineers - 1. Introduction - 2. Python environment setup - 3. VSCode Primer - 4. Extensions overview ...
πŸ” View Similar Articles
πŸ” 66.1% similar
Data Engineering Projects
https://www.startdataengineering.com/post/data-engineering-projects/
Data Engineering Projects 1. Introduction Whether you are new to data engineering or have been in the data field for a few years, one of the most chal...
πŸ” View Similar Articles
πŸ” 62.2% similar
Data Engineering Project for Beginners - Batch edition
https://www.startdataengineering.com/post/data-engineering-project-for-beginners-batch-edition/
Data Engineering Project for Beginners - Batch edition - 1. Introduction - 2. Objective - 3. Run Data Pipeline - 4. Architecture - 5. Code walkthrough...
πŸ” View Similar Articles 🟠 HN
πŸ” 60.8% similar
Why use Apache Airflow (or any orchestrator)?
https://www.startdataengineering.com/post/why-to-use-orchestrators/
Why use Apache Airflow (or any orchestrator)? - 1. Introduction - 2. Features crucial to building and maintaining data pipelines - 3. Conclusion - 4. ...
πŸ” View Similar Articles
πŸ” 60.4% similar
How to set up a dbt data-ops workflow, using dbt cloud and Snowflake
https://www.startdataengineering.com/post/cicd-dbt/
How to set up a dbt data-ops workflow, using dbt cloud and Snowflake - Introduction - Pre-requisites - Setting up the data-ops pipeline - Conclusion a...
πŸ” View Similar Articles
πŸ” 60.1% similar
End-to-end data engineering project - batch edition
https://www.startdataengineering.com/post/data-engineering-project-e2e/
End-to-end data engineering project - batch edition - Objective - Setup - Components - Choosing tools & frameworks - Future work & improvements - Conc...
πŸ” View Similar Articles