Similar Articles

Articles similar to the selected content.

Domain: www.startdataengineering.com Added: 2025-08-13 Status: ✓ Success
How to test PySpark code with pytest - 1. Introduction - 2. Ensure the code's logic is working as expected with tests - 3. Conclusion - 4. Further Reading - 5. References 1. Introduction Have you work...
Similar Articles (10 found)
πŸ” 78.5% similar
How to add tests to your data pipelines
https://www.startdataengineering.com/post/how-to-add-tests-to-your-data-pipeline/
How to add tests to your data pipelines Introduction Testing data pipelines are different from testing other applications, like a website backend. If ...
πŸ” View Similar Articles
πŸ” 78.3% similar
Setting up end-to-end tests for cloud data pipelines
https://www.startdataengineering.com/post/setting-up-e2e-tests/
Setting up end-to-end tests for cloud data pipelines - 1. Introduction - 2. Setting up services locally - 3. Writing an end-to-end data pipeline test ...
πŸ” View Similar Articles
πŸ” 71.5% similar
Automating data testing with CI pipelines, using Github Actions
https://www.startdataengineering.com/post/ci-data-test/
Automating data testing with CI pipelines, using Github Actions - 1. Introduction - 2. CI - 3. Sample project: Data testing with Github Actions - 4. C...
πŸ” View Similar Articles
πŸ” 70.4% similar
Data Engineering Projects
https://www.startdataengineering.com/post/data-engineering-projects/
Data Engineering Projects 1. Introduction Whether you are new to data engineering or have been in the data field for a few years, one of the most chal...
πŸ” View Similar Articles
πŸ” 70.3% similar
Data Engineering Best Practices - #1. Data flow & Code
https://www.startdataengineering.com/post/de_best_practices/
Data Engineering Best Practices - #1. Data flow & Code - 1. Introduction - 2. Sample project - 3. Best practices - 3.1. Use standard patterns that pro...
πŸ” View Similar Articles
πŸ” 69.3% similar
Python Essentials for Data Engineers
https://www.startdataengineering.com/post/python-for-de/
Python Essentials for Data Engineers - Introduction - Data is stored on disk and processed in memory - Practicing Python - Python basics - Python is u...
πŸ” View Similar Articles
πŸ” 69.2% similar
How to implement data quality checks with greatexpectations
https://www.startdataengineering.com/post/implement_data_quality_with_great_expectations/
How to implement data quality checks with greatexpectations - 1. Introduction - 2. Project overview - 3. Check your data before making it available to...
πŸ” View Similar Articles
πŸ” 68.7% similar
Build Data Engineering Projects, with Free Template
https://www.startdataengineering.com/post/data-engineering-projects-with-free-template/
Build Data Engineering Projects, with Free Template - 1. Introduction - 2. Run Data Pipeline - 3. Architecture and services in this template - 4. CI/C...
πŸ” View Similar Articles
πŸ” 68.2% similar
How to quickly set up a local Spark development environment?
https://www.startdataengineering.com/post/spark-local-setup/
How to quickly set up a local Spark development environment? - 1. Introduction - 2. Setup - 3. Use VSCode devcontainers to set up Spark environment - ...
πŸ” View Similar Articles
πŸ” 67.7% similar
Data Engineering Project for Beginners - Batch edition
https://www.startdataengineering.com/post/data-engineering-project-for-beginners-batch-edition/
Data Engineering Project for Beginners - Batch edition - 1. Introduction - 2. Objective - 3. Run Data Pipeline - 4. Architecture - 5. Code walkthrough...
πŸ” View Similar Articles 🟠 HN