Similar Articles

Articles similar to the selected content.

Domain: www.startdataengineering.com Added: 2025-08-13 Status: ✓ Success
Writing memory efficient data pipelines in Python - Introduction - 1. Using generators - 2. Using distributed frameworks - Conclusion - Further reading - References - Introduction: If you are wondering h...
Similar Articles (10 found)
🔍 68.4% similar
Python Essentials for Data Engineers
https://www.startdataengineering.com/post/python-for-de/
Python Essentials for Data Engineers - Introduction - Data is stored on disk and processed in memory - Practicing Python - Python basics - Python is u...
🔍 64.9% similar
Why use Apache Airflow (or any orchestrator)?
https://www.startdataengineering.com/post/why-to-use-orchestrators/
Why use Apache Airflow (or any orchestrator)? - 1. Introduction - 2. Features crucial to building and maintaining data pipelines - 3. Conclusion - 4. ...
🔍 64.8% similar
How to Scale Your Data Pipelines
https://www.startdataengineering.com/post/scale-data-pipelines/
How to Scale Your Data Pipelines - 1. Introduction - 2. What is scaling & why do we need it? - 3. Types of scaling - 4. Choose your scaling strategy -...
🔍 64.4% similar
Data Engineering Projects
https://www.startdataengineering.com/post/data-engineering-projects/
Data Engineering Projects 1. Introduction Whether you are new to data engineering or have been in the data field for a few years, one of the most chal...
🔍 64.2% similar
Building Cost Efficient Data Pipelines with Python & DuckDB
https://www.startdataengineering.com/post/cost-effective-pipelines/
Building Cost Efficient Data Pipelines with Python & DuckDB - 1. Introduction - 2. Project demo - 3. TL;DR - 4. Considerations when building pipelines...
🔍 63.8% similar
End-to-end data engineering project - batch edition
https://www.startdataengineering.com/post/data-engineering-project-e2e/
End-to-end data engineering project - batch edition - Objective - Setup - Components - Choosing tools & frameworks - Future work & improvements - Conc...
🔍 63.1% similar
Should Data Pipelines in Python be Function based or Object-Oriented (OOP)?
https://www.startdataengineering.com/post/python-fp-v-oop/
Should Data Pipelines in Python be Function based or Object-Oriented (OOP)? - 1. Introduction - 2. Data transformations as functions lead to maintaina...
🔍 62.1% similar
How to make data pipelines idempotent
https://www.startdataengineering.com/post/why-how-idempotent-data-pipeline/
How to make data pipelines idempotent - What is an idempotent function - Pre-requisites - Why idempotency matters - Making your data pipeline idempote...
🔍 61.5% similar
Data Engineering Project for Beginners - Batch edition
https://www.startdataengineering.com/post/data-engineering-project-for-beginners-batch-edition/
Data Engineering Project for Beginners - Batch edition - 1. Introduction - 2. Objective - 3. Run Data Pipeline - 4. Architecture - 5. Code walkthrough...
🔍 61.3% similar
How to choose the right tools for your data pipeline
https://www.startdataengineering.com/post/choose-tools-dp/
How to choose the right tools for your data pipeline 1. Introduction If you are building data pipelines from the ground up, the number of available da...