Building Cost Efficient Data Pipelines with Python & DuckDB
- 1. Introduction
- 2. Project demo
- 3. TL;DR
- 4. Considerations when building pipelines with DuckDB
- 4.1. ⭐ Use DuckDB to process data, ...