Similar Articles

Articles similar to the selected content.

Domain: www.startdataengineering.com Added: 2025-08-13 Status: ✓ Success
How to submit Spark jobs to EMR cluster from Airflow Table of Contents Introduction I have been asked and seen the questions how others are automating apache spark jobs on EMR how to submit spark jobs...
Similar Articles (10 found)
🔍 72.5% similar
How to trigger a spark job from AWS Lambda
https://www.startdataengineering.com/post/trigger-emr-spark-job-from-lambda/
How to trigger a spark job from AWS Lambda - Event driven pipelines - Lambda function to trigger spark jobs - Setup and run - Monitoring and logging -...
🔍 70.9% similar
Data Engineering Project for Beginners - Batch edition
https://www.startdataengineering.com/post/data-engineering-project-for-beginners-batch-edition/
Data Engineering Project for Beginners - Batch edition - 1. Introduction - 2. Objective - 3. Run Data Pipeline - 4. Architecture - 5. Code walkthrough...
🟠 HN
🔍 68.1% similar
Why use Apache Airflow (or any orchestrator)?
https://www.startdataengineering.com/post/why-to-use-orchestrators/
Why use Apache Airflow (or any orchestrator)? - 1. Introduction - 2. Features crucial to building and maintaining data pipelines - 3. Conclusion - 4. ...
🔍 66.6% similar
How to Backfill a SQL query using Apache Airflow
https://www.startdataengineering.com/post/how-to-backfill-sql-query-using-apache-airflow/
How to Backfill a SQL query using Apache Airflow - What is backfilling? - Setup - Apache Airflow - Execution Day - Backfill - Conclusion - Further Re...
🔍 65.2% similar
3 Key techniques, to optimize your Apache Spark code
https://www.startdataengineering.com/post/how-to-optimize-your-spark-jobs/
3 Key techniques, to optimize your Apache Spark code - Intro - Distributed Systems - Setup - Optimizing your spark code - Technique 1: reduce data shu...
🔍 62.9% similar
Build Data Engineering Projects, with Free Template
https://www.startdataengineering.com/post/data-engineering-projects-with-free-template/
Build Data Engineering Projects, with Free Template - 1. Introduction - 2. Run Data Pipeline - 3. Architecture and services in this template - 4. CI/C...
🔍 62.0% similar
Apache Airflow Review: the good, the bad
https://www.startdataengineering.com/post/apache-airflow-review-the-good-the-bad/
Apache Airflow Review: the good, the bad When getting started with Apache Airflow, data engineers have questions similar to the two below “What are p...
🔍 61.2% similar
Data Engineering Projects
https://www.startdataengineering.com/post/data-engineering-projects/
Data Engineering Projects 1. Introduction Whether you are new to data engineering or have been in the data field for a few years, one of the most chal...
🔍 59.8% similar
Scheduling a SQL script, using Apache Airflow, with an example
https://www.startdataengineering.com/post/how-to-schedule-a-sql-script-using-apache-airflow-with-an-example/
Scheduling a SQL script, using Apache Airflow, with an example One of the most common use cases for Apache Airflow is to run scheduled SQL scripts. De...
🔍 58.7% similar
Designing a Data Project to Impress Hiring Managers
https://www.startdataengineering.com/post/data-engineering-project-to-impress-hiring-managers/
Designing a Data Project to Impress Hiring Managers - Introduction - Objective - Setup - Project - Future Work - Tear down infra - Conclusion - Furthe...