1

Fully Orchestrating Databricks from Airflow [Video]

 2 years ago
source link: https://www.inovex.de/de/blog/orchestrating-databricks-with-airflow/
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
20.12.2021

Fully Orchestrating Databricks from Airflow [Video]

Lesezeit 0 ​​min
Home / Blog / Fully Orchestrating Databricks from Airflow [Video]

In this talk given at WeAreDevelopers‘ Python Day we will introduce how to use the popular cloud service Databricks to host Apache Spark applications for distributed data processing in combination with Apache Airflow, an orchestration framework for ETL batch workflows.

YouTube

By loading the video, you agree to YouTube’s privacy policy.
Learn more

Load video

Always unblock YouTube

After a brief exploration of the Databricks Workspace and the fundamentals of Airflow we will take a
deeper look into the functionality Databricks provides in Airflow for orchestrating its workspace. Afterwards, we will find out how to extend and customize that functionality to manage virtually every aspect of the Databricks Workspace from Airflow.

This talk does not require any prior knowledge of Databricks, Spark or Airflow but it does assume familiarity with the fundamentals of the Python programming language especially object oriented programming and REST api requests. The actual distributed data processing with Apache Spark itself is not the focus of this talk.

About Alan Mazankiewicz

Alan finished his Master’s degree from Karlsruhe Institute of Technology in Information Engineering and Management in 2020 before starting his career as a Machine Learning Engineer at inovex GmbH in Cologne, Germany. He (co-) authored two scientific papers in the area of machine learning published at major journals and conferences and is a regular contributor to the open source community.

Share:

About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK