Lesser-Known Tips on Apache Oozie

4 years ago

source link: https://towardsdatascience.com/lesser-known-tips-on-apache-oozie-1e9bee9169da?gi=cd7dcdd3dc23
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

Tips and best practices for job scheduling using Apache Oozie

Xinran Waibel

Nov 24 ·4min read

1*uoVl2GcziNS1uEHIt9wlOg.png?q=20

Source: Apache Oozie

At work, I build automated data pipelines that perform ETL/ELT on millions of rows of data on a daily basis and one of the job schedulers widely used in my team is Apache Oozie . Oozie makes it easy to schedule and coordinate Hadoop jobs (such as MapReduce, Sqoop, Hive jobs), track job progresses, and recover from failures. Most importantly, Oozie is very scalable as it can run hundreds or even thousands of jobs concurrently!

I had a few painful debugging experiences with Oozie and I found that job scheduling with Oozie can be very tricky if you don’t know the mechanism behind Oozie’s scheduling system (which the official documentation itself does not explain much about.) In this blog post, I will demonstrate how to schedule Hadoop jobs with data dependency using Oozie, provide solutions to potential problems you may run into and explain its underlying mechanisms to help you understand how Oozie works behind-the-scenes.

Lesser-Known Tips on Apache Oozie

Tips and best practices for job scheduling using Apache Oozie

Recommend

刘强东：如果京东员工遭遇不幸，将负责子女学习生活费用到22岁

Understanding and Implementing Distributed Prioritized Experience Replay (Horgan...

1.1.1.1 — The free app that makes your Internet faster.

gir.st - A one-liner version number incrementor

GitHub - JoshMcguigan/estream: Parse file location info out error streams for an...

中国移动董事长杨杰：运营商这几年干的确实挺辛苦

刘强东:京东员工遭遇不幸的话将负责子女费用到22岁

历时3个月，我们在两家淘系女装店铺探索私域真相

性能号称“傲视群雄” 华为路由A2将于11月25日发布 - Huawei 华为 - cnBeta.COM

官宣：奥特曼加入漫威新片2020年见 - Disney - Marvel 漫威工作室 - cnBeta.COM

About Joyk