GitHub - uber/hudi: Spark Library for Hadoop Upserts And Incrementals

5 years ago

source link: https://github.com/uber/hudi
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

README.md

Hudi

Hudi (pronounced Hoodie) stands for Hadoop Upserts anD Incrementals. Hudi manages storage of large analytical datasets on HDFS and serve them out via two types of tables

Read Optimized Table - Provides excellent query performance via purely columnar storage (e.g. Parquet)
Near-Real time Table (WIP) - Provides queries on real-time data, using a combination of columnar & row based storage (e.g Parquet + Avro)

For more, head over here

Recommend

JLocalDateTime Class API Guide in Java 8
An Introduction to GPU Programming in Julia
Should You Learn TypeScript? (Benefits & Resources)
Distributed Filesystems for Deep Learning
OOP Is Dead, Long Live OOP
Fuzz in sixty seconds
Using roughtime as a “cryptographic notary”
原来你是这样的 Stream：浅析 Java Stream 实现原理
Open Source at Uber: A Conversation with Yuri Shkuro, Jaeger Project Lead
OpenBSD 6.4

About Joyk

Aggregate valuable and interesting links.
Joyk means Joy of geeK