10

一文带你过完 Spark RDD的基础概念

 4 years ago
source link: https://juejin.im/post/5e3e0a966fb9a07cbf46a53f
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
前言 上一篇权当吹水了,从这篇开始进入正题。 二、Spark 的内存计算框架(重点😶) RDD(Resilient Distributed Dataset)叫做 弹性分布式数据集 ,是Spark中最基本的数据抽象,它代表一个不可变、可分区、里面的元素可并行计

About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK