31

让Spark Mlib的预测性能再飞一会儿 - 简书

 6 years ago
source link: https://www.jianshu.com/p/84cfe6747ca7?
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
背景介绍 我们的系统有一小部分机器学习模型识别需求,因为种种原因,最终选用了Spark Mlib来进行训练和预测。Mlib的Pipeline设计很好地契合了一个机器学习流水线,在模型训练和效果验证阶段,pipeline可以简化开发流程,然而在预测阶段,mlib pipeline的表现有点差强人意。 问题描述 某个模型的输入为一个字符串,假设长度为N,在我们的场景里面这个N一般不会大于10。特...

About Joyk


Aggregate valuable and interesting links.
Joyk means Joy of geeK