Uber Open Sources JVM Profiler for Tracing Distributed JVMs

Uber open sourced a distributed profiler called JVM Profiler in late June. They built JVM Profiler to solve resource allocation issues they had with Apache Spark. Apache Spark is a popular framework for processing large data streams, of which Uber has many. JVM Profiler was built for Spark, but it's applicable to any JVM-based service or application.

Uber wanted the ability to correlate metrics across a large number of processes across tens of thousands of applications, running on thousands of machines. In their distributed environment, many Spark applications run on the same server, and each application has thousands of executors. Their existing tools could only monitor server-level metrics and did not allow them to monitor metrics for individual applications. They needed a solution that could collect metrics for each process and correlate them across processes for each application.

JVM Profiler is made up of three features that simplify collecting performance and resource usage metrics, and then publishing them to other systems (e.g. Apache Kafka) for further analysis.

A Java agent : allows collecting metrics on JVM processes in a distributed way.
Advanced profiling capabilities : allows tracing arbitrary methods and arguments without code changes. Makes it possible to identify slow method calls in Spark applications, and identify hot files in HDFS file paths.
Data analytics reporting : allows for faster data analytics via Kafka topics and Apache Hive tables.

JVM Profiler has a simple and extensible design, which allows you to add additional profiler implementation and collect more metrics. This also allows you to add your own custom reporter for publishing metrics.

Uber's blog post on JVM Profiler has additional information on how to add a custom reporter, as well as how to use it to trace your own applications.

Uber used JVM Profiler on one of their largest Spark applications and was able to reduce the memory allocation for each executor by 2GB, going from 7GB to 5GB. They were able to save 2TB of memory for this application alone.

JVM Profiler is on GitHub at https://github.com/uber-common/jvm-profiler . Pull requests are encouraged!

Recommend

Bat – A Cat Clone With Syntax Highlighting And Git Integration

The best To-Do List System

汽车融资租赁的赛道上，哪些玩家会脱颖而出？

从菊姐到Giao哥，“土味儿”毁了谁又成就了谁

到家O2O会是传统商超的救命稻草吗？

疑似华住1.3亿用户数据泄露 37万元在暗网售卖|股价盘前暴跌8%

“第一性原理“不应该是玄学

vue.js响应式原理解析与实现 - 陈陈jg

GitHub - SandboxEscaper/randomrepo: Repo for random stuff...

大多数人的平庸，来自于极差的执行力

About Joyk