Sampling RUM: When and why it's a good idea

MONDAY 16TH OF DECEMBER 2019

sampling-RUM.jpg?auto=format,compress&fit=clamp&max-w=1000

I confess, I’m not a statistician. While I pride myself on the 'A' I received in my college statistics class, admittedly it was on a pretty steep curve. That said, I’ve been looking at performance data for many years and have found myself on both sides of the debate about whether or not the practice of sampling performance data is inherently a good or bad idea.

When it comes to real user monitoring (RUM), I’m convinced that the marginal cost of collection, computation, storage, etc. is not always great enough to warrant a practice of collecting ALL THE THINGS by default.

Like any experiment, how you sample RUM data – as well as how much data to sample – depends on the answers you seek. While certainly not an exhaustive list, here are some questions you might ask when looking at implementing a sampled approach to real user monitoring...

Are sessions important to you?

At SpeedCurve, we’re big fans of not only looking at individual page performance, but also looking at performance in the context of a user’s entire visit to the site. This helps us do things like:

correlate various performance metrics with user behavior,
understand the user journey and popular pages so you can better model your synthetic testing, and
identify bottlenecks impacting a site’s business goals.

However, some teams we've worked with want the option to look at individual pages in isolation if they are part of a product team just focused on one area. That’s fine, too!

A lot of performance users really love that they can track business metrics specific to a session alongside their performance data. While real user monitoring tools aren’t typically looked at as a replacement for marketing analytics tools, having session metrics available with your performance data is pretty great for anyone who cares about performance. This way you can easily base your performance budgets on a business outcome, such as a cart conversion or a click on a “call to action” link.

In the case where sessions are important to you, finding a RUM solution (or creating one) that does session-based sampling is key. This does mean that the size of your sample for page views will likely not be exact. At SpeedCurve, we’re good with that and prefer to maintain the integrity of the session vs. sample too aggressively and break them up to stay within a strict page view budget.

Where should you do the sampling?

Another big thing to think about when applying sampling is where you want to sample. Having the ability to sample at the collection point as well as in the client/browser itself provides you with a lot of flexibility. We offer both options at SpeedCurve, but sometimes people aren’t sure which to go with.

In most cases, sampling from the collection endpoint (either through a vendor, your CDN or your origin) gets the job done and is a fine approach. However, for some more sophisticated users who want to be very sensitive to how they are spending their “beacon budget”, you may elect to sample at the client. This allows you to do things like increase the sample rate for an A/B test group for comparison with a control group when the test group is extremely small. Potentially you want to sample based on the location of the user agent, the type of user agent, or even the type of user you are serving content to.

One thing to be careful of: If you do manipulate your sampling to collect more data for specific segments, make sure you are able to exclude these segments appropriately or keep them completely separate when drawing conclusions about the entire population of your users. One great way to do this is to create a separate flag for these groups that you can use to isolate from your greater population. SpeedCurve allows for adding Customer Data using our LUX.addData API.

Having this flexibility is really great, and helps you make sure you are getting the most out of your monitoring budget.

Considerations for determining your sample size

How distributed is your traffic?

We’ve heard great accounts of customers who assume their traffic is 100% regional, only to find they have an entire contingent of users in another country! (Watch this great talk from Harry Roberts at performance.now() to hear such a tale. It’s at minute 10:45, but the entire talk is fantastic, so make sure to watch the whole thing!)

For most sites, they want to understand where their users are. If sampled too heavily, the dataset for these important cohorts of users is often too small to get relevant insight. That said, when you have to make decisions about where to spend your monitoring budget, drawing insights from your primary geographic region in isolation may be warranted.

Are you running experiments in production?

This is a rapidly growing practice that is becoming more and more important for product teams. Whether you’re pushing canary builds or running A/B test variants, you want to ensure that you have enough data in those buckets to factor performance into the decision criteria. We’ve seen sites make business decisions based on performance data that was so inconclusive that it would have been better if it had been ignored.

Are you only interested in a specific type of traffic?

Most organizations want to understand how their site is performing across multiple form factors (desktop/tablet/mobile) as well as browser types. Oftentimes an important subset of users using a technology may represent a significantly smaller user base. Given the massive fragmentation of mobile devices –namely Android – understanding performance is crucial to creating a more inclusive web. Ensuring a proper sample size for these cohorts is crucial and often hard to do if you aren’t paying attention. That said, for some teams the focus may be more specifically targeted toward a certain technology or user agent, so a smaller sample for a more popular user agent or device may work just fine.

Do you have a relatively small population of REALLY important users?

Whether you're a retail site, financial services, media or other, you likely have a smaller set of users that you are VERY interested in. Getting an adequate dataset for a retailer when the rate of conversion is really low (say 1-2%) may be really hard to do if your sample size is too small. Similarly, media sites care a lot about registered users on the other side of a metered paywall, but most likely they make up a MUCH smaller segment of the population. These examples exist across all of our sites, and it’s important that we have the ability to represent them in our datasets.

The important thing is to get started

Whether you are considering getting started with RUM or changing the approach you use today, don’t let the decision of how much to sample send you into paralysis. We always advise customers who are unsure to start small, see what you can learn and continue to expand as necessary.

I hope this is helpful and encourage you to check out some of these support articles below or reach out if you have questions. Otherwise, I highly encourage you to get started with a free trial of LUX today!

SpeedCurve | Sampling RUM: When and why it's a good idea

Sampling RUM: When and why it's a good idea

Are sessions important to you?

Where should you do the sampling?

Considerations for determining your sample size

How distributed is your traffic?

Are you running experiments in production?

Are you only interested in a specific type of traffic?

Do you have a relatively small population of REALLY important users?

The important thing is to get started

Recommend

SpeedCurve | JavaScript growth and third parties

SpeedCurve | Measuring Jank and UX

NEW: Exploring RUM sessions

基于 RUM 的前端优化理论与实践 - 性能篇（一）

The RUM Conjecture

Why RUM changes your programming style.

Reorgs with a sip of spiced rum

Sampling RUM: A closer look

NEW! RUM Compare dashboard

About Joyk