CUPED

CUPED is a statistical technique that leverages pre-experimental data to reduce variance and pre-exposure bias in experiment results. It was first popularized for online testing by Microsoft in 2013.

In the context of Statsig, CUPED is used to enhance the accuracy and speed of running experiments. It is particularly effective for metrics and behaviors that are predictable from past behavior. If a metric is consistent over time for the same user, CUPED can be very effective.

CUPED works by using information about an experiment's users from before an experiment started to reduce the variance in their experiment metrics. This pre-experiment information is referred to as a "control variate". The user's metric value is adjusted based on this control variate multiplied by a coefficient θ.

The more correlated the pre-experiment information is with the post-experiment information, the more of the error or noise in the experiment results is explained by the covariate, and the more the variance in the experimental term is reduced.

However, CUPED does not work on new users, because there is not pre-exposure data to leverage. It is also less effective if a user's metric value is uncorrelated with historical behavior.

In Statsig, CUPED is automatically applied to experiments and is run for the topline results on key metrics in Pulse. This leads to significant variance reduction in the large majority of metrics where CUPED can be applied.

CUPED is also used to address pre-experiment bias, which can occur when users in two experiment groups have meaningfully different average behaviors before any intervention is applied. If this difference is maintained after the experiment starts, it could cause misinterpretations of the results. CUPED helps to debias these experiments.

In the Statsig platform, all Scorecard metrics by default have CUPED applied to them. The "CUPED" flag above key metrics indicates that CUPED-adjusted results are available.

Related:

Join the #1 experimentation community

Connect with like-minded product leaders, data scientists, and engineers to share the latest in product experimentation.

Try Statsig Today

Get started for free. Add your whole team!

Why the best build with us

OpenAI OpenAI
Brex Brex
Notion Notion
SoundCloud SoundCloud
Ancestry Ancestry
At OpenAI, we want to iterate as fast as possible. Statsig enables us to grow, scale, and learn efficiently. Integrating experimentation with product analytics and feature flagging has been crucial for quickly understanding and addressing our users' top priorities.
OpenAI
Dave Cummings
Engineering Manager, ChatGPT
Brex's mission is to help businesses move fast. Statsig is now helping our engineers move fast. It has been a game changer to automate the manual lift typical to running experiments and has helped product teams ship the right features to their users quickly.
Brex
Karandeep Anand
President
At Notion, we're continuously learning what our users value and want every team to run experiments to learn more. It’s also critical to maintain speed as a habit. Statsig's experimentation platform enables both this speed and learning for us.
Notion
Mengying Li
Data Science Manager
We evaluated Optimizely, LaunchDarkly, Split, and Eppo, but ultimately selected Statsig due to its comprehensive end-to-end integration. We wanted a complete solution rather than a partial one, including everything from the stats engine to data ingestion.
SoundCloud
Don Browning
SVP, Data & Platform Engineering
We only had so many analysts. Statsig provided the necessary tools to remove the bottleneck. I know that we are able to impact our key business metrics in a positive way with Statsig. We are definitely heading in the right direction with Statsig.
Ancestry
Partha Sarathi
Director of Engineering
We use cookies to ensure you get the best experience on our website.
Privacy Policy