Selection bias happens whenever we draw a conclusion based on a skewed data sample, which hinders the validity of such a conclusion. It is quite tricky to spot, but once you’re trained to see it, you start seeing it everywhere.
A typical example involves mobile apps for e-commerce brands. Whenever an e-commerce team launches a mobile app, it finds that the conversion rate for customers shopping through the app is astronomically higher than that for customers shopping on the web. As a result, the brand concludes that the mobile app is causing customers to convert more, and they double down on the app.
In most cases, however, customers who download and shop through a brand’s mobile app are already converting more, regardless of their shopping channel. In other words, the mobile app is taking undue credit for an effect it never created (higher CVRs).
Loyalty programs suffer from similar problems, as do most business initiatives where the target audience is not representative of our business’s general population—or where one or more parties have an interest in skewing the truth.
In the mobile app example, there are two ways we can determine whether the higher conversion rate is coming from the app itself or selection bias: