Roadmap of experiments: how many times and on which media?

Inspiration

MMM is designed to guide optimal advertising investment decisions, but MMM has its duality. One side of the goals of MMM is convincing explanation to customers, and another side is a better estimating of ROI. When we are calibrating with results from experiments, we often get two questions.

On which media?：“Which media should we conduct experiments on? We do a lot of experiments on facebook. Is it enough?"
How many times?：“How many experiments should we conduct on instagram?”

We try to adress these two questions by conducting analysis with synthetic data.

What it does

We add an implementation of realistic data generation processes to the siMMMulator, which is an open source R-package for generating dataset for MMM.
We conduct experiments on multiple media make Robyn's ROI estimation much better.
We conduct multi-time experiment on each media make Robyn's ROI estimation accurate.

How we built it

Preparation：Add implementations in siMMMulator

We add implementations to siMMMulator.
- Correlation between media spend and organic sales.
- Correlation between media spends.
- Cyclic seasonality and events.
See Github repository in detail.

Experiment #1：Variety of experiments

Settings

Goal
- Prove that calibration improves the accuracy of causal effect estimation.
Number of media
- 2 media（TV and Facebook）
Comparison
- No calibration（None）
- Calibration with single media（TV | Facebook）
- Calibration with All media（TV & Facebook）
Evaluation metrics
- Average of absolute percentage error of ROI.
Assumption:
- No estimating error exists.

Results

Calibrating two media achieve smallest MAPE of ROI.
Even though calibration doesn't cover all of media, MAPE of ROI decrease.
it is better that conduct experiments on as much media as possible.

Experiment #2：Number of experiments

Settings

Goal
- Prove that the calibration with numerous RCTs improves the accuracy of causal effect estimation.
Comparison
- Number of RCT results used for calibration (1, 5, 10, 50, 260)
Evaluation metrics
- MAPE of ROI

Results

Multi-time experiment on each media make Robyn's ROI estimation accurate.

Challenges we ran into

Creating realistic data generation assumptions.
Understanding the internal structure (optimization, various parameters) from Robyn code around calibration.

Accomplishments that we're proud of

We show bellow by conducting analysis with synthetic data.
- 1. Experiments on multiple media make Robyn's ROI estimation much better
- 2. Multi-time experiment on each media make Robyn's ROI estimation accurate
In order to verify the accuracy of ROI estimation using generated data close to realistic settings, , we add an implementation to the siMMMulator, which is an open source R-package for generating dataset for MMM.

What we learned

In the planning phase of marketing, we just plan the experiment roadmap.
If we can conduct experiments on all media, it's the best. If cannot, it is better that cover as much media as possible.
If we can conduct experiments as many as possible on each media, that makes our MMM results more accurate.

What's next?

Theoretical guarantees.
Implementation of data generation assumptions for complex relationships between organic sales and media spends, etc.
- Reverse causality：Higher sales allow for a larger budget allocation.
- Limited range of data：MMM model is fitted to limited historical data and expected to provide insights outside the scope of this data. （e.g., Wide range ad spend）
- Funnel effects [Angrist and Krueger, 1999]：An ad channel also impacts the level of another ad channel. That will lead to biased estimates.
Cost-optimal design of number of experiments.

Built With

python
r
robyn
simmmulator

Updates

Ryosuke Hyodo started this project — Nov 09, 2022 10:39 AM EST

Leave feedback in the comments!

Log in or sign up for Devpost to join the conversation.