Incrementality

Geo lift test design checklist

Geo lift tests are useful when advertising, retail distribution, pricing, or operations change at a market level. They can also create confident-looking results from weak comparisons if markets are chosen because they are convenient instead of comparable.

The goal is not to make every market identical. The goal is to show that the treatment markets and comparison markets would have moved similarly if the campaign had not changed.

Advertisement In-article programmatic unit.

When a geo test fits

Good fitWeak fitDesign implication
Media can be bought or withheld by market.The campaign spills heavily across market borders.Use larger market groups, exclude border areas, or choose another design.
Outcomes are observed consistently by location.Sales, leads, or visits are missing for some regions.Fix measurement coverage before assigning treatment.
Enough pre-period history exists to check trend similarity.The product, price, or tracking system just changed.Delay the test or treat the result as directional only.
Markets are not being changed by unrelated launches.A store rollout, promo, supply issue, or pricing change overlaps the test.Remove affected markets or pre-register adjustment rules.

The minimum design brief

Before spend moves, write a short brief that can survive a skeptical readout. It should state the business decision, the test markets, the control markets, the treatment period, the primary outcome, the expected lag, and the rule for calling the result useful.

Decision

Name the decision the result will change. A test for national budget expansion needs a different readout than a test for a local launch playbook.

Estimand

State the lift being estimated: incremental sales, leads, visits, subscriptions, qualified accounts, or another primary outcome during a defined window.

Assignment

Explain why each market is in treatment or control. Avoid letting the highest-opportunity markets become treatment simply because the campaign team wants them most.

Guardrails

List changes that would invalidate or downgrade the readout: inventory breaks, channel outages, price changes, competitor shocks, tracking changes, or major weather disruptions.

Pre-period checks

CheckWhat to inspectRed flag
Level balanceAverage outcomes, spend, customer base, stores, distribution, and seasonality.Treatment markets are much larger, richer, or more mature than controls.
Trend balanceWeekly or daily movement before the campaign.Treatment is already accelerating faster before media changes.
VolatilityNoise in the primary outcome and any extreme weeks.A few markets swing enough to dominate the readout.
CovariatesPrice, promotions, stock, distribution, local events, holidays, and competitor pressure.Important drivers are only tracked after the test starts.
Placebo timingApply the proposed method to an earlier period with no treatment.The method finds lift where no campaign change happened.
Advertisement Lower in-article unit.

Analysis choices to lock early

A credible geo readout is usually won before the campaign starts. Lock the outcome, exclusion rules, matching approach, covariates, readout date, and uncertainty treatment before anyone can see whether the answer looks favorable.

  • Choose one primary outcome and keep secondary metrics clearly labeled.
  • Use pre-period performance to create controls, but do not tune the control group after seeing test-period lift.
  • Show absolute lift, percentage lift, confidence or credible intervals, and the implied business value.
  • Report market-level variation instead of hiding it behind one average.
  • Run sensitivity checks with obvious outliers removed and explain whether the conclusion changes.
  • Separate media effect from distribution, pricing, promotion, and product availability changes.

Readout language that stays honest

Weak phrasingStronger phrasing
The campaign drove a 12% sales lift.In the tested markets, observed sales were 12% above the matched comparison estimate during the readout window.
The test proves this channel works nationally.The test supports expansion if future markets have similar demand, media delivery, and operating conditions.
The control group was similar in size.The control group had similar pre-period levels, trends, volatility, and tracked business drivers.
The result was positive, so the test passed.The result cleared the pre-set decision threshold after uncertainty, costs, and operational constraints were considered.

Takeaway

A geo lift test is a comparison discipline, not just a map with shaded regions. The central question is whether the control markets make a believable estimate of what would have happened anyway.