Measuring accuracy honestly
Slides done? Here’s the same idea in a bit more depth — the part worth keeping.
In depth: score it fairly or tune it wrong
“The forecast was good” is an opinion, not a measurement — and you can’t improve what you don’t measure. Worse, how you measure changes what you optimise for, so the wrong accuracy metric will have you cheerfully tuning the forecast in the wrong direction. The raw material is simple: error is forecast minus actual, per interval. The whole craft is in how you summarise thousands of those into a number that means something, without letting the tiny intervals or the cancelling signs mislead you.
MAPE flatters the wrong intervals; WAPE doesn’t
MAPE — the average absolute percentage error — is intuitive and widely quoted, but it has a nasty flaw in a contact centre: it treats every interval as equally important. Being five contacts short when ten were expected is a 50% error; being fifty short when two thousand were expected is 2.5%. MAPE makes the quiet 8am interval count for far more than the busy midday one — precisely backwards from what staffing cares about. WAPE fixes this by summing the absolute errors and dividing by total actual volume, so every interval is weighted by how busy it was. For most planning, WAPE is the fairer headline number.
Bias is the error that actually hurts
Both MAPE and WAPE take the absolute error, so neither can see direction — and direction is the thing that does the damage. Bias keeps the sign: it’s the average of forecast minus actual, and it answers whether you’re consistently over- or under-forecasting. Two forecasts can share an identical WAPE and do very different harm. Random scatter around zero roughly nets off across a week — some intervals over, some under. A persistent lean does not: a forecast that’s always short under-staffs every interval, every day, draining service level and burning agents with no relief. So track WAPE and bias side by side, measure them at the grain you actually staff to rather than the monthly total that flatters you, and remember a metric you don’t act on is just decoration.
The principle to remember: WAPE for size, bias for direction, both at staffing grain — then act on what they tell you. A forecast that’s always short hurts far more than one that’s simply noisy.
Quick quiz
Five questions. Pick an answer to each, then check your score.
1. What’s the main flaw of MAPE in a contact centre?
A small absolute miss on a low-volume interval is a huge percentage, so MAPE over-weights the quiet intervals.
2. How is WAPE different?
WAPE divides total absolute error by total volume — the intervals that matter for staffing count most.
3. What does bias measure that MAPE and WAPE can’t?
MAPE and WAPE use absolute error, so they lose the sign. Bias keeps it.
4. Why is a biased forecast worse than a noisy one with the same WAPE?
Random errors roughly net off; a consistent lean hits every interval the same way and compounds.
5. At what level should you measure accuracy?
Monthly totals hide interval errors. Score at the grain you act on, not the one that flatters.