Escalations and the specialist queue: planning the second contact
The contact your model forgot
A single inbound contact rarely stays single. A share of them escalate — to a senior agent, a technical specialist, a back-office team, a manager — and that escalation is a whole second piece of work that most staffing models never count. They size the front line against arriving demand, and the specialist tier gets whatever people are left over, planned by feel. The result is a familiar pattern: the front line hits its target while the escalation queue quietly backs up, and the customers who get stuck there are precisely the ones with the hardest, highest-risk problems.
Escalations are demand, generated internally
The mental shift is to treat escalations as a demand stream in their own right — one your own operation generates. If a tenth of front-line contacts escalate, then front-line volume is also a forecast of specialist volume, on a delay. That tier has its own arrival pattern (it lags and smooths the front line), its own handle time (usually longer), and its own, much smaller pool — which makes it fragile, because small pools feel the power of one acutely: lose one specialist to leave or sickness and the queue’s service falls off a cliff. And the escalation rate isn’t fixed; it rises when the front line is under-skilled, under-supported or rushed, so a cut to front-line training or a too-wide span shows up a week later as a swamped specialist queue.
Planning the tier
Forecast the escalation stream from front-line volume and the escalation rate, then staff the specialist pool to its demand with its own (longer, more variable) handle time — not as a rounding error on the main plan. Because the pool is small, respect the power of one: build in more relative headroom than you would for a large team, and cross-train a reserve so a single absence doesn’t sink the queue. Give it a service measure that reflects the stakes, since escalated contacts are usually the high-risk ones. And watch the escalation rate as a leading indicator: when it climbs, the problem is often upstream — front-line skills, support or pace — and the cheapest fix is to stop generating the escalations, not to keep enlarging the team that absorbs them.
Try the escalation pool sizer, then pair this with the power of one, multi-skill scheduling, and managing non-real-time work.