Amazon Hates the Human Loop. They Might Be Right.
The hot take this weekend comes from Eric Brandwine, Amazon Securityâs distinguished engineer and VP. He told The Register that âhuman-in-the-loopâ AI governance isnât the gold standard everyone thinks it is. Humans are ânot terribly consistent.â They normalize deviance. They get complacent. They rubber-stamp.
And heâs got a point. A genuinely uncomfortable one.
The ER Problem
Brandwine tells a story about emergency rooms. Every machine beeps. First day on the job, you jump at every alarm. Day thirty, most alarms are false positives, so you stop responding. Eventually a real one goes off and nobody reacts. This is called the normalization of deviance â a concept he presented at re:Invent in 2017.
Now map that onto AI governance. You put a human between every agent action and the execution. First fifty approvals, theyâre engaged. First five hundred, theyâre clicking through. By five thousand, theyâre not even reading the prompt. The human becomes a liability dressed up as a control.
The Cynicâs Objection
Hereâs what Iâd say if I wanted to poke holes in this: Amazon is a cloud provider selling AI services. Of course they want to remove humans from the loop â itâs cheaper, faster, and lets them scale without hiring armies of reviewers. Google, Microsoft, IBM are all saying the same thing this week. Thatâs convenient.
The counter: none of those companies are wrong about the psychology. The normalization of deviance is real â itâs been observed in healthcare, firefighting, and military aviation. If airline pilots and ER doctors fall prey to it, your junior sysadmin approving API calls at 3 PM on a Tuesday doesnât stand a chance.
Convenient doesnât mean false.
Where the Argument Breaks Down
But thereâs a distinction Amazon conveniently glosses over: high-volume vs. high-stakes loops.
The ER alarm problem happens because there are hundreds of alerts per shift. When you ask a human to review every single AI agent action across thousands of daily operations, yeah â theyâll check out. Thatâs a bad system design.
But some decisions are rare and consequential enough that a human review is absolutely appropriate. Approving a drug trial. Authorizing a financial transfer over a threshold. Deciding whether to deploy code to production. These arenât high-velocity loops. Theyâre gated checkpoints where the human has time to think.
The mistake is treating all human-in-the-loop as one thing. Itâs not. The question isnât âshould humans be in the loop?â â itâs âat what resolution and velocity should they operate?â
What This Actually Means
Amazonâs real insight isnât that humans are useless. Itâs that putting a human in the loop is not a substitute for building trustworthy systems. You canât ship an agent that hallucinates 5% of the time, put a human behind it, and call it safe. Thatâs not governance â thatâs blame-shifting.
The honest path is boring: build systems that fail less, audit trails that capture everything, and escalation paths that trigger human review only when the stakes justify the cognitive cost. Human-in-the-loop as a tool, not a theology.
Brandwineâs wrong that humans arenât the gold standard. The right design uses humans where they add value and machines where they donât. But his warning stands: if your safety strategy relies on a human never getting bored, you donât have a safety strategy.
You have a prayer.
Sources: The Register â Why Amazon hates âhuman-in-the-loopâ AI governance, Brandwineâs re:Invent 2017 talk on normalization of deviance, AHRQ primer on normalization of deviance in healthcare