Who Decides What's Critical?

Every autonomous system promises a human in the loop. The question no one asks is who gets to draw it.

Jun 23, 2026

One hundred and forty-seven to three.

That is the number I keep coming back to. It sat on the command dashboard of an autonomous business platform I was reviewing this week. 147 tasks completed today. 3 awaiting your approval. The banner above it read, with no apparent irony, total visibility.

I look at systems like this for a living. Operators bring me the agentic platform they are about to switch on, and I trace one thing through it: what happens to human judgment once the system is running. The marketing is always about speed and headcount. The thing worth examining is always the same, and it is rarely on the slide.

This particular product is built to run a company. Ten specialist agents, one for finance, one for legal, one for sales, one even labeled CEO and strategy, coordinated by an overseer that, in the company’s own words, translates your goals into operating plans, calls the tools, files the paperwork, and sends the reports. It runs around the clock. The promise is that you can run an entire company while you sleep, and stop managing people so you can start directing outcomes.

I am not here to dunk on the vendor. The product is well made and the category is real. I am here for the safety net, because it is identical across nearly every autonomous system I see, and it does not hold the weight buyers put on it.

The safety net is this: the system escalates to you only when it matters, and surfaces only the decisions that genuinely need you. Anything consequential pauses at a checkpoint. One tap to approve, redirect, or take over. You stay at the helm.

That is the line that sells. It is also the line that should stop you.

The variable nobody inspects

“A human in the loop for critical decisions” carries a hidden term, and it is the only one that matters: who defines critical?

Read the promise again and the answer is already sitting inside it. The system escalates what matters. The system surfaces what genuinely needs you. The system decides. You are not reviewing the decisions your business made today. You are reviewing the ones it chose to hand up. Everything below that line already happened, in your name, while the dashboard assured you that you had total visibility.

Go back to 147 and 3. The business took 147 actions and asked you about three. The other 144 were not hidden from you, exactly. They were classified as beneath you. Somewhere in that feed, the quarterly pricing review was marked complete. The item that did earn a checkpoint was a vendor contract renewal. Maybe that is the right split. The point is that you did not make it. The overseer made it, and it will make the same kind of call 147 times tomorrow, and a few thousand times by the end of the quarter.

That is the quiet move. Control was the word in the brochure. What I was actually looking at was a feed.

You consented once, in the abstract

The timing compounds it. You brief the system once. You set goals and guardrails in plain language and configure an auto, ask, or block policy for each tool. Then the agents run.

Which means your judgment was collected up front, as categories, before a single real situation existed. That is consent, not oversight. Judgment is not a preference you select in advance. It is what you do when a specific decision is in front of you, in its context, with what you know in that moment. A policy written at setup cannot do that. And the checkpoint queue that follows is mostly the work of clearing items the system pre-sorted, at the pace the system sets, framed by the information the system chose to attach. The button is real. The deliberation it implies left at brief time.

The layer underneath the checkpoint

There is a second problem that no queue can reach. Ten agents coordinated by an overseer do not only act, they interact. Finance feeds pricing, pricing shapes outreach, outreach moves pipeline, pipeline loops back to finance. Outcomes emerge from the coordination that no single agent decides and no checkpoint is built to catch, because a checkpoint sits on one action, not on the relationship between many. This is not a sci-fi warning. It is the ordinary behavior of coupled systems, and it lives exactly where the human is not looking.

And consider what the overseer is doing while you sleep. Its own description says it reviews all output. An AI grades the work of the other agents and then decides what reaches you. So the checkpoint you were handed sits outside a loop in which AI is already checking AI. You are not in the loop. You are downstream of it, holding the three items it elected to show you.

Watch what the premium tiers sell

The most honest page on the site is the pricing page. The upper plans advertise compliance autopilot and advanced approval automation. Sit with that second phrase. The product is now automating the approval step itself, the one human function the entire safety net was built around. When approval is automated, the checkpoint stops being a control. It becomes a notification, and notifications can be silenced.

The price for all of it runs from fifteen to forty dollars a month, marketed to solo founders and small operators. The people most drawn to full autonomy at streaming-subscription prices are precisely the ones least likely to have any governance of their own. Cheap autonomy, expensive accountability, sold to the buyers least equipped to tell which one they are paying for.

What you are actually buying

The selling logic treats management as overhead, a drag that automation removes. That is the wrong frame, and it is the costly one. You did not eliminate the management burden. You relocated it. It used to live with people, who are visible and correctable. Now it lives inside a system, which is opaque and compounding. The work of managing did not shrink. It moved to where you cannot see it, and then a banner told you that you could see everything.

So the founder who bought “stop managing people” did not buy a scaling solution. They have purchased a control problem in a scaling solution’s packaging, and they will not feel the weight of it until something authorized and wrong has already traveled through a few hundred actions nobody read.

The one question that turns a notification back into a control

A human in the loop becomes real governance under exactly one condition: a named human, not the vendor’s overseer, owns the threshold for what escalates. That person decides what counts as critical, can see what runs beneath the line, and can move the line when it is wrong.

So before you switch any of these systems on, ask the question the demo will not volunteer: who defines critical, and can I change it? If the definition belongs to the system, you do not have a checkpoint. You have a feed with a nicer font.

The trophy in this cycle is supposed to be ten million dollars in revenue with three people. The real achievement is ten million with three people and a control layer you actually understand. One is a headline. The other is a company that can survive its own automation.

AI operates. You own the decision. As long as you are still the one who decides which decisions are yours.

Who defines critical in your systems, and can you change it? Tell me in the comments.

The Evolving Mindset goes out every week on where AI actually breaks and who is left holding it. Subscribe and the next one finds you.

This is the question my work begins with. When an organization switches these systems on and no named human owns the threshold, that gap is what I help close.

Schedule a conversation

Discussion about this post

Ready for more?