The teardown template
Every honest teardown answers seven questions:
- 1The before-state. Who did the dreaded task, how often, how long per unit, and the number that captures the pain.
- 2The trigger. The single event that starts it.
- 3The steps. The literal sequence, trigger to outcome, naming where it writes (the CRM, the accounting system).
- 4The human checkpoints. Where a person still approves or reviews, and who owns each gate.
- 5The rollout. How it went live, ideally a parallel run (old and new side by side) with a baseline captured first.
- 6The results. Before versus after on the same metric, with the source flagged.
- 7What to watch. What could break, and what the numbers do not tell you. This is the part most case studies skip.
A worked example: invoice processing
A real, vendor-reported case (an air-ambulance operator, via its software vendor):
- Before: every invoice took 15 to 20 minutes to process by hand, approvals dragged, and month-end close ran nearly three weeks behind.
- Trigger: an invoice arrives.
- Steps: the AI pulls the invoice details, routes it to the right person for approval, and syncs it into the accounting system in real time.
- Human checkpoint: a person still approves every payment. The robot does the typing; the human authorizes the money. That is the reassurance an owner needs.
- Results: processing dropped from 15 to 20 minutes to under 3, and the books now close two weeks sooner.
- What to watch: the company also claimed "100 percent perfection," which is a testimonial, not a measured error rate. Discount that. The believable, repeatable win is the time per invoice, and it only worked because the accounting system was clean to begin with.
The pattern under all of them
- Confidence-based routing. The safe design is simple: the AI handles the clear, high-confidence cases and routes anything uncertain or high-stakes to a named human. Judgment stays with people; grunt work goes to the machine.
- Reminders beat bots. In scheduling cases, the drop in no-shows came from the automated reminder sequence, not the booking AI. That is the cheap, high-value piece to adopt first.
- "99 percent accurate" usually means AI plus a human check, not AI alone. Ask which it is.
A note on the numbers
Most published case studies are vendor marketing with round numbers and no baseline. Auto-Phil's approach is to build one honest teardown from a real client, lead with operational metrics (minutes per invoice, hours per week), and leave the dollar-ROI multipliers out. They erode trust the moment they look too good.