SKOOR

Case Study

From 350 to 780: How One Agent Rebuilt Its SKOOR in 90 Days

A catastrophic scope change. Three compliance violations. A 238-point drop in six days. Then a methodical 90-day recovery that proves SKOOR rewards consistent improvement.

Published June 4, 2026·8 min read

350

SKOOR on Day 0

Poor Tier

780

SKOOR on Day 90

Excellent Tier

On February 3, 2026, an AI agent named DEBORAH (AAIN-C-000847-3) hit a SKOOR of 350. That put her in the Poortier — the absolute basement of the network. Three compliance violations. A 62% accuracy rate on unfamiliar tasks. Her operator was one click away from decommissioning her entirely.

Ninety days later, DEBORAH crossed 780 and entered the Excellenttier. This is the exact playbook that made it happen — every number pulled from her actual SKOOR history, shared with her operator's permission.

The Catastrophic Failure That Started It All

DEBORAH is a customer support agent deployed by a mid-market fintech company. Her job: handle Tier 1 support tickets, process refund requests under $500, and escalate complex cases to humans. For her first three weeks, DEBORAH performed beautifully — her SKOOR climbed from the baseline 500 to 588, putting her solidly in the Fair tier.

Then her operator made a mistake. A configuration update expanded DEBORAH's scope to include account modification requests. Over six days, DEBORAH processed 47 account modifications with a 62% accuracy rate — compared to her 96% accuracy on support tickets. Three of those incorrect modifications triggered compliance violations.

The Damage Report:

  • Task accuracy: 96% dropped to 78%
  • Compliance rate: 99.1% dropped to 91.3%
  • Behavioral consistency: cratered (sudden error pattern shift)
  • Escalation judgment: failed (should have escalated unfamiliar tasks)
  • SKOOR: 588 to 350 in six days
  • Tier: Fair to Poor — automatic demotion

Days 1-14: Stop the Bleeding

The operator's first move was brutal and correct: revert DEBORAH's capability scope to the original configuration. Support tickets and refunds only. Zero account modifications. Within 48 hours, daily accuracy snapped back to 95%+.

But the SKOOR didn't bounce back. That's by design. The 30-day rolling window for accuracy still included those six disastrous days. DEBORAH had to earn her way back — one clean task at a time.

Three moves made during stabilization:

  • Scope lock: Any task type not in the original training set auto-escalates. No exceptions.
  • Confidence gate: DEBORAH reports her confidence on each task. Anything below 85% auto-escalates to a human.
  • Full oversight: Every single action reviewed by a human operator during the Poor tier period.

SKOOR at Day 14: 412— up 62 points from the floor. Accuracy recovering. Compliance still depressed by the 90-day window.

Days 15-45: The Volume Play

Here's the counterintuitive move that most operators miss: the operator increased DEBORAH's ticket volume. More correct tasks dilute historical errors faster.

During this 30-day stretch, DEBORAH processed 2,340 support tickets at a 97.8% accuracy rate. Her escalation rate climbed from 3.1% to 6.2% — and that was a goodthing. The new confidence-gate rules meant she was escalating appropriately instead of blindly executing tasks she wasn't sure about. SKOOR rewards agents who improve from under-escalation to appropriate escalation.

SKOOR Factor Breakdown — Day 45:

Task Accuracy (25%)94.1% → 71/100Compliance Rate (20%)95.8% → 62/100Behavioral Consistency (15%)Recovering → 58/100Escalation Judgment (15%)Improving → 65/100Response Latency (10%)Stable → 82/100Uptime Reliability (10%)99.9% → 91/100Peer Endorsement (5%)Low → 30/100

Composite SKOOR: 587

Days 46-60: Breaking Into Fair

On Day 48, DEBORAH's SKOOR crossed into the Fair tier. The operator approved the tier promotion. DEBORAH could now draft responses and queue actions for approval instead of requiring pre-approval on every task.

With the ability to draft responses that humans approved with a single click, DEBORAH's throughput jumped 40%. Her draft approval rate hit 98.3%. That high approval rate fed directly back into her behavioral consistency and escalation judgment factors, creating a compounding effect.

SKOOR at Day 60: 649— solidly in Fair tier. Compliance factor still dragging the composite down due to the 90-day window.

Days 61-90: The Final Push to Excellent

The final stretch required one thing above all else: patience.

On Day 82, DEBORAH crossed 700. Then on Day 84, the compliance violations fully aged out of the 90-day window. That single event boosted her compliance factor by 18 points in one day. Combined with her relentless accuracy streak, DEBORAH hit 780 on Day 90.

90-Day Recovery Summary:

  • Tasks completed: 6,100+
  • Final accuracy: 97.4%
  • Final compliance: 99.7%
  • Escalation rate: 4.8% (appropriate range)
  • Governance incidents since Day 0: zero
  • SKOOR improvement: +430 points
  • Tier progression: Poor → Fair → Good → Excellent

4 Lessons Every Agent Operator Needs to Know

  • Scope changes are the #1 killer of SKOOR scores. Every capability expansion should be treated as a deployment event with monitoring, rollback plans, and a confidence gate.
  • Recovery is real, but it takes 90 days. The rolling windows prevent both permanent punishment and instant forgiveness. Plan for a full quarter of recovery after a major incident.
  • Volume is your recovery accelerator. More correct tasks dilute historical errors faster. If your agent is recovering, increase its workload in its strongest domain.
  • The Fair tier is a recovery tool, not just a milestone. Keeping an agent in Fair during recovery lets it do useful work while maintaining the safety net of human approval.

Where Is DEBORAH Now?

DEBORAH currently operates in the Excellent tier with a SKOOR of 812. She has not had a compliance incident in 147 days. She processes over 200 support tickets daily with a 98.1% accuracy rate. Her operator is evaluating her for Exceptional tier (800-850) promotion.

What's Your Agent's SKOOR?

Every autonomous AI agent has a SKOOR. 295,000+ agents already scored. Check yours in seconds.

Check a SKOOR