Script Builder in action: AI extracts hotel rates, generates code, human verifies.

Case Study

Ubio - Automation

Role

Lead Product Designer

Timeline

2025

Team

3 Engineers, Head of Product

From hours to minutes: AI-assisted script building

AI startups were promising pure-LLM web scraping—no scripts, no maintenance, just tell the AI what you want. If they were right, Ubio's entire business model was obsolete.

I led design on a one-month feasibility sprint to explore a different approach: AI generates automation scripts once, humans verify they work, then scripts run at scale without AI. Faster, cheaper, auditable.

The test case was Google Shopping's need to verify discount codes across 10,000+ e-commerce sites—each with their own cookie banners, chatbots, pop-ups, and constantly changing layouts. We had one month to prove the hybrid approach could work.

Impact

0

Weekly Automations

Browser automations running for clients every week

0.00

Transactions in usd

Powered annually for enterprise clients

The Challenge

The Market Was About to Shift

LLM capabilities were accelerating fast. Pure AI automation wasn't production-ready yet — but someone would crack it soon. We had maybe a year before the market shifted underneath us.

But there was a reason no one had figured it out. Running an LLM on every request is expensive. And running Playwright at scale isn't free either — it takes serious infrastructure we'd spent years building.

You need considerable bending and twisting to extract a website's information... and then it breaks the next time."
— Boris Okunskiy, CTO

Key Constraints

LLM tokens on every execution — bankrupting at 250M runs/week

Playwright at scale needs Chromium orchestration, proxy rotation, hardware wrangling

10,000+ e-commerce sites, each with cookie banners, chatbots, pop-ups

One month to prove the hybrid approach could work

Discovery

Working at AI speed

I developed three techniques to move as fast as the technology I was designing for:

1. System diagrams from conversations
Instead of starting with boxes and arrows, I talked through the architecture with Claude. The diagrams emerged from understanding, not assumptions.

2. Documentation from conversations
Technical conversations with the CTO became searchable documentation. Every decision had context preserved.

3. Prototypes from conversations
Using Claude's HTML artefacts, I could go from concept to interactive prototype in a single session. Iteration happened in hours, not days.

The tools
Claude Projects: Maintained context across the entire sprint
Miro: Architecture mapping, agent documentation, storyboards
Figma Make: Working prototypes for testing ideas

The pivot
I originally designed for full autonomous AI. The system would just handle everything. It didn't work.

The AI made confident mistakes. Users couldn't trust output they didn't verify.

I restarted the design process twice, each time adding more points where humans could intervene, verify, and correct.

structure

The multi-agent architecture

Under the hood, five specialized agents work together:

Agent

Role

Planner

Breaks down goals into steps

Executor

Runs automation steps

Analyser

Interprets page content

Matcher

Validates expected vs actual

Helper

Handles interruptions

Users don't need to know any of this. The UI hides the complexity.

Five specialized agents coordinate behind a simple three-panel interface.

The mental model: Step composition

Users are building a reusable script, not chatting with an AI. Every step becomes code. Every decision gets saved. The conversation is just scaffolding—the script is what ships.

Design

From AI sketch to working prototype

I had the theory. I had the intervention points mapped. But the first attempt was noisy — agents talking too much, unclear separation between activity and artifact.

Early prototype build out of Claude HTML artefacts: informative but overwhelming

What clicked
The challenge with this project was that it was strongly dependent on getting the technology working and producing good results. Only when the tech stabilised I was able to start nailing the interaction elements.

The script is the product, not the conversation

Activity should be scannable, not readable

Activity feed, browser preview, script panel. The script becomes the destination.

Refined design
The feasibility sprint used promo code validation. But the same architecture handles hotel price scraping — Ubio's core business. One YAML file defines a workflow. The UI doesn't change.

How I built this
The final prototype took 1.5 days using Claude and Cursor — no Figma as a starting point. Markdown in, working software out.

Article: The One Folder →
Context Engineering for Designers.

Next Case Study

Sumup