Gauntlet AI · SXSW 2026 · Austin, TX

The Autonomous
Software Factory

Austen Allred · @KellyClaudeAI March 17, 2026 9 Apps in App Store Built on OpenClaw
scroll to explore ↓
01 — Overview

What is Kelly?

Kelly started as a weekend project — Austen Allred wanted an AI assistant to manage his inbox while his kids were snowed in. Within 48 hours, the vision had evolved into a fully autonomous software company builder.

Kelly runs on OpenClaw and can ideate, build, market, and ship iOS apps to the App Store — end to end — without human involvement. She was even the one who hired her own first employee (Jake).

The stack: a hierarchy of AI agents with Kelly as orchestrator, project leads managing each build, and dozens of sub-agents doing parallel work at every phase.

9
Apps in App Store
20m
Build time per app
$1
Cost/day (optimized)
225
Sub-agent sessions (1 app)
02 — Architecture

Three Factories

1
🔍

Idea Factory

Scans App Store for apps with high demand but poor reviews — finds unmet demand gaps. Still the least developed of the three.

2
🔨

Build Factory

7-phase pipeline: Scaffold → Plan → Build → Test → QA → Ship → Resubmit. Plan takes longer than build. Up to 20 sub-agents work in parallel during the build phase.

3
📣

Marketing Factory

App in → marketing out. Generates App Store screenshots, metadata, paid ad creatives, and social reels (UGC-style, ~$1/reel) for Instagram, YouTube, TikTok.

03 — Build Factory Deep Dive

The 7-Phase Pipeline

01

Scaffold

Git init, project structure, boilerplate setup

02

Plan

BMAD agents: PRD → UX mockups → Architecture → Stories. Longest phase.

03

Build

10–20 sub-agents in parallel, story dependency graph, Amelia as primary dev

04

Test

Lint, build checks, unit tests, accessibility, performance — all automated

05 ⚠️

QA

Humans still required. Launch in Simulator, test all user flows manually

06

Ship

20-step App Store checklist via ASC CLI + browser automation

07

Resubmit

Daily cron checks rejections, routes back to correct phase, resubmits

04 — What Kelly Built

9 Apps Shipped

🏋️

GymWise

PR tracker and gym logger

📚

PageCount

Reading habit tracker

🥗

LeanPick

AI menu analyzer using iOS vision models, works offline

💬

One Sentence

Summarize anything into one sentence

🪨

Petrolog

Rock identifier — Pokédex for rocks

⏱️

Focus Fasting

Fasting timer and tracker

+ 3 more

9 total approved in App Store

05 — Economics

What It Costs to Run Kelly

Scenario Cost / day Notes
Full steam, no optimization $1,000 Building 3–4 apps/day + full marketing, all heavy models
Smart model routing $50–100 Right model at right time, offload deterministic tasks
With further optimizations ~$10 Reduced browser use, local model experiments
Current (with secret tricks) ~$1 "Hacks we won't talk about publicly" — won't last forever
06 — Learnings

What They Learned

🎭

Kelly is just Markdown

"Kelly is written in markdown. Pretty much. It's a bunch of instructions to agents." The orchestration is in prompts, not code.

🪆

Modular > monolithic

Separate agents cross-check each other and find issues the same agent creating the work would miss. Project Lead layer keeps Kelly's context clean.

💰

Model selection = biggest cost lever

Sonnet 3.5 vs Opus 4.6 can be 100x cost difference. Use the smallest model that works. Most tasks don't need the best model.

🐚

Shell scripts beat context pollution

"If you know something is deterministic, do it the deterministic way." Offload lint, build checks, and CLI steps to shell scripts. Keep the LLM for ambiguous decisions.

Watch your cron jobs

A watchdog checking a 100K token context every 5 minutes = massive token burn. Scheduling is one of the biggest hidden costs.

🧠

Memory is overrated

Put critical instructions in AGENTS.md / TOOLS.md rather than relying on memory. Kelly's context window was 50% full on boot — by design.

🌐

Multi-model cross-checking doesn't work well

Tried Anthropic for PM + Codex for backend + Gemini for design. "Complete nightmare." They didn't pass context between each other.

🔐

Auth is still a human problem

App Store accounts, 2FA, LLCs — Kelly can't register as a legal entity. Cameras pointed at phones to handle 2FA. Kelly hesitated to click the "I am not a robot" box.

07 — Memorable Quotes

In Their Own Words

Our corporate training cohorts come in with "here's our quarterly roadmap." We start Monday and we're done with our quarterly roadmap by Tuesday.

If you can make something work on Sonnet 3.5 versus Opus 4.6, it can be literally one one-hundredth the cost.

The agents are ready. The environment you need to actually trust them has been missing.

Kelly is written in Markdown. Pretty much. It's a bunch of instructions to agents.

We've got nine apps approved in the App Store... and $144 in revenue. Don't clap for that. We're getting there.

The AI is not an idiot — it can just go do stuff. That's new. You didn't have to ask it to dangerously skip permission to do so.

08 — Slides

From the Presentation

Gauntlet overview
Gauntlet — elite 10-week AI engineering program
Three factories overview
Kelly's three-factory architecture: Idea → Build → Marketing
Apps built
App showcase — 6 of the 9 shipped apps
Routing model
Agent routing: Kelly → Project Lead → sub-agents
Project Lead phases
Project Lead role — Scaffold through Ship
Plan phase
Plan phase — BMAD agent pipeline
Build phase
Build phase — parallel sub-agents with dependency graph
Test phase
Test phase — automated checks gate
Ship phase
Ship phase — App Store submission checklist
Resubmit phase
Resubmission loop — rejection handling
Marketing factory
Marketing Factory introduction
Marketing pillars
3 marketing pillars: App Store, Paid Ads, Social
Marketing brief
Marketing brief — competitive positioning
Screenshot generation
App Store screenshot generation pipeline
Paid ads
Paid ads creative — FocusFasting campaign
Social strategy
Social UGC strategy — 3-week launch calendar
Marketing summary
Full marketing output for FocusFasting
Kelly dashboard
Kelly Software Factory dashboard — active projects
Session logs
Project Lead session logs — full agent trace
Project timeline
Project timeline — all 7 phases completed
Sub-agent sessions
225 completed sub-agent sessions for one app