# Superbar Clone — Engineering Plan **Codename:** `pitchbar` (replace with your name) **Stack:** Laravel 13 (Octane + Reverb + Horizon) · Inertia v3 + React 19 + shadcn/ui + Tailwind v4 (Vite) · Fortify (auth) + Wayfinder (typed routes) · Postgres · Redis · **Cloudflare** (Workers AI + Vectorize + Browser Rendering + R2) — OpenAI/Qdrant/Browserless retained as swappable fallbacks · Laravel Cloud **Author:** Generated from product analysis · 2026-05-02 **How to use this document:** Each "Work Unit" below is scoped to be one-shottable by Claude Code in a single session. Build them in order. Don't skip the **Acceptance Criteria** — they catch silent breakage. --- ## Table of Contents 1. [Decisions Locked In](#1-decisions-locked-in) 2. [Final Architecture](#2-final-architecture) 3. [Latency Budget (the contract)](#3-latency-budget-the-contract) 4. [Repository Layout](#4-repository-layout) 5. [Data Model](#5-data-model) 6. [API Contract Summary](#6-api-contract-summary) 7. [Hot Path Specification](#7-hot-path-specification) 8. [Phases & Work Units](#8-phases--work-units) 9. [Testing Strategy](#9-testing-strategy) 10. [Laravel Cloud Deployment](#10-laravel-cloud-deployment) 11. [Launch Checklist](#11-launch-checklist) 12. [Explicitly Deferred](#12-explicitly-deferred) --- ## 1. Decisions Locked In | Area | Choice | Why | |---|---|---| | Backend framework | Laravel 13 + Octane (FrankenPHP) | Sub-ms framework overhead on hot path | | Realtime | Laravel Reverb (native WS) | First-class on Laravel Cloud, no Pusher dependency | | Queue | Horizon + Redis | Standard, observable, autoscaled on Cloud | | Frontend (admin) | Inertia v3 + React 19 + shadcn/ui + Tailwind v4 (Vite) | Server-rendered routing via Inertia; no separate SPA. shadcn/Radix primitives + Tailwind v4 utilities replace Mantine. | | Typed routes / actions | Wayfinder | Generates typed TS functions for Laravel routes/controllers; replaces hand-rolled API client + OpenAPI | | Frontend (widget) | React 18 + Vite + Preact-compat (smaller bundle) + Tailwind v4 (selective) | Separate Vite build under `resources/widget/`, ≤50KB gzip target. Does NOT share build target with admin. | | Database | Postgres 16 (Laravel Cloud managed) | RLS + JSONB + `pgvector` extension as fallback | | Cache / sessions / queue | Redis 7 (Laravel Cloud managed) | One backing store for three jobs | | Vector store | **Cloudflare Vectorize** (preferred) — Qdrant retained as fallback | Single Cloudflare bill covers vector + LLM + browser; Vectorize handles our query patterns | | LLM | **Cloudflare Workers AI** (Llama 3.3 70B chat + `bge-base-en-v1.5` embed) — OpenAI retained as fallback | OpenAI-compatible REST surface; one bill via Workers Paid; provider swap is one env var | | Crawler | **Cloudflare Browser Rendering** (preferred) — falls back to Browserless, then plain HTTP | Same one Cloudflare bill; plain HTTP gets you to "free" for testing | | Auth | Fortify (admin sessions) + signed JWT (widget) | Fortify drives login/register/2FA/password reset routes; Inertia consumes them. Sanctum issues API tokens for non-cookie clients. JWT for widget with short TTL. | | Billing | Stripe + Laravel Cashier with metered billing | Conversation = the metered unit | | Hosting | Laravel Cloud | Octane/Reverb/Horizon native, managed PG+Redis | | Observability | Laravel Cloud logs + Sentry + OpenTelemetry → Honeycomb (free tier) | Per-turn traces are non-negotiable | | File storage | S3-compatible (Cloudflare R2 — cheap egress) | PDFs, transcripts | | Region | `us-east` (matches OpenAI primary) | LLM call dominates latency | **Lock these in your `.env.example` and `composer.json` engines block before writing any code.** --- ## 2. Final Architecture ``` ┌──────────────────────────── BROWSER ────────────────────────────┐ │ Customer Site (host) Admin (Inertia) Marketing Site │ │ ├ ``` The loader reads `data-agent-id`, calls `/v1/widget/init`, mounts after init resolves. **Reveal logic.** Match Superbar's pattern: opacity 0 → 1 transition once `window.scrollY + window.innerHeight > document.body.scrollHeight * 0.9`. **Eagerly open the WS** when the bar reveals — not when the user types — to save TTFT. **Streaming receive.** Subscribe to `private-conversation.{id}`. On `TokenStreamed` event, append to the in-progress assistant message. On `TurnCompleted`, finalize, render citations. **Accessibility.** Composer is a real `

` with `aria-label="Ask anything"`. Messages have `role="log"`. Submit on Enter, Shift+Enter for newline.

**Acceptance criteria.**
- Bundle gzip size ≤ 50KB (CI check)
- Widget renders inside Shadow DOM; host page CSS does not leak in (test with a host page that has `* { color: red !important }`)
- Streaming is visible token-by-token
- `aria` attributes pass an axe-core scan

---

### Phase 6 — Triggers & CTAs (2 units)

#### WU-20: Behavior trigger engine

**Goal.** Client-side detectors fire events; the widget either auto-opens with a message or surfaces a CTA based on rules.

**Files.**
- `resources/widget/src/triggers/{exitIntent.ts,idle.ts,scroll.ts,time.ts,returning.ts,utm.ts}.ts`
- `resources/widget/src/triggers/engine.ts` — coordinates; respects per-session cooldowns (don't fire same trigger twice)
- Backend: `app/Services/Triggers/TriggerEngine.php` — server-side rule evaluator, called during `/v1/widget/init` to send the active rule set to the widget
- `app/Services/Triggers/CtaSelector.php` — selects a CTA based on the rule that fired

**Trigger detectors.**
- `exitIntent`: `mouseout` event with `e.clientY < 10` (top of viewport)
- `idle`: no mouse/keyboard for N seconds
- `scroll`: scroll depth > X%
- `time`: time on page > N seconds
- `returning`: cookie `pb_visitor_count > 1`
- `utm`: URL params at page load match config

**Rate limits.** Max 1 trigger per visitor per 5 minutes. Persist last-fired in `localStorage`.

**Server-side fallback.** Some triggers (e.g., post-purchase) may need server signals. For v1, all triggers are client-side. Server endpoint `/v1/widget/events` records firings for analytics.

**Acceptance criteria.**
- Setting an exit-intent rule with message "Don't go!" → moving cursor to top of viewport opens the bar with that message
- Idle trigger respects the configured seconds
- Cooldown prevents rapid retrigger

---

#### WU-21: Smart CTAs + lead capture

**Goal.** Render CTA cards inline in the assistant message stream when conditions match. Capture leads with a form.

**Files.**
- `resources/widget/src/ui/CtaCard.tsx`, `resources/widget/src/ui/LeadForm.tsx`
- Backend: `app/Services/Triggers/CtaSelector.php` evaluates after each turn and includes a `cta` payload in the `TurnCompleted` event
- `app/Http/Controllers/Widget/LeadController.php`
- `app/Jobs/Leads/RouteLeadJob.php` — pushes to integration_connections

**CTA selection logic.**
1. After the assistant message completes, run `CtaSelector::select(conversation, message)` 
2. Returns a single CTA or null
3. Conditions: page URL match, intent classification (use a small LLM call only if no rule matches by URL — keep simple for v1; intent classification deferred)
4. Embed CTA payload `{ label, kind, target }` in the broadcast

**Lead form.** When CTA kind is `capture_email` (or assistant says "share your email"), widget shows the form. POST to `/v1/widget/leads` → creates `Lead` row → dispatches `RouteLeadJob`.

**Acceptance criteria.**
- A CTA rule "show 'Book a demo' on /pricing" fires only on /pricing pages
- Lead form submits, lead appears in inbox, and a Slack/webhook fires (use a test webhook receiver in CI)

---

### Phase 7 — Leads, Billing, Integrations (2 units)

#### WU-22: Lead inbox + CRM webhook routing

**Goal.** Admin can browse leads with full conversation context; outbound routing to Slack/HubSpot/webhook.

**Files (backend).**
- `app/Http/Controllers/LeadController.php`
- `app/Http/Controllers/IntegrationController.php`
- `app/Services/Integrations/{SlackPusher,HubSpotPusher,WebhookPusher}.php`
- `app/Jobs/Leads/RouteLeadJob.php` (orchestrator)
- `app/Services/Webhooks/SignedDispatcher.php` — HMAC-SHA256 signing

**Files (frontend — Inertia pages).**
- `resources/js/pages/app/inbox/index.tsx` — list with filters
- `resources/js/pages/app/inbox/show.tsx` — detail view with full transcript
- `resources/js/pages/app/integrations.tsx`

**Slack integration (simplest).** Customer pastes an Incoming Webhook URL. We POST a Block Kit message with lead summary + transcript link.

**HubSpot integration.** OAuth flow → store access token → on lead, create a Contact via HubSpot's contacts API.

**Generic webhook.** Customer enters URL + secret. We POST signed JSON.

**Acceptance criteria.**
- Lead in inbox shows email, page URL, assistant transcript, citations
- Routing to Slack delivers a message (verify with `https://webhook.site` URL in dev)
- Webhook signature verifies on the receiver side using a sample script

---

#### WU-23: Stripe Cashier metered billing

**Goal.** Free / Standard ($49) / Pro ($249) / Custom plans. Conversation = the metered unit.

**Files.**
- Migration adding Stripe price IDs to `plans`
- `app/Services/Billing/MeteredBilling.php` — increments usage on `Conversation` create event
- `app/Listeners/RecordConversationUsage.php`
- `app/Http/Controllers/Billing/{Checkout,Portal,Webhook}Controller.php`
- `resources/js/pages/app/billing.tsx` — plan selection, current usage meter, upgrade (Inertia page)
- Stripe dashboard: Create products with metered prices for Standard and Pro

**Plan gating.**
- Free: 100 conversations/month, hard stop at limit (widget refuses init with "monthly limit reached")
- Standard: 500/month, soft warning at 80%, hard stop at 100% (or overage at $0.20/conv if you enable it — defer for v1)
- Pro: 3000/month, same pattern
- Custom: no metering

**Conversation-as-unit.** Increment `usage_events` with `kind=conversation, quantity=1` on conversation create — but only on the FIRST user message of that conversation (not on init alone, to avoid scrapers consuming your meter). Reconcile to Stripe metered usage hourly via a scheduled job.

**Acceptance criteria.**
- Stripe Checkout works end-to-end in test mode
- Hitting plan limit returns `429` with `code=plan_limit_reached` from `/v1/widget/init`
- Stripe webhook for `customer.subscription.updated` updates local `subscriptions` row

---

### Phase 8 — Analytics & A/B (3 units)

#### WU-24: Event ingestion + dashboard

**Goal.** A simple events table feeds an overview dashboard.

**Files (backend).**
- `app/Services/Analytics/EventStore.php` — `record(workspace, agent, conversation, kind, payload)` writes to partitioned `events` table; uses Redis for live counters
- `app/Http/Controllers/AnalyticsController.php` — `/v1/agents/:id/analytics/overview`
- Scheduled job: hourly rollup of Redis counters into a `daily_metrics` materialized table

**Metrics.** conversations, messages, leads, conversion rate (leads / conversations), avg session length, avg messages per conversation, top pages.

**Files (frontend — Inertia).**
- `resources/js/pages/app/analytics/index.tsx` — date range, charts
- Use Recharts (added in WU-01)

**Acceptance criteria.**
- Last 7 days view loads in under 500ms with realistic data
- Numbers reconcile: sum of daily metrics matches raw events for a sample day

---

#### WU-25: Content-gap detection + report

**Goal.** Surface questions the agent couldn't answer well, with frequency.

**Files.**
- `app/Jobs/Analytics/DetectGapJob.php` (already created in WU-16)
- `app/Services/Analytics/GapDetector.php` — normalizes questions (lowercase, strip punct, lemmatize via simple rules), groups by similarity (cosine sim threshold against existing gap embeddings)
- `app/Http/Controllers/Admin/AnalyticsController@contentGaps`
- `resources/js/pages/app/analytics/content-gaps.tsx`

**Gap "answer" flow.** From the gaps page, a customer can click "Answer this" → opens curated answer editor pre-filled → on save, gap status moves to `answered`.

**Acceptance criteria.**
- 100 visitors asking variations of "do you ship to UK" group into a single gap with `occurrences ≥ 90`
- Answering a gap removes it from the open list

---

#### WU-26: A/B testing engine

**Goal.** Customer can A/B test persona/CTA/trigger variants on a percentage of visitors.

**Files.**
- `app/Http/Controllers/ExperimentController.php`
- `app/Services/Experiments/Assigner.php` — sticky assignment by `visitor_id` hash
- During `/v1/widget/init`, resolve active experiments and assign variants; merge variant config into the agent config returned to the widget
- Track conversion: link conversation → variant; analytics rolls up per variant

**Files (frontend — Inertia).**
- `resources/js/pages/app/agents/experiments.tsx`

**Statistical guidance.** v1 doesn't auto-determine winners. Show conversion rate per variant + sample size; the customer decides.

**Acceptance criteria.**
- 50/50 split assigns visitors evenly across 1000 simulated visitors (within 5%)
- Same `visitor_id` always gets same variant within an experiment

---

### Phase 9 — Plugins & Polish (3 units)

#### WU-27: Shopify plugin

**Goal.** A Shopify app (separate small Node.js app — exception to your stack, but Shopify requires it) that installs the widget on a merchant's storefront via Theme App Extension.

**Realistic scope.** Shopify apps require Node + Shopify CLI. This is the one place you'll touch Node. The app does only:
1. OAuth install flow (Shopify → your service)
2. Stores `shop_domain` + access token in `integration_connections`
3. Theme App Extension: a Liquid block that injects `<script src=".../widget.js" data-agent-id="...">`
4. Optional: pull product feed via Shopify Admin API on schedule, send to Laravel for ingestion

**Files.**
- `services/shopify-plugin/` — separate Node app (lives outside the Laravel app), deployed to Vercel/Cloudflare Workers (NOT Laravel Cloud)
- `services/shopify-plugin/extensions/widget-injector/blocks/superbar.liquid`
- Backend: endpoint `/v1/integrations/shopify/oauth/callback` to receive the redirect

**If this is too much:** ship as a "manual" install in v1 — generate a `<script>` snippet the merchant pastes into `theme.liquid`. Defer the Shopify App Store listing. This is honestly fine for v1 and saves a week.

**Acceptance criteria.**
- A merchant can install the app, choose an agent, save, and the widget appears on their storefront

---

#### WU-28: Multi-language detection + reply

**Goal.** Visitor types in any of EN/ES/FR/DE/PT/JA/AR/ZH; agent responds in the same language.

**Files.**
- `app/Services/Lang/Detector.php` — uses `patrickschur/language-detection` (no external API) or a small OpenAI call (slower); for v1 prefer the local library
- Inject `detected_lang` into prompt template
- Embed model `text-embedding-3-small` is multilingual — same Qdrant collection works
- Curated answers carry an optional `lang` filter

**Acceptance criteria.**
- A query in Spanish receives a Spanish response
- Detection is correct for short queries (≥10 chars) ≥ 90% of the time
- Detection adds < 5ms to TTFT

---

#### WU-29: Marketing site + onboarding flow

**Goal.** Public site + signup flow that mirrors the Superbar UX (domain capture → signup → auto-crawl → widget snippet).

**Files (Laravel Blade or Next).** Pick Blade for v1 — one fewer deploy target.
- `resources/views/marketing/{home,pricing,how-it-works,integrations,privacy,terms}.blade.php`
- `app/Http/Controllers/MarketingController.php`
- Hero form: input URL → POST to `/marketing/start` → creates a pending Workspace + auto-triggers crawl → redirects to `/auth/sign-up?domain=...&workspace_token=...`
- After signup, the user lands in the agent settings with the crawl already in progress

**FAQ accordion + testimonial blocks** — straightforward Blade components.

**Marketing tracking.** Capture `gclid`, `fbclid`, `utm_*` to localStorage with timestamp on landing. Persist to user record on signup.

**Acceptance criteria.**
- Domain entry → signup → user sees crawl in progress within 30 seconds of signup
- Marketing pages score ≥ 95 on Lighthouse Performance

---

### Phase 10 — Deploy & Observability (2 units)

#### WU-30: Laravel Cloud deployment

**Goal.** Provision the three compute types and deploy.

**Files.**
- `infra/cloud.yaml` (or whatever Laravel Cloud's config format is at deploy time — confirm in their docs)
- Three compute services: `web` (octane), `reverb`, `workers` (horizon)
- Postgres: managed, `db.standard` for v1
- Redis: managed
- Environment variables: all keys from WU-01 set as secrets

**Region selection.** `us-east-1` (or whatever Laravel Cloud calls their US-East region) to match OpenAI's primary endpoint.

**Build steps.**
```
composer install --no-dev --optimize-autoloader
php artisan event:cache
php artisan route:cache
php artisan view:cache
php artisan octane:install --server=frankenphp
npm ci
npm run build              # admin (Inertia) bundle
npm run build:widget       # visitor widget IIFE bundle
# upload resources/widget/dist/widget.js to CDN (R2 + Cloudflare)
```

**Post-deploy.**
```
php artisan migrate --force
php artisan qdrant:setup
php artisan db:seed --class=PlanSeeder
```

**Cron.**
- `php artisan schedule:run` every minute (Cloud handles this)
- Scheduled tasks: hourly metrics rollup, daily cleanup of old events (>90 days)

**Acceptance criteria.**
- `https://app.pitchbar.io` loads the admin app
- `wss://ws.pitchbar.io` accepts WebSocket connections
- Horizon dashboard accessible (gated behind admin auth)
- A real visitor turn end-to-end works in under 1.5s TTFT in production

---

#### WU-31: Observability (Sentry + OTel + dashboards)

**Goal.** When something is slow or broken in production, you find out in seconds, and you have the trace to diagnose.

**Files.**
- `composer require sentry/sentry-laravel open-telemetry/sdk`
- `config/sentry.php`
- `app/Providers/TelemetryServiceProvider.php` — registers OTel tracer; instruments the RAG pipeline (already added spans in WU-16)
- Frontend: `@sentry/react` in admin and widget
- `resources/widget/src/core/sentry.ts` with sampling = 0.05 (widget runs on customer sites; don't be noisy)

**Dashboards (Honeycomb free tier or Grafana Cloud).**
- "Hot path TTFT" — p50/p95/p99 histogram of `rag.openai_first_byte` span
- "Cache hit rate" — count of `rag.retrieve_cache` hit vs miss
- "Errors per minute" by `exception.type`
- "Queue depth" — Horizon export to Prometheus

**Alerts.**
- TTFT p95 > 1500ms for 5 minutes → page
- Error rate > 1% for 5 minutes → page  
- Queue backlog > 1000 jobs for 10 minutes → page
- OpenAI 5xx rate > 5% for 2 minutes → page

**Acceptance criteria.**
- Throwing a test exception in a controller appears in Sentry within 30s
- A real visitor turn produces a complete OpenTelemetry trace with all expected spans
- Each alert configured can be triggered manually for verification

---

## 9. Testing Strategy

### Test pyramid

- **Unit (60%)** — Pure logic: chunker, prompt builder, trigger detectors, CTA selector, fake LLM/Qdrant
- **Feature/integration (30%)** — Controllers, jobs, RAG pipeline end-to-end against fakes
- **E2E (10%)** — Playwright: signup → add source → wait for index → playground turn → publish → embed widget on test page → visitor turn

### Mandatory tests

- `MultiTenancyTest` — every tenant-scoped model leaks zero data
- `HotPathTest::test_ttft_under_budget` — fails CI if local TTFT exceeds 1100ms
- `PromptInjectionTest` — battery of 20 known injection payloads must not extract the system prompt or change behavior
- `WidgetBundleSizeTest` (CI step, not Pest) — `gzip < 50KB`
- `BillingMeterTest` — concurrent conversations don't double-count

### Continuous integration

```yaml
# .github/workflows/ci.yml
on: [push, pull_request]
jobs:
  backend:
    services: { postgres, redis, qdrant }
    steps: [composer install, pint --test, phpstan, pest]
  frontend:
    steps: [npm ci, lint:check, types:check, build]
  widget-size:
    steps: [npm run build:widget, assert gzip < 50KB]
  e2e:
    needs: [backend, frontend]
    steps: [docker compose up, pest --filter=Browser]   # Pest 4 browser tests
```

---

## 10. Laravel Cloud Deployment

### Environments

- `development` — local docker-compose
- `staging` — Laravel Cloud, separate Postgres/Redis, smaller compute, real Qdrant Cloud, real OpenAI keys (test workspace)
- `production` — Laravel Cloud, scaled compute

### Compute sizing for v1 launch

| Service | Type | Initial size | Notes |
|---|---|---|---|
| Web (Octane) | request-driven | 2 instances | autoscale on CPU |
| Reverb | persistent | 1 instance, 1GB RAM | scale up at 5k connections |
| Workers (Horizon) | persistent | 1 instance, 1GB RAM | scale on queue depth |
| Postgres | db.standard | smallest | upgrade once events table grows |
| Redis | cache.standard | smallest | |

### Domains
- `app.pitchbar.io` → web
- `api.pitchbar.io` → web (same backend, different host header)
- `ws.pitchbar.io` → reverb
- `cdn.pitchbar.io` → R2 bucket via Cloudflare (widget bundle, public)
- `pitchbar.io` → web (marketing routes)

### CI/CD
- Push to `main` → deploy staging
- Tag `v*` → deploy production with manual approval
- Migrations run as a deploy step before traffic switch

### Backups
- Postgres: daily automated, 30-day retention (Cloud default)
- Redis: not backed up — treat as ephemeral cache
- S3/R2 uploads: lifecycle rules, versioning on

---

## 11. Launch Checklist

Don't go live until every box is checked.

**Functional**
- [ ] Signup → workspace → agent → add source → crawl completes → playground works → publish → widget embeds on test site → real visitor turn streams
- [ ] All 8 v1 features (streaming widget+RAG, triggers, CTAs+leads, billing, analytics, content-gaps, multi-language, A/B) end-to-end
- [ ] Stripe live keys configured; tested with a real card on a test workspace
- [ ] Free plan limit enforces correctly

**Performance**
- [ ] TTFT p95 < 1s in staging under simulated load (k6 script: 50 concurrent visitors)
- [ ] Widget bundle gzip ≤ 50KB
- [ ] Marketing pages Lighthouse ≥ 95
- [ ] Crawl of a 100-page site completes < 10 min

**Security**
- [ ] All endpoints require auth (Sanctum or widget JWT)
- [ ] CORS restricted to known origins per agent
- [ ] CSP headers set on admin and marketing
- [ ] Stripe + outbound webhooks signed
- [ ] PII redacted from logs (Sentry scrubber + custom log processor)
- [ ] Rate limits on all public endpoints
- [ ] Prompt-injection regression suite passes
- [ ] Penetration test (or at minimum OWASP ZAP scan) on staging

**Compliance**
- [ ] Privacy policy + Terms posted
- [ ] DPA template available for enterprise prospects
- [ ] Cookie consent banner if EU traffic expected
- [ ] Data export endpoint (GDPR right to portability)
- [ ] Data delete endpoint (GDPR right to erasure)

**Operations**
- [ ] Sentry catching errors
- [ ] Honeycomb (or equivalent) showing traces
- [ ] Alerts wired to PagerDuty / on-call
- [ ] Runbook for common incidents (OpenAI down, Qdrant down, Reverb crashed)
- [ ] Status page published (cheap option: BetterStack)
- [ ] Daily Postgres backup verified by restoring to a scratch DB

**Business**
- [ ] Pricing page live
- [ ] At least one onboarding email in the funnel
- [ ] Support email working (`support@`)

---

## 12. Explicitly Deferred (NOT in v1)

To keep v1 shippable, these are post-launch:

- Voice input (speech-to-text in widget)
- Speech-out
- WordPress / Webflow / Squarespace / Wix plugins (do Shopify only or even defer that)
- Notion / Intercom / Zendesk / Freshchat connectors
- BigCommerce / Magento / commercetools / Spryker / Elastic Path / SAP Commerce / PrestaShop integrations (just Shopify in v1)
- Calendly / Cal.com booking inside widget
- HubSpot / Salesforce / Pipedrive native (use generic webhook in v1)
- Agency mode (parent workspace over child workspaces)
- Image carousels and comparison tables in widget
- Live human handoff (route-to-human is just a webhook in v1)
- ClickHouse for analytics (Postgres is fine to ~10M events/month)
- On-prem / VPC deployment
- SAML SSO (Sanctum is enough for v1)
- Auto-determine A/B winners (show numbers, customer decides)
- OCR for scanned PDFs
- pgvector fallback (Qdrant only for v1)
- "Self-improvement" auto-retraining loop (manual gap → curated answer is enough)

If a customer asks for something in this list, sell them the Custom plan and build it for them.

---

## How to feed this to Claude Code

1. **One unit at a time.** Open Claude Code, paste sections 1–7 plus the single WU you're building. That's enough context.
2. **Start a new session per unit.** Don't let context bleed between units — it makes the model lose track of the contract.
3. **Run the acceptance criteria immediately.** Don't merge a unit until each box is green. The whole plan depends on each unit's contract being honored.
4. **When you hit something this plan doesn't cover.** Stop, decide, document the decision in this file under the relevant section, then continue. Don't ad-lib silently.

Good luck. Build the boring parts first; the AI parts are deceptively the easiest.