Software Definition · Canonical Reference

Architecture
& Design

The single source of truth for how Field Engineer is built: the modular split that isolates each tool, the shared core they all depend on, the backend contracts, and the data models. For developers and architects.

Supersedes the architecture fragments in the Technical Reference & Stage 2 spec

What changed. This document adopts a modular split: each tool becomes its own independently deployable front end on a shared core library, rather than the single-file, multi-mode app described in earlier drafts. The goal is fault isolation — a failure in one tool cannot take down another. This resolves the open decision in Master Plan §14 ("modularize before Stage 3?"). The data models, recommendation logic, credential model, and backend contracts below are unchanged in behavior — only how the code is packaged and deployed changes.

How to read this document
The split architecture
Module boundaries & fault isolation
The shared core library
The backend (stateless relay)
Backend action contracts
Data models
Recommendation engine (Sales)
Credential model & gate evaluation (Repair)
Safety enforcement in app logic
Security & privacy
Storage & persistence
Cross-tool integration (planned)
Migration path & open decisions

01How to read this document

This is the canonical reference for architecture, data models, and backend contracts. Where the Technical Reference, the Stage 2 spec, or the User Guide describe the same model, they should point here — keeping one source prevents the drift the Master Plan warns about.

Status is tagged throughout so readers always know what exists versus what is designed: Built working & deployed · First pass built but intentionally shallow · Planned designed, not built · Future later stage.

02The split architecture

Field Engineer is a set of independently deployable tool apps, each consuming one versioned shared-core library, all talking to a single stateless backend. The customer-facing app is a separate build that never loads internal logic. Nothing in one tool can crash another.

The three layers

Tool apps — Sales/Survey, Pricing, Tech/Repair, Inventory. Each is its own deployable bundle with its own storage keys. A runtime error in one is contained to that app.
Shared core library — the design system, storage abstraction, backend client, and AI nameplate read, packaged once and versioned. This is the answer to the duplicate-code trap: fix a nameplate bug once, not four times.
Stateless backend — one Vercel serverless function (/api/lookup) that holds API keys and relays to external services. Because it stores no per-tool state, one relay safely serves every app; isolation lives at the front end, not here.

03Module boundaries & fault isolation

The contract between modules is deliberately narrow: a tool app may depend only on the shared core and on backend actions. Tool apps never import each other. That single rule is what guarantees the blast radius of any failure stays inside one tool.

Tool app	Owns	Depends on	Status
Sales / Survey	Survey flow, recommendation engine, rebate request UI	Core; backend `property`, `utility`, `nameplate`, `rebates`, `parsebill`, `rates`, `estimate`	First pass
Pricing	Price book, markup rules, owner/tech view toggle	Core; backend `parseprices`	First pass
Tech / Repair	Diagnostic loop, credential gating & hard-stop logic, job record	Core; backend `diagnose`, `nameplate`	First pass
Inventory	Locations, parts, per-location quantities, search	Core (local only in first pass)	First pass
Customer App	Public service request	Core (UI only); backend public actions	Planned

Safety lives in Repair, not core. The credential gates and hard-stop logic (§09–§10) reside inside the Tech/Repair app, not in the shared core. This is deliberate: it means a Pricing or Sales change can never regress a CO or cracked-heat-exchanger hard stop, because that code is in a separately deployed bundle that those changes don't touch.

Customer-app boundary (unchanged, Master Plan §09). The customer app is a separate build that talks to the system only through the backend. A customer's browser must never load internal cost, margin, inventory, or credential logic. The split makes this boundary structural rather than a convention.

04The shared core library

Everything every tool needs, packaged once and versioned. Tool apps pin a core version; core ships changes deliberately so an upgrade is a conscious act per app, not an accidental break.

Core module	Responsibility	Source today
Design system	Fonts (Archivo / IBM Plex), palette, component styles, the app shell chrome	Shared across all four current docs
Storage abstraction	Platform store when present (Claude preview), `localStorage` fallback on the hosted site; per-key get/set	Tech Ref §04 (resolved)
Backend client	Builds the `action` request, attaches the optional app token, posts to `/api/lookup`, returns clean JSON or a typed error	Tech Ref §01–02
Nameplate read	Captures photos, downscales to thumbnail, sends full-res once for the AI read, returns structured fields + confidence	Tech Ref §03, §08 (shared by Sales & Repair)

core/ design-system/ // tokens.css, shell, components storage.js // get(key) / set(key,val) — platform | localStorage backend.js // call(action, payload) -> Promise<json> nameplate.js // read(photos) -> { brand, model, serial, type, ageEst, confidence } version: "1.x" // pinned by each tool app

05The backend (stateless relay)

One Vercel serverless function, one endpoint, routed on an action field. It is stateless — no database, stores nothing. It exists purely to hold the keys and relay calls, which is why a single shared backend does not undermine front-end isolation.

Endpoint: POST /api/lookup, dispatching on action.
Actions: property, utility, nameplate, rebates, parsebill, rates, estimate, diagnose, parseprices, ping (health check for the Test button).
Holds: RentCast & Anthropic keys as Vercel environment variables; never in any app or repo.
Stores: nothing. Lookup payloads pass through to the third party for that call only.

Future option, not now. If a single function ever becomes a contention point, actions can be split into per-domain functions (e.g. /api/diagnose, /api/price) without any app change beyond the core backend client's routing. The stateless design makes that a later, low-risk move — full backend microservices are explicitly out of scope at this stage.

06Backend action contracts

The canonical request/response for each action. Earlier docs show fragments of these; this is the reference.

`property` Built

{ action:"property", address:"1420 Oak St, Brighton, CO" } // backend → RentCast GET /v1/properties?address= // returns public county / assessor records; tech can override { sqft:1840, yearBuilt:2006, stories:1 } // utility provider is NOT supplied by RentCast — resolved by the `utility` action below

`utility` Planned

{ action:"utility", address:"1420 Oak St, Brighton, CO 80601" } // backend → Anthropic API with web search enabled (same pattern as `rebates`) // prompt asks for the DISTRIBUTION utility (whose efficiency rebates apply), // not the retail energy supplier the customer may have chosen. { electric:{ name:"Xcel Energy (Public Service Co. of Colorado)", confidence:"high", sourceUrl:"…", alternatives:["United Power","IREA","Black Hills Energy"] }, gas:{ name:"Xcel Energy", confidence:"high", sourceUrl:"…", alternatives:["Atmos Energy","Colorado Natural Gas"] } } // alternatives = short curated list of other plausible providers for the area; // either name may be null with confidence:"low" if not found.

Mandatory confirmation gate (app-enforced). The utility result is a suggestion, never an authoritative value. When it returns, the Sales app pre-fills the electric and gas fields and opens a blocking confirmation window — "We found these utilities for this address. Is this correct?" — showing each provider, its confidence badge, and a source link. The salesperson must Confirm before the Customer & Property step can advance; the Next control stays disabled until then. Edit is a pick-from-list, not free text: tapping edit shows the returned alternatives for that area plus an "Other…" entry, so the chosen provider stays on a known, normalized name (clean data for rebates and the savings profile). What was detected, what was finally chosen, and who confirmed it are stamped to the survey. Low-confidence or null results still require an explicit confirm — the gate never auto-passes. Confirming first sharpens the downstream rebates and Savings Profile (§08), both of which read the utility and its rate.

`nameplate` Built

{ action:"nameplate", images:[ {base64, mediaType} ] } // backend → Anthropic vision { brand, model, serial, type, tonnage, ageEstimate:{ years:10, confidence:"high", flagged:true } } // age is best-effort, brand-specific, flagged for verification

`rebates` Built

{ action:"rebates", address, utility, equipmentTypes:[…] } // backend → Anthropic API with web search enabled { programs:[ { name, level:"federal|state|utility", amount, sourceUrl } ] } // live snapshot; amounts always verified by the tech

`parsebill` Planned

{ action:"parsebill", images:[ {base64, mediaType} ] } // 1+ utility bills // backend → Anthropic vision (same pattern as `nameplate` / `parseprices`) // extract USAGE & COST ONLY — never the account number, name, or address. { bills:[ { fuel:"electric", periodMonth:"2026-01", usage:1180, unit:"kWh", cost:154.20 }, { fuel:"propane", periodMonth:"2026-01", usage:92, unit:"gal", cost:285.20 } ] }

Optional, never required — and privacy-bounded (Safety condition). Bill upload is an opt-in accuracy upgrade for the Savings Profile, offered as "want an exact number instead of an estimate? add a recent bill or two." It never gates the survey. The default ask is one peak-summer + one peak-winter bill (most of the seasonal accuracy for minimal burden); 12 months is ideal when a serious buyer will provide it, including by email after the visit. A utility bill is sensitive personal data, so the action extracts usage and cost only — the account number, name, and any occupant detail are not extracted, and the bill image is never persisted (held in memory for the one read, like full-res nameplate photos). Parsed values override the energy-profile defaults (§07) and flip the Savings Profile (§08) from "estimate" to "based on your bills."

`rates` Planned

{ action:"rates", utility:{electric, gas}, state:"CO", region:"rocky_mountain" } // backend → EIA open data API (free, structured), cached & refreshed ~monthly. // fallback: Anthropic + web search for a utility's current headline rate. { electric:{ pricePerKwh:0.141, level:"utility", // utility-level when known, else state source:"EIA-861", asOf:"2024", trendPctPerYear:5.2 }, gas:{ pricePerTherm:0.98, level:"state", source:"EIA", asOf:"2026-02", trendPctPerYear:6.0 }, propane:{ pricePerGal:3.05, level:"region", source:"EIA SHOPP", asOf:"2026-02", trendPctPerYear:4.0 } } // asOf = the vintage of the figure; trendPctPerYear = EIA's recent YoY change, for the escalation note.

Every rate carries its vintage (Safety / honesty condition). EIA is a proxy for the effective residential price (revenue ÷ sales), which already blends fixed charges and riders — good for a savings estimate — but it lags: utility-level figures run ~a year behind, state/region figures ~2 months. So each rate returns an asOf date and a recent trendPctPerYear. The Savings Profile (§08) must show the as-of date with an asterisk on every Basis-tier figure and state the escalation context, so a salesperson can say "these use 2024 rates; prices are up ~10% since, so your real savings are likely higher." It is never silently presented as current.

`estimate` Planned (prototype: AI estimate)

{ action:"estimate", system, tier, property:{ sqft, yearBuilt, region }, // region → altitude / climate site:{ access, equipmentLocation, ductwork, lineSet:"reuse|new|unknown" }, existing:{ fuel, refrigerant, ageYears }, // drives tear-out / hazmat special:[ "finished_basement", "crawlspace", "attic" ] } // backend → Anthropic + web search (PROTOTYPE). Later: replaced by Pricing-tool import (hooks). { allInPrice:12800, range:{low:11000, high:15000}, breakdown:{ equipment, labor, materials, lineSet, tearOutDisposal, hazardous, permit }, assumptions:[…], asOf:"2026-06", confidence:"rough" }

Prototype pricing is a single all-in estimate (Safety: clearly an estimate). Until the Pricing tool is wired, this action returns one complete-package figure per system/tier — equipment, labor, materials, consumables, line set (reuse vs. new), tear-out & disposal, hazardous handling (old-refrigerant recovery; possible asbestos in pre-1980 homes), and permit — researched on the web and labeled a rough estimate with a range. It bakes in Colorado specifics (altitude sizing, ultra-low-NOx 96%+ AFUE furnaces). When pricing import is built, the same call is served from the Pricing tool's price book (cost) + company labor + markup; the contract stays the same so the hook is a drop-in.

`diagnose` First pass

{ action:"diagnose", equipment, jobType, complaint:{ reported, confirmed }, components:[ {name, brand, model, serial} ], trail:[ {step, value, unit} ], // actual readings so far tech:{ level, epaType, gates, conditionalOverride }, images:[ {base64, mediaType} ] } // optional step photos // returns exactly one of: { kind:"step", step:{ stepId,title,how,expects:{type,unit},inlineCaution,requiresGate } } { kind:"diagnosis", diagnosis, repair:{ requiresGate, steps:[…], inlineCaution } } { kind:"hardstop", hardstop:{ stopId,type,message,action,clearedBy:{gates:[…]} } }

The model proposes; the app decides. The diagnose response is advisory. The Repair app authoritatively evaluates requiresGate against the tech's profile and enforces every hard stop (§09–§10). The model is instructed to escalate on low confidence rather than fabricate a next step.

`parseprices` First pass

{ action:"parseprices", images:[ {base64, mediaType} ] } // invoice / screenshot // backend → Anthropic vision; extracts line items for the price book { items:[ { description, sku, vendorCost, vendor } ] }

`ping` Built

{ action:"ping" } → { ok:true } // the Test button's health check

07Data models

All persistence is per-device today. Each tool owns its own keys, so models stay decoupled across the split.

Model	Storage key	Owner app
Surveys (customer, property, systems, site, needs, goals, energy profile, site photos, rebates, proposal)	`fieldeng_surveys_v2`	Sales
Tech profile (admin-set)	`fieldeng_tech_profile`	Repair
Trouble calls (complaint, components, trail, diagnosis, verify, stamps, escalation)	`fieldeng_tech_calls_v1`	Repair
Price book + settings (cost, markup rules, view toggle)	`fieldeng_pricebook`	Pricing
Inventory + locations (parts, per-location quantities)	`fieldeng_inventory`	Inventory
Backend URL + optional token	`fieldeng_backend_url` / `_token`	Core (all)

Site documentation photos (Sales) Planned. Beyond equipment nameplates, the survey captures a labeled photo set for the proposal and the future install crew: install location, home exterior, interior equipment area, electrical panel, nameplates, and access/obstructions — add as many as needed per category. Thumbnails persist on the record (same handling as nameplate photos); they ride along on the proposal and the office handoff.

Energy profile (Sales survey) Planned

// captured during the survey; feeds the Savings Profile (§08). Unknowns → regional defaults. { heatingFuel:"propane", // natural_gas | electric_resistance | heat_pump | propane | oil propane:{ tank:"leased", // owned | leased (null if not propane) tankGal:500, pricePerGal:3.10, annualGallons:480, estimated:true }, // true when price/usage came from defaults, not the customer solar:{ present:true, sizeKwDc:7.2, netMetered:true, ageYears:3 }, battery:{ present:true, usableKwh:13.5, backs:"essentials" }, // whole_home | essentials | partial bills:{ provided:false, months:[] } // optional; populated by `parsebill` (§06). } // when bills.provided, parsed usage/cost override the // estimated fields above and the Savings Profile uses actuals.

Tech profile (canonical) First pass

// admin-set only; tech cannot edit own credentials { techId:"T-1042", name:"Johnny R.", level:"novice", // novice | mid | senior | master epaType:"II", // "II" | "Universal" epaCertNumber:"608-XXXXX", gates:["E1","HP"], // admin-granted skill gates credentialNumbers:{ E1:"…", HP:"…" }, conditionalOverride:false }

Job record (canonical) First pass

{ jobId, techId, timestamp, equipment, complaint:{ reported, techConfirmed }, components:[ /* nameplates + serials + thumbnails */ ], diagnosticTrail:[ { stepId, value, unit, time } ], diagnosis, repairGuided, verify:{ reSymptom, reading }, stamps:{ techId, epaCertNumber, credentialsUsed:[…], acknowledgments:[…] }, escalation:null // or { reason, atStep, preservedTrail } }

08Recommendation engine (Sales) First pass

Runs entirely on the device — no internet call — so the recommendation works with no signal. Lives inside the Sales app.

Sizing (heuristic, not a Manual J)

Cooling tons = (sq ft ÷ 500) × 1.15 if built before 1980 (leakier) × 0.90 if built after 2010 (tighter) rounded to nearest 0.5, limited to 1.5–5 tons Heating band = sq ft × 35 to sq ft × 45 BTU (cold-climate assumption, rounded to 1,000)

Good / Better / Best tiers (fixed reference table)

System type	Good	Better	Best
AC / cooling	14.3 SEER2 single-stage	16 SEER2 two-stage	18+ SEER2 inverter
Furnace / boiler	80% AFUE	96% AFUE	98% AFUE modulating
Heat pump / mini-split	15 SEER2 / 8.1 HSPF2	17 SEER2 / 9 HSPF2	19+ SEER2 / 10+ HSPF2 cold-climate

Flags

Per-system: equipment age 15+ (end-of-life) or 12–14 (aging); condition Failed; gas furnace → heat-pump conversion candidate. Whole-home: heat-pump rec + small panel (100A or 125A, or no open breaker spaces) → panel-upgrade flag; Poor/Fair ductwork and line-set marked Replace → scope items; humidity/dust → indoor-air-quality opportunity; hot-cold/uneven → zoning opportunity; maintenance-agreement prompt always appears.

Sizing is a check, not an authority. The heuristic estimate is used to cross-check the installed equipment: when the estimate and the existing tonnage disagree (by ~1+ ton), or confidence is low (square footage missing/suspect, conflicting inputs), the survey raises an "office to verify sizing" flag rather than presenting a confident number. It confirms a sane match or routes the questionable ones to the office — it never stands in for a proper load calculation.

Electrical panel field. Captured in Site Conditions as one of 100A · 125A · 150A · 200A · 200A+, plus an open-breaker-spaces check. The panel-upgrade flag fires for an electrification (heat-pump) recommendation when the panel is 100A or 125A, or when there are no open breaker spaces; 150A and up generally clear unless spaces are full.

Savings Profile (estimate) Planned

A directional estimate of annual energy cost, current setup vs. each proposed tier — the value story for the close. Computed on-device (offline) from the energy profile (§07), the sizing estimate above, and the confirmed utility rate. It is an estimate, never a guarantee.

annual_heating_energy ≈ heating_band(BTU) × climate_run_hours // from sizing current_cost = annual_energy ÷ current_fuel_efficiency × fuel_rate fuel_rate from `rates` action (§06): electric $/kWh · gas $/therm · propane $/gal · oil $/gal — each tagged with asOf + source bills (§07) override fuel_rate with the customer's actual $ when provided proposed_cost = annual_energy ÷ proposed_efficiency(SEER2/AFUE/HSPF2) × electric_or_gas_rate solar_offset = min(annual PV production, new electric load) // if solar.present savings_range = (baseline_cost − proposed_cost − solar_offset value) ± confidence band simple_payback ≈ net_install_cost ÷ annual_savings // rough, when price known // baseline_cost = annual cost of the SELECTED comparison baseline (dropdown, below)

Comparison baseline (selectable)

The salesperson picks what the proposed system is measured against from a dropdown, so the screen always reads "estimated savings vs. [baseline]." Options appear only when they apply:

Baseline option	Proposed system compared vs.	Available when
Your current system (do nothing)	Keeping the existing unit running as-is	Always — the default
Repairing your current system	The repair estimate + running the aging unit on	A repair estimate exists — the Repair-vs-Replace handoff, or one is entered
A basic replacement (Good tier)	The entry-level new system	Proposed tier is Better or Best (shows the upgrade's payback)

One engine, two tools. This is the same calculation the Repair-vs-Replace Advisor (Stage 2 §B8) uses — its "replace" column is simply this panel with the Repairing your current system baseline selected. The proposed tier is switchable too; changing either side recomputes the range at its confidence tier. Honest framing (Safety): the baseline is always stated in words, every option uses real figures, and no baseline is hidden to inflate the number.

Estimate, not a guarantee (Safety condition). Every figure is shown as a range with its assumptions visible (fuel rate, usage, efficiency, solar offset), labeled an estimate, and never called "guaranteed" — the same discipline as the sizing heuristic. Even the top tier projects future savings, so it stays an estimate; more data only narrows the band. The salesperson can adjust any assumption, and figures are tech-verifiable before the customer sees them.

Rate vintage & escalation (the asterisk)

Because the rates data lags (§06), the Basis tier must be honest about when the rate is from. On the recommendation/presentation screen, every Basis-tier dollar figure carries an asterisk tied to a footnote stating the source and the as-of date — and the gap to today:

* Estimate. Based on {utility} {level} energy rates from {source}, {asOf}. Energy prices have risen ≈{trend × yearsSince}% since then, so your actual rates — and savings — are likely higher. Add a recent bill for an exact figure.

So a 2024 rate viewed in 2026 reads, e.g., "* Based on 2024 EIA rates; prices are up ~10% since, so real savings are likely higher." Two deliberate choices: the headline number is computed on the as-of rate (the conservative figure — we don't silently inflate the rate to bump savings), and the escalation appears as a stated, EIA-trend-derived note so the salesperson can speak to it honestly. Framed this way the vintage is a strength — the estimate is conservative, and rising prices make the case better, not worse. When the customer supplies bills, the figure switches to actuals and the asterisk goes away.

Headline cases the MVP targets first: propane → heat pump (large, defensible delta) and solar-offset electrification (PV covers much of a new heat pump's load). Battery presence informs the resilience/outage talking point and may affect incentive eligibility, but is not part of the cost math in the MVP.

Confidence tiers

The number the customer sees comes in three levels — the more real billing data they share, the tighter the band. The level is set by how many months of bills parsebill (§06) returns, and the label on screen states it plainly.

Tier	Inputs	How it's computed	Label shown
Basis rough	No bills — regional defaults + sizing heuristic + confirmed utility rate	Modeled usage from square footage and fuel; widest range	"Rough estimate"
Better	1–2 bills (default ask: one peak-cooling + one peak-heating month)	Anchors actual usage/cost at the seasonal extremes, models the months between; mid range	"Based on sample bills"
Best	12 months of bills	Full actual annual usage and cost; tightest range	"Based on a full year of bills"

tier = bills.months.length >= 12 ? "best" : bills.months.length >= 1 ? "better" // 1–11 months, peak pair preferred : "basis" // tier sets the confidence band width and the on-screen label; never a guarantee at any tier.

Always available, never required. Basis needs nothing from the homeowner and works offline, so a number is always there. Better and Best are opt-in upgrades via parsebill — bills can be added on-site or emailed after the visit, and the survey never blocks waiting for them. Privacy stays bounded per the parsebill rules (usage/cost only, image never persisted).

Customer goals → recommendation framing Planned

Captured on the Comfort & Needs step as a multi-select with one marked primary. Goals don't change the engineering (sizing, tiers, flags are unchanged) — they decide which tier the screen leads with and which story gets top billing, so the same survey lands differently for a price-driven buyer than a comfort-driven one.

Goal	Recommendation leads with
Lowest upfront cost	Good tier + financing; cheapest path that solves it
Best overall value	Better tier; lifetime cost + the Savings Profile
Lower energy bills	Savings Profile front and center; efficiency deltas
Improved comfort	Comfort flags (zoning, IAQ, two-stage/variable), not price
Reliability / peace of mind	End-of-life flags, reliability, maintenance plan
Environmental impact	Heat-pump / electrification + emissions story + rebates
Direct replacement	Like-for-like match; minimal change & disruption
Best available	Best tier; top performance, budget not the constraint

goals: ["lower_bills","comfort"] // multi-select primaryGoal: "lower_bills" // exactly one; drives which tier + story leads // "More" also offers: staying_long_term · selling_soon (context for framing)

Honest framing, not steering. Goals reorder emphasis, never hide options — all tiers and the real numbers stay visible. A price-driven customer still sees the value case; a value-driven one still sees the cheap option. This keeps the "Quality Without an Upsell" brand intact while meeting the customer where they are.

Pricing the job Planned (prototype: AI estimate)

The quote auto-builds from the chosen tier — each tier is assembled from components at quote time (slower than fixed packages, but more accurate). It estimates the parts the job actually needs, not a generic list:

Line set — reuse the existing one if it's compatible and sound, or price a new run.
Tear-out & disposal of the old equipment, plus hazardous handling where flagged (old-refrigerant recovery; possible asbestos in pre-1980 homes).
Equipment, labor, materials, consumables, and permit.

It asks about access & special conditions that move the price — a new line set fished through a finished basement, work in a crawlspace or attic, etc. — pre-filling what Site Conditions already captured (access easy/moderate/tight) and asking only for what's missing.

For the prototype, the estimate action (§06) returns this as a single all-in package price (with a range), web-researched and labeled rough — e.g. a complete furnace + AC for a ~2,000 sq ft Front Range home estimates around $11,000–$15,000 all-in. When pricing import is built, the same call is served from the Pricing tool: price-book cost + company-entered labor + company-entered markup → sell price, with cost/margin hidden from the salesperson per §11.

Adjustments & scope add-ons

Flagged scope items are preset, priced add-ons the rep can toggle on — panel upgrade, ductwork replacement, new line set, IAQ, zoning. Beyond the presets, an "Other" entry lets the rep describe a need in their own words; the tool prices it from company labor rate + company markup + AI-estimated materials (all AI-estimated in the prototype; labor and markup come from config once entered).

Proposal & close Planned

The survey now ends in a customer-facing proposal that closes the deal, not a text summary. It does four things: present the proposal, show financing, capture acceptance, and hand off the scheduled job.

Customer-facing proposal — a clean, shareable document (PDF) with the recommended tier(s), the savings profile, and the full price breakdown.
Price breakdown — what the customer pays, the deposit due at acceptance (default 60%, office-configurable), the balance due at completion (default 40%), the estimated rebates, and the net out-of-pocket total. That net number is the headline.
Financing (third-party, entered) — credit applications are not submitted by this tool; that happens in the lender's own application. The rep enters up to two financing options back onto the proposal, each with an estimated monthly payment, term (years), and APR, shown side by side. Office can maintain the list of available lenders to pick from. Every financing figure carries a conditional disclaimer — "estimated, subject to credit approval."
E-signature & acceptance — captured in the tool.
Schedule / handoff — on acceptance the job is marked "ready to install" and the complete package (survey, proposal, photos, signature, deposit, financing selection, scope) is pushed to the office. The office confirms equipment and availability and sets the date — the rep doesn't book a firm slot. The office backend (a later build — §14) syncs the job to a Google job calendar and to QuickBooks for the deposit invoice.

proposal: { tier, systems:[…], sellPrice, // what the customer pays downPayment, // required deposit balance, dueOn, // remainder + when it's due financing: [ // up to 2 options, entered per deal { lender, termYears, apr, monthlyPayment }, // monthlyPayment = estimated { lender, termYears, apr, monthlyPayment } ], financingDisclaimer: "Estimated — subject to credit approval", // always shown rebatesEstimated, // from `rebates`, verified netOutOfPocket, // sellPrice − rebatesEstimated ← headline deposit: { pct:60, amount, dueAt:"acceptance" }, // office-configurable balance: { pct:40, amount, dueAt:"completion" }, signature, acceptedAt, status: "ready_to_install", // office then confirms equipment + schedules handoff: { googleCalendar:false, quickBooks:false } // set true once office backend syncs }

Pricing visibility & commission (roles). The customer and salesperson see the sell price and the out-of-pocket total. The salesperson may discount only down to an enforced minimum (a floor that protects margin) and never sees the cost or the margin. Commission scales with the sell price — closing higher earns more; discounting toward the floor (never below it) earns less. Only the sales manager and office see the exact system cost and margin. This requires the roles/accounts model (deferred — §14); until logins exist the app can enforce the floor and hide cost, but true protection arrives with accounts. Live prices come from the Pricing tool once wired (Stage 3).

Office handoff & integrations Planned. The accepted job pushes to the office backend, which integrates with a Google job calendar (the install slot once the office confirms) and QuickBooks (the 60% deposit invoice, 40% on completion). Actual payment is handled in QuickBooks/office, not the field app. The Google and QuickBooks OAuth credentials live server-side in the office backend, never in the field app — same principle as the API keys (§11). Deposit split and the lender list are office-configurable.

Proposal document layout Planned

The customer-facing PDF, in order — Empower-branded (charcoal header, cobalt accent, orange action), single page:

Header — Empower wordmark + "Quality without an upsell," proposal #, date, valid-through date.
Customer & property, prepared-by rep.
Recommended system — the goal-driven primary tier, highlighted, with key specs and a one-line why.
Good / Better / Best — three columns side by side, recommended one outlined, each with spec + price (lets the customer trade up/down).
What's included — equipment, install, removal & disposal, line set, permit, startup/testing, warranty, rebate paperwork.
Your investment — system price, estimated rebates (−), net out-of-pocket (headline), deposit (60%) at acceptance, balance (40%) at completion.
Estimated savings — annual figure vs. the selected baseline, carrying the dated-rate asterisk.
Financing — the two entered options (monthly / term / APR), "subject to credit approval."
Rebates & incentives — programs with level + estimated amount, "amounts subject to verification."
Acceptance — signature + date; signing accepts and authorizes scheduling.
Fine print — estimates not guarantees; savings on dated rates; financing subject to approval; rebates to verify; sizing preliminary/confirmed before install; proposal valid 30 days (configurable).
Footer — contact, license #.

Never on the customer document: cost, margin, or commission. The proposal shows sell price, rebates, net out-of-pocket, deposit/balance, savings, and financing only — the internal figures stay in the rep's private screen and the manager/office views (§11). Optional second page: site photos + detailed scope/terms.

09Credential model & gate evaluation (Repair) First pass

Two layers plus an override flag, all admin-set. Gates govern what work is permitted; level governs how much guidance and override eligibility; the override flag clears specific conditional stops. Evaluated authoritatively in the Repair app.

Code	Gate	Unlocks (beyond baseline)
EPA-U	EPA 608 Universal	Low-pressure / chiller (Type II high-pressure work is baseline)
E1	Line-voltage (single-phase)	Burned line-voltage wiring, whips, disconnects, hard-wired motor rewiring
E2	Three-phase (hands-on)	Hands-on 3-phase work (measurement on 3-phase is allowed for all)
E3	New circuit / branch	New circuit from panel, branch extension, new disconnects
G1	Gas valve / piping	Replace gas valve, broken-seal gas connections (leak-test required)
G2	Combustion & venting	Combustion analysis (CO/O₂), venting sign-off, confirming a cracked HX
HP	Heat pumps	Reversing valve, defrost, low-ambient, aux/emergency-heat, HP charging
B1	Hydronic boilers	Circulators, expansion tanks, zone valves, aquastats
B2	Steam boilers	Steam controls, LWCO, pressuretrols, sight glass (full stop without B2)
WH	Water heaters	Gas control valve, T&P, thermocouple/igniter, tankless flow & descaling
CB	Combi units	Combi boiler + DHW combined systems
OIL	Oil-fired	Defined but dormant until oil service is offered

Gate principle. Measurement & diagnosis are never gated (3-phase included) — only hands-on modification carries a gate. Combustion-dependent work on any fuel-burning unit also requires G2. Gates stack (a 3-phase HP contactor swap needs E2; a boiler combustion tune needs B1+G2). Gates are admin-set; a tech cannot self-grant.

// Repair app, on every AI result — before guiding any repair or clearing any stop function canPerform(action, tech) { if (action.requiresGate == null) return ALLOW; // baseline / measurement if (action.requiresGate == "EPA2") return ALLOW; // all techs hold Type II if (tech.gates.includes(action.requiresGate)) return ALLOW; return BLOCK_ESCALATE; // preserve work-so-far, route to escalation } function canClearConditional(stop, tech) { return tech.conditionalOverride && stop.clearedBy.gates.every(g => tech.gates.includes(g)); }

10Safety enforcement in app logic First pass

Three mechanisms, all enforced in the Repair app — never left to the model.

Inline caution — advisory text within a step; never blocks.
Credential gate — when a repair step needs a gate the tech lacks: block the hands-on action, preserve the diagnosis, escalate. Measurement is never gated.
Hard stop — halts the flow for danger / out-of-scope and routes to a defined action (shut down, red-tag, advise, escalate).

Trigger	Type	Behavior
Gas odor / suspected leak	Universal	Stop, ventilate, shut gas, escalate — no override
Active CO alarm	Universal	Stop, ventilate, shut down appliance, escalate
Standing water + energized equipment	Universal	Kill power safely if possible, stop, escalate
Suspected cracked heat exchanger	Conditional	Red-tag + escalate for baseline/mid; `G2` + override may proceed after acknowledgment
Readings don't reconcile / low confidence	Conditional	Stop and escalate rather than guess; senior/master with override may continue

Universal stops apply to everyone regardless of credential. Conditional stops halt baseline/mid techs, but a tech with the matching gate and the conditionalOverride flag may proceed after an on-screen acknowledgment that is stamped to the job with the Tech ID.

The Repair-vs-Replace Advisor is subordinate to safety. The advisor (Stage 2 §B8) runs only after any hard stop has been actioned and never overrides one. A cracked heat exchanger is red-tagged and the unit shut down whether or not a replacement is sold; the replace conversation happens after the safety action, never instead of it.

11Security & privacy

Keys server-side only. RentCast & Anthropic keys live as Vercel environment variables — never in any app or repo.
Optional access gate. An APP_TOKEN env var requires every request to send a matching token, blocking casual use of the endpoint.
CORS. Currently open so the apps can reach the function; tighten to your app domains once stable. With the split, each tool app has its own origin — CORS can be scoped per app.
Spend limits set in both the RentCast and Anthropic dashboards as a hard backstop.
Third-party data handling. During a lookup, the address goes to RentCast and Anthropic, and nameplate/step photos go to Anthropic, processed under their own terms. The tool deliberately does not look up occupant personal information.
Customer-app boundary. The separate customer build never loads cost, margin, inventory, or credential logic; it reaches the system only through the backend.

Roles & accounts Planned

Three roles, office-assigned. Sensitive values are filtered at the source for the role — in production the backend never sends cost/margin to a salesperson's session, so hiding isn't just a UI toggle. (In the on-device prototype, roles are set locally for testing and the app gates the view; true enforcement arrives with the central store + login.)

Can see / do	Salesperson	Sales manager	Office / admin
Sell price & net out-of-pocket	✓	✓	✓
Discount only to the minimum (margin floor)	✓ enforced	✓ sets it	✓
Exact system cost & margin	✗	✓	✓
Own commission & payout	✓ (private screen)	✓	✓
Other reps' deals / payouts	✗	✓ team	✓ all
Configure price book, financing, rebates, users	✗	partial	✓

Auth. Prototype: email + PIN (email identifies the user/org; a short PIN unlocks on a shared truck tablet). Production: a hosted auth provider with email/password or org SSO (Google/Microsoft) as the real identity, plus the per-user PIN for fast field switching — not a hand-rolled auth system.
Role assignment. Office/admin only — a salesperson can't promote themselves (same principle as the Repair credential model). Prototype keeps roles device-local for testing.
Salesperson's private screen. A rep-only view the customer never sees, showing this deal's incentive and the rep's own payout — never other reps' or org-wide numbers. A quick toggle returns to the customer-facing view so it's never shown by accident.

12Storage & persistence

The core storage abstraction uses the platform store when present (Claude preview) and falls back to the browser's localStorage on the hosted site, so surveys and trouble calls persist on the device across sessions. Data is per-device and per-browser — not synced to a central server. Photo thumbnails (~200px) are persisted within each record; full-resolution images are held in memory only for the session and sent once for the AI read, never saved.

Data sharing (central store) Planned

For a manager to see a rep's deals, records must live beyond the device — designed to share without heavy transfer:

Records are light. A survey is compact JSON (text + thumbnail/photo references), cheap to sync. The weight is photos, handled separately.
Photos to object storage, once. Full-res images upload to object storage and are referenced by URL; only the ~200px thumbnails travel with the record, and full images load on demand.
Incremental, scoped sync. Devices stay offline-first and sync only new/changed records (deltas) when connected — scoped by role, so a rep pulls only their own deals, a manager the team, office everything. No bulk transfer.

13Cross-tool integration (planned) Planned

With apps split, the planned Stage 3 links become explicit contracts rather than in-process calls. Each is a read across a boundary, mediated so isolation holds.

Pricing → Sales. In the prototype, Sales prices the job with the estimate action (AI/web, all-in). In production, the same call is served by the Pricing tool — price-book cost + company labor + markup → sell price — with cost/margin hidden from the salesperson (§11). The contract stays the same, so wiring Pricing is a drop-in; Sales degrades gracefully (shows the rec without prices) if pricing is unavailable.
Inventory → Tech & Customer. Repair reads part availability when a part is needed; the customer app may read availability for scheduling. Read-only, by SKU.
Pricing ↔ Inventory. Linked by SKU as one source of truth.
Repair → Sales (Repair-vs-Replace Advisor). At a diagnosis with a repair estimate, the Repair app evaluates repair cost vs. equipment age / remaining useful life (Stage 2 spec §B8); on a crossover it offers a one-tap handoff that opens the Sales app pre-filled with the captured equipment and diagnosis as context. Repair computes the recommendation and passes context; it does not embed Sales logic. Runs after and never overrides a safety hard stop.

Design rule for links. A cross-tool read must never be a hard dependency that breaks the consumer — if Pricing is down, Sales still runs. This preserves the fault isolation the split exists to provide.

14Migration path & open decisions

From single file to split

Step 1 — Extract core. Pull the design system, storage abstraction, backend client, and nameplate read out of the current single file into the versioned core library.
Step 2 — Carve the Repair app first. It carries the safety-critical logic and benefits most from isolation; splitting it first protects hard stops from regression soonest.
Step 3 — Carve Sales, Pricing, Inventory as separate deployables on core.
Step 4 — Thin shell/launcher routes to each, keeping one front door for the field.
Step 5 — Customer app stays a separate build throughout (already mandated).

Open decisions

Central database + accounts. Per-device storage caps cross-device history and true cost-hiding (which needs roles). When do the internal tools justify a central DB? Planned
Offline diagnosis. Diagnosis needs the model; define a degraded capture-and-reconcile mode for no-signal. Planned
Curated diagnostic trees. Curate the high-frequency calls (no-heat, no-cool, no-hot-water); model handles the long tail under the same guardrails. Planned
Backend domain-split. Only if the single function becomes a contention point (§05).

15June 2026 additions — electrification & identification

These extend the existing module boundaries; none break the device-local, stateless-relay model. Reasons are recorded so future maintainers understand intent.

New backend action contracts (stateless relay)

Action	In → Out	Notes
`parsebill`	`{images[]}` → `{serviceAddress, bills[]}`	Vision read of electric/gas bills (two utilities). Each bill: fuelType, provider, month, usage, unit, cost, pricePerUnit, peakKw, avgTempF, degreeDays. No web tool.
`identify`	`{images[]}` → `{units[], systemSummary}`	Vision + web_search. Each unit carries a coarse `unit` kind (furnace/ac/heat_pump/rtu/boiler/evaporative_cooler/other) plus an additive finer `subtype` (heat_pump_outdoor/indoor, minisplit_outdoor/head, ac_condenser, evaporator_coil, rtu, zoning, water_heater, thermostat, line_set) + per-kind specs + ageEstimate. Evaluator assembles by `unit` (ignores subtype — backward-compatible); Sidekick maps `subtype` → component type. De-duplicates repeat photos.

Both follow the existing relay contract: no state stored server-side, images passed through to the model and discarded. nameplate (single unit) is retained for per-system reads.

Data-model deltas

utility.gasService ∈ {confirmed, none, unknown} — field-confirmed; never inferred from the map. Gates dual-fuel and the gas-service rebate.
energy.cooling ∈ {central, window, swamp, none} — drives the added-AC value flag.
energy.bills.months[] entries gain fuelType, provider, unit, pricePerUnit, peakKw, avgTempF (fuel-tagged, multi-utility).
System object gains sys.ac = {brand, model, serial} for the AC condenser of a two-unit furnace_ac, plus sys.acPhotos, sys.typeAuto, and (ductless) sys.zones.
Equipment type ductless added across TYPE_NAMES/TYPE_OPTIONS/TIER_SPECS/TIER_COST_BASE/EQUIP_DEFAULTS; existing-equipment type evap added.
Sidekick (Repair) components carry a typed type (13-item taxonomy) with per-type spec fields (COMPONENT_TYPES / COMP_SPEC); job-type templates pre-load typed components; a batch identify auto-types components via mapIdentifyToCompType().

Engine routing & gating (shared core)

recommendedType() → ductless for electric baseboard, and for electric/propane/oil when ductwork = "none".
Ductless pricing is head-based: ductlessTierCost(idx, zones); zones default ~1/500 sq ft, editable via App.setZones.
rebatesFromTons(tonsByType, gasService) — no-gas swaps the cold-climate $/heating-ton line for a conservative electric-utility line; ductless tons count as cold-climate.
computeGasCheck() infers gas presence from fuel-tagged bills + fuel (flag, not authority).
effectiveFuelPrices() prefers the customer's parsed bill rates over office regional rates.
assembleSystems(units) pairs furnace+AC into one system, appends rather than overwrites, and reuses applyUnitFields() (the shared field-mapping core split out of applyNameplate).

Boundary note: all of the above lives in the Sales module and shared core; the customer-app security boundary and the Repair safety-enforcement rules are unchanged. The richer per-home dataset (bills, specs, multi-unit identities) is the same data Sidekick (Repair) will consume.

Field Engineer — Architecture & Design (software definition). Canonical reference for the modular-split build; supersedes the architecture fragments in the Technical Reference and Stage 2 spec. Update alongside future changes.

Customer-app security boundary (Concierge Stage-1)

The public support page (support.html) shares the one serverless backend but crosses a trust boundary. The backend now splits actions: a PUBLIC_ACTIONS set (triage, support_request) bypasses the APP_TOKEN gate so unauthenticated customers can use them, while every technician/office action still requires the token. Public actions enforce payload caps (description length, image count) and return only customer-safe content; the partsHint and any internal data are emailed to the shop, never returned to the public page in a way that exposes operations. triage performs emergency detection and high-level causes only (no repair steps); support_request sends via a transactional email API (Resend) keyed by server-side env vars the customer never sees, with a client-side mail/SMS/call fallback. Known limitation: an unauthenticated AI endpoint is cost-bearing and abusable — effective rate-limiting needs a stateful layer (edge WAF or KV) that the device-local model lacks; this is the first concrete driver for the office backend.

Architecture& Design

Contents