Tier Design

In traditional SaaS, tiers are combinations of “features + seat count”. Agent product tiers are fundamentally different — the core dimension is model budget and task volume cap; feature differentiation is secondary.

Reason: agent product experience quality is directly tied to which model performs each inference — Haiku is slower / less accurate / lower completion rate on the same task than Opus. A tier is “how much token quota the user receives per month + which model runs by default”. Feature toggles (BUA, MCP, custom subagents) are surface differences.

Four design principles

1. Token caps are hard; dollar caps are soft

Rationale: see economics/controls-and-roi. Token is the stable unit at the product-semantic layer; dollar amounts drift with model pricing. Tier quotas should be specified in tokens; dollar conversion happens at internal accounting.

2. Tiers should jump 4-10× in budget

Jumps under 3× feel like “upgrade isn’t worth it”; jumps over 15× leave no intermediate tier between user segments, forcing all-or-nothing decisions. SaaS empirical sweet spot is 5-8×.

Tier	Typical monthly task range	Jump multiple
Free	10-50	—
Lite	50-300	6×
Pro	300-2,000	7×
Ultra	2,000-15,000	7×

3. Default model and tier — coupled or decoupled

Two approaches:

Approach A: tier locks the default model — Lite uses Haiku only, Pro defaults to Sonnet, Ultra defaults to Opus.

Pro: cost-controllable; prevents Lite users from running Opus tasks and exploding the bill
Con: users feel “constrained” — simple tasks still require Pro to access Sonnet

Approach B: tier sets budget, model is user-selectable — any tier can choose any model, but Opus tasks consume token quota faster.

Pro: strong user control; high-quality scenarios can opt up temporarily
Con: cost accounting is more complex; user education cost is high (need to explain “why did one Opus task consume 5% of my monthly quota”)

Industry trend is migration from A toward B — customers accept “consumption-based” more than “feature-gated”. But B requires the usage UI be sufficiently transparent (see the “user-layer usage visibility” in economics/controls-and-roi).

4. Overage behavior must be explicit

What the product does when users hit their tier cap is the most commonly overlooked yet retention-critical aspect of tier design. Three typical strategies:

Strategy	User experience	Business impact	Suitable scenario
Hard stop	”Monthly quota exhausted, resumes next month or upgrade now”	Stable cashflow; risk losing active users	Free / Lite anti-abuse; budget-sensitive SMBs
Soft degradation	Auto-switches to cheaper model and continues	Margin protection; users may not notice quality drop	Mid-usage Pro users
Auto pay-as-you-go	Overage billed per token in real-time, consolidated next bill	Customer surprise risk; complex finance	Enterprise contracts / signed hybrid

Worst design: no notification on overage, continues running unrestricted, month-end bill becomes 10× normal. This is the most common churn cause for usage-billed products.

Typical tier structure example

A reference structure for an SMB-targeted agent product:

┌─────────┬──────────────┬───────────────┬────────────────────────┐
│ Tier    │ Monthly fee   │ Task cap       │ Overage behavior        │
├─────────┼──────────────┼───────────────┼────────────────────────┤
│ Free    │ $0           │ 30 tasks       │ Hard stop               │
│ Lite    │ $19          │ 200 tasks      │ Soft degrade to Haiku   │
│ Pro     │ $79          │ 1,500 tasks    │ Overage $0.05 / task    │
│ Ultra   │ $299         │ 10,000 tasks   │ Overage $0.03 / task    │
│ Enterp. │ Custom       │ Custom         │ Contractual             │
└─────────┴──────────────┴───────────────┴────────────────────────┘

(Synthetic example.)

Notable details:

Overage unit price decreases at higher tiers: Ultra’s $0.03 is lower than Pro’s $0.05 — overage for large customers is expected behavior, not an anomaly; pricing should encourage rather than penalize
Enterprise tier shows no numbers: large customer contracts are always case-by-case; not publishing concrete Enterprise numbers preserves sales conversation flexibility
Free tier must exist but be strictly limited: 30 tasks/month allows trial + word-of-mouth without creating abuse pathway

Tier count

3-5 tiers is the empirical range:

Fewer than 3: customers feel “all-or-nothing”, narrow conversion funnel
More than 5: decision fatigue, customer choice paralysis, longer sales conversations
4 is the median recommendation: Free / Lite / Pro / Ultra covers most customer segments

Cross-language / cross-market may require different tier structures — China-region customers are more sensitive to low-priced tiers; enterprise markets expect Enterprise customization. Early stage does not need regional variation; revisit after market entry.

Cross-section connections

Cap mechanism implementation: economics/controls-and-roi
Unit economics foundation: metrics/unit-economics
Customer upgrade trigger operations: playbooks