Who this is for: Founders and operators of AI-enabled products or services who are deciding, or re-deciding, how to structure their pricing model. This applies whether you are pre-launch, post-launch but struggling with margins, or considering a pricing migration.
The problem
Most AI founders default to flat-rate pricing because it is familiar: one price, predictable MRR, easy to market. But flat-rate pricing on AI products has a structural flaw that traditional SaaS never had. Your costs scale with usage, not with seats. A single heavy user on a $299/month flat plan can consume enough inference to generate negative gross profit. At the other extreme, pure pay-as-you-go pricing solves the margin problem but creates bill shock anxiety and unpredictable MRR.
The model you choose has cascading effects on gross margin, customer behavior, MRR predictability, and long-term valuation. There is no universal right answer here. The right answer depends on your specific cost structure, customer behavior, and business goals. This decision tree is designed to surface the conditions that favor each model, not to tell you which one to pick.
The Decision Tree
Work through each branch in order. Where the tree gives a result, that result is a starting hypothesis, not a final answer. Every branch deserves its own unit-economics test.
Branch 1: How predictable is your per-customer inference cost?
Ask: If you took your 10 highest-usage customers last month and your 10 lowest-usage customers, how wide is the cost gap?
- If the gap is narrow (less than 3x between high and low): Your usage is relatively uniform. Flat-rate or hybrid-with-floor are both viable. Continue to Branch 2.
- If the gap is wide (3x to 10x between high and low): Heavy users can destroy flat-rate margins. The case for usage-based or hybrid models with hard caps grows. Continue to Branch 2 with this constraint in mind.
- If the gap is extreme (10x+ between high and low): Pure flat-rate is a known margin risk. The question is whether caps or overages can contain it. Continue to Branch 2, but note that uncapped flat-rate is probably untenable.
Branch 2: How much does your customer value cost predictability?
Ask: If your customers received a variable bill each month, would that cause friction, churn anxiety, or support burden?
- If predictability matters a great deal to your buyer (e.g., SMB operators on tight budgets, buyers who use their credit card once and never think about it again): A flat monthly fee, even with usage caps, is likely easier to sell and retain. PAYG increases cognitive overhead and can trigger churn when a high-bill month arrives unexpectedly.
- If your buyer is sophisticated and already operates on consumption-based models (e.g., B2B SaaS buyers, enterprise operators familiar with AWS-style billing): Usage-based or hybrid may feel natural and even preferred. Sophisticated buyers often prefer usage-based because they can scale down during slow periods without negotiating a refund.
Neither answer rules out hybrid. Hybrid (base platform fee + included usage + overage) is used by approximately 92% of AI companies because it addresses both constraints simultaneously. Treat that adoption rate as evidence the model is viable, not as a reason to default to it. Hybrid earns its place only when your situation genuinely carries both constraints, and it deserves the same unit-economics test as any other branch.
Branch 3: What is the dominant driver of value for your customer?
Ask: Does your customer value access to your product, or outcomes delivered by your product?
- If access is the value (the customer wants the capability available, even when not heavily used): Flat-rate pricing aligns with this. The value is availability, not consumption volume. A per-usage fee here may feel punitive for low-usage months even though the customer still valued having the tool available.
- If outcomes are the value (the customer cares about jobs booked, calls handled, tickets resolved): Usage-based or outcome-based pricing can align your revenue with the value delivered. The risk: you absorb inference costs as COGS, and gross margins compress to 40 to 60% at small scale. The upside: expansion revenue is automatic as customers scale.
- If both matter: Hybrid model territory. The floor captures the "availability" value; the usage component captures outcome-based expansion.
Branch 4: Free tier, reverse trial, or no free tier?
This branch applies regardless of which pricing model you landed on above. The "free" question is separate from the flat/PAYG/hybrid question. It is about acquisition and activation, not ongoing monetization.
Ask: What is the primary reason a prospect would hesitate to buy without a free tier?
- If the hesitation is "I don't believe the product will work for me" (skepticism): A reverse trial gives the full product free for a limited time (typically 7 to 14 days), then requires a paid decision. The prospect experiences the full product before committing.
- If the hesitation is "I need to test the technical integration before committing" (technical evaluation): A free tier with meaningful but limited functionality gives developers and technical buyers a low-friction entry point. The risk: a permanent free tier creates support overhead and a subset of users who never convert and never churn.
- If the hesitation is primarily price-based: A free tier likely addresses the wrong objection. Consider a lower-priced entry tier or a pay-monthly option instead.
- If there is no significant hesitation (warm inbound leads, referral-driven pipeline, or high-intent buyers): No free tier is a viable default. A free tier adds operational cost and dilutes your support bandwidth at zero revenue.
Whether a free tier increases or decreases overall LTV for your specific product depends on activation rates, support costs, and conversion rates that are specific to your business. This is a genuinely unresolved trade-off. Test it with a cohort before committing.
How to apply it
- Run each branch against your actual cost data. The cost-gap question in Branch 1 requires looking at real inference logs, not estimates. If you do not have this data yet, your first step is instrumentation.
- Build a per-customer P&L model before finalizing your model. Revenue minus inference minus support equals true gross profit per customer. A single 10x heavy user on an uncapped flat plan can generate negative gross profit of -$59/month. That math changes the Branch 1 answer immediately.
- Test your hypothesis with a cohort. Before migrating all customers to a new pricing model, price new customers under the new model for 60 to 90 days. Measure activation rate, churn rate, and gross margin per customer.
- Cap usage even under flat-rate. If you land on flat-rate, adding "AI credits" or a fair-use cap prevents the heavy-user destruction scenario. Frame it as a product feature, not a restriction.
The one decision
This tool will route you to a hypothesis. The actual decision that matters is: will you test the model against real customer data before committing to a full pricing migration?
A pricing model chosen without per-customer margin data is a guess. A pricing model tested against a real cohort is evidence. The tree gets you to a better starting hypothesis. The experiment gets you to the answer.
When to use: Before choosing or changing your pricing model. Fill in the brackets with your actual numbers. The output gives you a model recommendation grounded in your specific usage data.
When to use: After you've decided on a model and need to set specific price points. Paste your real cost data for pricing that protects your margins.
When to use: When you have existing pricing data and want to model the impact of a price change. The output helps you see whether raising or lowering price improves total economics, not just conversion.