Usage-Based Pricing — SaaS Billing Atlas

Overview

Usage-based pricing charges customers based on actual consumption: API calls, tokens processed, compute hours, data stored, messages sent. The bill reflects what was used, not what was provisioned. It’s the fairest model for customers and the most complex model for vendors.

This is the pricing model of the AI era. When your costs scale with inference volume and your customer’s value scales with usage, per-seat and flat-rate pricing stop making sense. Stripe’s $1B acquisition of Metronome in December 2025 was a direct bet on this model becoming the default.

When It Works

API platforms and infrastructure. Twilio, AWS, Vercel, Supabase — products where the unit of value is a measurable event. The customer uses more, the customer pays more, and both sides understand why
AI/ML products. Inference costs are variable and significant. Charging per token, per generation, or per API call aligns your revenue with your cost structure
Variable workloads. If customer usage fluctuates 10x between months, flat pricing either overcharges quiet months or undercharges busy ones. Usage-based pricing adapts automatically
Land-and-expand without sales. Customers start with minimal usage and low cost. As they build more on your platform, revenue grows without a sales conversation. The product sells itself
Cost transparency builds trust. Customers see exactly what they’re paying for. No hidden subsidies, no “am I on the right plan?” anxiety

When It Breaks

Revenue becomes unpredictable. Your customer’s quiet month is your revenue miss. Financial forecasting requires historical usage patterns, cohort analysis, and probabilistic models — not a spreadsheet with MRR × months
Metering infrastructure is non-trivial. You need real-time event ingestion, accurate aggregation, and bulletproof invoicing. A metering bug is a billing bug, and billing bugs destroy trust. This is an engineering investment, not a weekend project
Customers fear bill shock. A runaway script, a traffic spike, or a misconfigured integration can generate an unexpectedly large bill. Without spend alerts, usage caps, and clear dashboards, customers will churn out of anxiety, not dissatisfaction
Small customers generate tiny revenue. A customer using $3/month of your API is real usage but minimal revenue. Without a minimum commitment or base fee, your long tail of small accounts costs more to support than it earns
Sales cycles get complicated. Enterprise buyers want predictable costs. Usage-based pricing requires committed-use agreements, prepaid credits, and volume discounts to work in sales-led motions. The “simple” consumption model becomes a contract negotiation

Real-World Patterns

Usage-based pricing dominates:

Cloud infrastructure — AWS, GCP, Azure (the original usage-based businesses)
API platforms — Twilio, SendGrid, Plaid, OpenAI
AI/ML services — Anthropic, Replicate, Fireworks AI, Replit
Dev infrastructure — Vercel, Supabase, PlanetScale, Neon

The most common hybrid: a base platform fee (for predictability) plus usage-based charges above included thresholds. This gives customers a predictable floor and vendors a revenue baseline, while still scaling with consumption. See hybrid pricing for more.

Implementation Notes

Choose your usage metric carefully. It must be something the customer understands, can monitor, and perceives as fair. “API calls” is clear. “Compute units” is opaque. The metric choice is a product decision, not just a billing decision
Build metering before billing. You need accurate, real-time usage tracking before you can bill on it. Retrofitting metering onto an existing product is one of the most underestimated engineering projects in SaaS
Provide real-time usage dashboards. Customers must be able to see their current usage and projected bill at any time. If they can’t, they don’t trust the model, and they leave
Implement spend alerts and caps. Let customers set budget thresholds and receive alerts. Optionally, offer hard caps that stop usage before overage. This is not optional — it’s table stakes for trust
Offer prepaid commitments for enterprise. “Buy $50K of credits at a 20% discount, use over 12 months” solves the enterprise predictability problem while keeping the usage-based model. This is how OpenAI, AWS, and most infrastructure companies handle large accounts
Start with hybrid if uncertain. A base fee + usage overage is easier to implement and easier for customers to understand than pure usage-based pricing. You can always remove the base fee later

Supported By

Usage-based pricing requires specialized billing infrastructure. Not all platforms handle it equally:

Orb — purpose-built for usage-based billing. Real-time event ingestion, flexible pricing models, revenue design tools. The strongest independent option for this model
Metronome — high-volume metering for AI and infrastructure companies. Now part of Stripe, with capabilities being integrated into Stripe Billing
Lago — open-source usage-based billing with event ingestion, metering, and self-hosting options. The best choice if you want full infrastructure control
Stripe — metered billing via Stripe Billing, with Metronome’s capabilities being integrated. Requires more custom engineering than Orb or Lago, but works within the broader Stripe ecosystem
Chargebee — supports metered billing and usage-based add-ons, but designed primarily for subscription billing. Usage-based is possible but not the core strength

Verdict

Usage-based pricing is the right model when your product’s value is measurable per-event and your cost structure scales with usage. It’s the native model for APIs, infrastructure, and AI products. But it demands real engineering investment — metering, dashboards, spend controls, and invoice accuracy are all prerequisites, not nice-to-haves. If you’re coming from flat-rate or per-seat pricing, consider hybrid as a stepping stone: base fee plus usage overage gives you consumption-aligned revenue without the full complexity of pure metering. The companies that do usage-based pricing well — Twilio, AWS, OpenAI — invest heavily in making usage visible and costs predictable. The model rewards transparency and punishes opacity.