The short version: Classic SaaS CAC payback math assumes 80% gross margin. AI products run 40 to 60% once inference is loaded in. That single assumption shift cuts your maximum allowable CAC roughly in half. Three worked scenarios below show how badly the math breaks and what to do tomorrow morning.
Most early-stage AI founders are budgeting paid media against a gross margin that no longer exists. They built the model in 2022, when the question was “can we ship this,” not “what does it cost to serve one customer for a year.” The inference bill has since landed, and the CAC payback math that looked airtight on the deck now quietly overpays for users by roughly a third every month.
In our AI Startup Growth Playbook cornerstone, Part 4 made the case that free-trial economics break under inference cost. This post goes after the math directly, because for most teams the spreadsheet they still trust is the one bleeding them.
What’s Wrong With Classic SaaS CAC Payback Math When You Apply It to AI?
The classic formula assumes gross profit is roughly 80% of revenue. For a pure software product that is right. For an AI product it is wrong by nearly half, because inference cost pulls real margin into the 40 to 60% range. That one unquestioned assumption sets your entire CAC ceiling.
According to research from a16z on the new business of AI, AI companies typically run gross margins 25 to 30 percentage points below classic SaaS. Foundation-layer companies land at 50 to 60 percent. Application-layer companies sit at 40 to 60 percent depending on caching, model routing, and how aggressively they offload expensive sessions to cheaper models. None of those numbers are 80.
Here is what the math looks like side by side.
| Variable | Classic SaaS Assumption | AI SaaS Reality |
|---|---|---|
| Gross margin | 78 to 82% | 40 to 60% |
| Monthly gross profit per $100 ARPU | $80 | $50 |
| Max CAC for 12-month payback at $100 ARPU | $960 | $600 |
| Max paid CPA (assuming 30% trial-to-paid) | $288 | $180 |
| Implied paid budget headroom shift | Baseline | Down 37% |
That last row is the one that gets founders fired. A 37 percent cut to your maximum allowable paid CPA is the gap between a program that pays back and one that drives full speed into a wall in month four.
The 2026 CAC Payback Model for AI SaaS
The corrected formula is not complicated. Take your real gross margin (not the optimistic version), multiply by ARPU to get monthly gross profit, then divide your maximum allowable CAC by that number to get payback in months. The discipline is in being honest about what goes into gross margin.
Honest gross margin for an AI SaaS product includes four cost lines, not just the API bill:
- Direct inference cost. Calls to OpenAI, Anthropic, your hosted model, plus any retrieval, embeddings, vector DB, or tool-use costs that fire on a user session.
- Eval and monitoring cost. The infrastructure that watches output quality, logs sessions, and runs regression tests on prompts costs real money, and it belongs in COGS.
- Engineering time tagged to model performance. The fraction of your engineering team’s time spent on prompt engineering, model fine-tuning, evals, and incident response when a model regresses. If it’s more than 15 percent of engineering payroll, load it in.
- Cost-to-serve overhead. Hosting, networking, customer support cost amortized per active user. Same as classic SaaS but worth re-baselining for AI workloads because they are bandwidth-heavy.
Add those four lines, divide by revenue, and you have honest COGS. Subtract from 1 to get honest gross margin. That number goes into your CAC payback formula, not the figure on the marketing deck. Skip the exercise and every dollar of paid media just scales the loss faster.
Three Worked Scenarios Showing the Math Break
Here are three real product shapes, each of which gets bitten by inference cost in a different spot. Names are anonymized to product type.
Scenario 1: AI Coding Assistant, $30 ARPU, High Inference Per Session
The product. A code-completion and refactor tool that runs on every developer keystroke. ARPU is $30 per month. Each active developer triggers 4,000 to 8,000 inference calls per workday against a frontier model. Loaded inference cost is roughly $14 per active monthly user. That is honest COGS of 47 percent, gross margin of 53 percent.
| Metric | Classic SaaS Math | Honest AI SaaS Math |
|---|---|---|
| ARPU | $30 | $30 |
| Gross margin | 80% | 53% |
| Monthly gross profit per customer | $24 | $15.90 |
| Max CAC for 12-month payback | $288 | $190 |
| Max paid CPA at 25% trial-to-paid | $72 | $47 |
Run Meta or LinkedIn ads against the $72 CPA target and you are paying 53 percent more than the math supports. At $50K of monthly spend, that overpayment quietly burns tens of thousands a quarter. Nobody on the dashboard flags it, because the dashboard was told $72 was fine.
Scenario 2: AI Writing Tool, $20 ARPU, Daily High-Volume Use
The product. A general-purpose AI writing assistant used daily by knowledge workers. ARPU is $20 per month. Each paying user runs roughly $7 of inference per month if you do nothing about model routing, $3 if you aggressively cache and downroute to cheaper models. Honest gross margin lands at 50 to 65 percent depending on routing discipline.
| Metric | Naive Setup (No Routing) | Disciplined Setup (Routing + Caching) |
|---|---|---|
| ARPU | $20 | $20 |
| Gross margin | 50% | 65% |
| Monthly gross profit per customer | $10 | $13 |
| Max CAC for 12-month payback | $120 | $156 |
| Max paid CPA at 20% trial-to-paid | $24 | $31 |
This is the scenario where engineering and growth need to share a spreadsheet. A 15-point gross margin improvement from disciplined model routing buys you 30 percent more paid budget headroom. That is what tips a product from “paid is not working” to “paid is the growth engine.” The CFO does not have to write you another check.
Scenario 3: AI Sales-Rep Tool, $300 ARPU, Business-Critical
The product. An AI sales development tool that drafts outreach, qualifies leads, and runs nurture sequences. ARPU is $300 per month per seat. Each seat consumes roughly $90 of inference if the tool is actively used (which the buyer expects, because they paid $300 for it). Honest gross margin lands at 70 percent, which beats the earlier scenarios but still falls short of the 80 the classic model assumes.
| Metric | Classic SaaS Math | Honest AI SaaS Math |
|---|---|---|
| ARPU | $300 | $300 |
| Gross margin | 80% | 70% |
| Monthly gross profit per customer | $240 | $210 |
| Max CAC for 12-month payback | $2,880 | $2,520 |
| Max paid CPA at 8% trial-to-paid | $230 | $201 |
Higher ARPU softens the blow. The math still shifts, but it does not collapse. AI products at $200+ ARPU can usually absorb the margin hit, provided inference cost is disciplined and trial-to-paid is tight. The carnage is at the bottom: sub-$50 ARPU products, where the unit economics turn vicious. Which happens to be exactly where most early-stage AI startups live.
What This Means for Your Paid-Media Budget
Three implications worth acting on this quarter.
- Recalculate your max paid CPA at honest margin. Most teams will discover their current CPA targets are 20 to 50 percent above what the math actually supports. The fix is not to cut paid; it is to redesign the offer (price, trial, model routing) so the math works.
- Shorten your attribution windows. Move from 14-day-click default to 1-day-click or 3-day-click for AI products. The longer windows over-credit ads on users who would have signed up organically, which hides the unit-economics problem behind apparent paid performance. We wrote about how to audit this end-to-end in the paid media program audit framework.
- Move success metrics from signup to trial-to-paid. Optimizing paid against signup CPA is a defense against bad creative; it is not a defense against bad unit economics. Once you have a baseline of paid-acquired users hitting your activation event, switch optimization signals to trial-to-paid conversion. The paid media with AI framework walks through what that looks like operationally.
The team that wins paid for an AI product is the one running the math against honest margin while everyone else still runs it against 80 percent. The competitor running the bad assumption reaches $2M ARR and stalls, while the team with the honest model gets there with payback under 12 months and budget left over to scale. Same product, different spreadsheet.
What AI Founders Should Do Tomorrow Morning
Three actions that take a focused day to execute and pay back inside a quarter.
- Pull a 30-day inference cost report broken out by paying user, not aggregated. Then sort by cost. The top decile usually shows you which users are unit-economics-destroying and which workflow is the culprit. That is the design constraint for your next trial redesign.
- Rebuild your CAC payback model with the four-line COGS structure above. Direct inference plus eval plus engineering time plus cost-to-serve. Be honest. The new max CAC number is what your paid program should be optimized against.
- Send the new max CAC number to whoever runs paid (in-house or agency) with a one-line directive: “Optimize against this number, not the old one, and tell me what changes.” If they cannot tell you what changes inside 48 hours, you have a paid-program problem that goes beyond the CAC question.
The inference bill is not going back down. The only variable you control is whether your CAC math knows about it. If you want help running the model honestly and rebuilding the paid program around the real number, our paid media service operationalizes this for early-stage AI companies. The AI Companies positioning page walks through what fit looks like.
Like this? Get the next one.
Short emails. New posts as they ship.