Pricing, Tokens & Usage Model
Product-facing reference for the paid token-based API experience. Designed to feel familiar to developers who have used commercial AI voice APIs.
Product Positioning
The standalone Voice API is the recommended commercial surface. Frontend developers should think of it as:
The recommended billing abstraction is a voice-token meter with these suggested metering units:
| Unit | Meaning | Good For |
|---|---|---|
| Input text tokens | Tokens derived from request text | Preview pricing and estimate UI |
| Generated audio seconds | Actual output duration | Settlement and final cost |
| Premium feature tokens | Extra charge for higher-cost modes | Future premium tiers |
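As a rough sketch of how the three metering units could roll up into a single voice-token figure, the TypeScript below estimates from input text before generation and settles on actual audio duration afterwards. The field names, the `MeterRates` shape, and the idea of converting audio seconds to tokens at a fixed rate are all assumptions made for illustration; the actual conversion is a pricing decision this reference does not specify.

```typescript
// Hypothetical shapes: names and rates are illustrative, not part of the Voice API contract.
interface UsageRecord {
  inputTextTokens: number;       // tokens derived from request text
  generatedAudioSeconds: number; // actual output duration
  premiumFeatureTokens: number;  // extra charge for higher-cost modes
}

interface MeterRates {
  tokensPerAudioSecond: number; // assumed conversion rate, set by your pricing
}

// Preflight estimate: only the input text and an expected duration are known.
function estimateTokens(
  inputTextTokens: number,
  expectedAudioSeconds: number,
  rates: MeterRates
): number {
  return Math.ceil(inputTextTokens + expectedAudioSeconds * rates.tokensPerAudioSecond);
}

// Settlement: uses the actual generated duration plus any premium charge, rounded up.
function settleTokens(usage: UsageRecord, rates: MeterRates): number {
  return Math.ceil(
    usage.inputTextTokens +
      usage.generatedAudioSeconds * rates.tokensPerAudioSecond +
      usage.premiumFeatureTokens
  );
}
```

Keeping the estimate and the settlement as separate functions makes it harder to treat a preview figure as the final charge by accident.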
Expose These Concepts in Product UI
/v1/usage/estimate
Suggested response fields for preflight estimates:
Post-Request Usage Summary
Recommended response metadata after generation:
Admin tooling should expose voice lifecycle controls alongside pricing. Recommended options:
The cleanest pattern is to expose usage through both headers and JSON body fields. Suggested response headers:
| Header | Description |
|---|---|
| X-Usage-Input-Tokens | Number of input tokens consumed |
| X-Usage-Output-Seconds | Duration of generated audio |
| X-Usage-Total-Tokens | Total tokens consumed |
| X-RateLimit-Limit | Rate limit ceiling for the window |
| X-RateLimit-Remaining | Remaining requests in current window |
| X-RateLimit-Reset | Timestamp when the rate limit resets |
These headers are part of the recommended commercial contract. They are not yet fully implemented across the current route surface, so clients should treat each one as optional.
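Because the headers may be absent on some routes, a client is safer reading them defensively. The following sketch parses the suggested headers from a plain header map and leaves missing or malformed values as `undefined`. The `UsageHeaders` shape and the function names are hypothetical, and `X-RateLimit-Reset` is kept as a raw string because its timestamp format is not specified here.

```typescript
// Hypothetical client-side shape; every field is optional because the
// headers are not guaranteed to be present on every route.
interface UsageHeaders {
  inputTokens?: number;
  outputSeconds?: number;
  totalTokens?: number;
  rateLimitLimit?: number;
  rateLimitRemaining?: number;
  rateLimitReset?: string; // raw value; timestamp format is an open question
}

// Returns a finite number, or undefined when the header is missing or malformed.
function readNumber(headers: Record<string, string>, name: string): number | undefined {
  const raw: string | undefined = headers[name.toLowerCase()];
  if (raw === undefined || raw.trim() === "") return undefined;
  const value = Number(raw);
  return isFinite(value) ? value : undefined;
}

function parseUsageHeaders(rawHeaders: Record<string, string>): UsageHeaders {
  // HTTP header names are case-insensitive, so normalise keys to lowercase first.
  const headers: Record<string, string> = {};
  for (const key of Object.keys(rawHeaders)) {
    const value: string | undefined = rawHeaders[key];
    if (value !== undefined) headers[key.toLowerCase()] = value;
  }
  return {
    inputTokens: readNumber(headers, "X-Usage-Input-Tokens"),
    outputSeconds: readNumber(headers, "X-Usage-Output-Seconds"),
    totalTokens: readNumber(headers, "X-Usage-Total-Tokens"),
    rateLimitLimit: readNumber(headers, "X-RateLimit-Limit"),
    rateLimitRemaining: readNumber(headers, "X-RateLimit-Remaining"),
    rateLimitReset: headers["x-ratelimit-reset"],
  };
}
```

When a header is missing, the JSON body fields can serve as the fallback source for the same values.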
Healthy
- Active plan
- Remaining tokens
- Average request cost
Low Balance
- Warning banner
- Estimated remaining preview count
- Upgrade or top-up CTA
Hard Stop
- Insufficient balance
- Blocked generate button
- Recharge explanation
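The three UI states above can be derived from a handful of numbers. The sketch below is one possible mapping, assuming the backend supplies the remaining balance and the cost of the next request; the state names, the `lowBalanceThreshold` parameter, and the comparison rules are illustrative choices rather than a documented contract.

```typescript
type BalanceState = "healthy" | "low_balance" | "hard_stop";

// Hard stop when the next request cannot be paid for, low balance when the
// workspace is under a caller-chosen threshold, healthy otherwise.
function classifyBalance(
  remainingTokens: number,
  nextRequestCost: number,
  lowBalanceThreshold: number // assumed to be a product setting
): BalanceState {
  if (remainingTokens < nextRequestCost) return "hard_stop";
  if (remainingTokens < lowBalanceThreshold) return "low_balance";
  return "healthy";
}

// Estimated remaining preview count for the low-balance banner.
function estimatedRemainingPreviews(remainingTokens: number, averageRequestCost: number): number {
  if (averageRequestCost <= 0 || remainingTokens <= 0) return 0;
  return Math.floor(remainingTokens / averageRequestCost);
}
```

A frontend could disable the generate button whenever `classifyBalance` returns `"hard_stop"`, while the backend still enforces the same check on its side.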
Example Product Copy
Preview Estimate
“This preview will use about 340 tokens and 5.8 seconds of generated audio.”
After Completion
“Preview completed. 327 billable tokens used. 12,404 tokens remaining.”
Low Balance
“Your workspace is running low on voice tokens. Add more credits to keep generating previews.”
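If the copy above is generated from live usage values, small formatting helpers keep the numbers consistent across the UI. This is a minimal sketch with hypothetical function names; it assumes US English number formatting and one decimal place for audio seconds, matching the sample strings.

```typescript
// Formats integers with thousands separators, e.g. 12404 becomes "12,404".
function formatCount(value: number): string {
  return value.toLocaleString("en-US");
}

function previewEstimateCopy(estimatedTokens: number, estimatedSeconds: number): string {
  return `This preview will use about ${formatCount(estimatedTokens)} tokens and ${estimatedSeconds.toFixed(1)} seconds of generated audio.`;
}

function completionCopy(billableTokens: number, remainingTokens: number): string {
  return `Preview completed. ${formatCount(billableTokens)} billable tokens used. ${formatCount(remainingTokens)} tokens remaining.`;
}
```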
| Tier | Price | Monthly Allowance | Per Request | Qualities | Intended User |
|---|---|---|---|---|---|
| Free | $0 / month | 1,000 characters / month | 500 characters per request | Standard | Prototyping and low-volume testing |
| Pro | $29 / month | 500,000 characters / month | 5,000 characters per request | Standard, Premium, Ultra | Production integration and commercial use |
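The tier limits in the table can be checked before a request is sent. The sketch below encodes the two tiers as data and returns a rejection reason, or `null` when the request fits. The type and function names are hypothetical, and counting characters with `text.length` (UTF-16 code units) is an assumption, since the table does not say how characters are counted.

```typescript
type Quality = "Standard" | "Premium" | "Ultra";
type TierName = "Free" | "Pro";

interface TierLimits {
  monthlyCharacters: number;
  charactersPerRequest: number;
  qualities: Quality[];
}

// Values copied from the tier table above.
const TIERS: Record<TierName, TierLimits> = {
  Free: { monthlyCharacters: 1000, charactersPerRequest: 500, qualities: ["Standard"] },
  Pro: { monthlyCharacters: 500000, charactersPerRequest: 5000, qualities: ["Standard", "Premium", "Ultra"] },
};

// Returns a human-readable rejection reason, or null if the request is allowed.
function checkRequest(
  tier: TierName,
  text: string,
  quality: Quality,
  charactersUsedThisMonth: number
): string | null {
  const limits = TIERS[tier];
  if (limits.qualities.indexOf(quality) === -1) {
    return `${quality} quality is not available on the ${tier} tier`;
  }
  if (text.length > limits.charactersPerRequest) {
    return `request exceeds ${limits.charactersPerRequest} characters`;
  }
  if (charactersUsedThisMonth + text.length > limits.monthlyCharacters) {
    return "monthly character allowance exceeded";
  }
  return null;
}
```

A check like this suits the estimate UI; the backend should still run the same validation, because it owns the billing truth.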
Best Practice
- Frontend asks your backend for current balance and estimate
- Backend owns billing truth and API key
- Backend proxies request to Voice API
- Backend records final usage after request completes
Avoid
- Storing balance truth only in frontend state
- Letting the browser call the metered API with a permanent secret
- Charging on estimate without reconciling final usage