Optimize your API spending without sacrificing quality. Budget controls, monitoring, and proven cost-saving techniques.
Use mythic-4-mini for simple tasks. It's 10x cheaper than mythic-4 and handles most tasks well.
Cache system prompts and static content. Pay once to cache, reuse at a 50% discount.
Use Batch API for non-urgent requests. Get 50% off in exchange for 24-hour turnaround.
Summarize long documents, remove unnecessary context, and be concise in prompts.
Set max_tokens to the minimum needed. Ask for concise responses in your prompt.
Track usage by endpoint and user. Identify wasteful patterns and optimize.
| Model | Input (1M tokens) | Output (1M tokens) | Best For |
|---|---|---|---|
mythic-4-mini |
$0.15 | $0.60 | Simple tasks, high volume |
mythic-4-turbo |
$2.00 | $8.00 | Speed-critical apps |
mythic-4 |
$5.00 | $15.00 | Balanced quality/cost |
mythic-4-reasoning |
$15.00 | $60.00 | Complex reasoning |
Set hard spending caps per organization or project
Get notified when approaching budget thresholds
Control spending at the individual API key level
# Set monthly budget limit via API
client.organization.budgets.create(
amount=500, # $500 monthly limit
currency="USD",
alert_threshold=0.8, # Alert at 80%
action="notify" # or "block" to stop requests
)
Check your current spending and set budget controls.
Go to Usage Dashboard →