AI Usage

Cost, latency, tokens and errors across the ModelGateway.

Calls
49
Total cost
$0.041333
Tokens
10,007
Avg latency
2261 ms
Errors
0

Model comparison

ModelCallsCostTokensAvg latencyError rate
anthropic/claude-haiku-4-517$0.0074974,7971892 ms0%
openai/text-embedding-3-small13$0.0000011431321 ms0%
anthropic/claude-sonnet-4-611$0.013533,2343149 ms0%
anthropic/claude-opus-4-88$0.0203051,8333353 ms0%

Cost per task

TaskCallsCostTokensAvg latency
response_generation8$0.0203051,8333353 ms
human_handoff_detection8$0.0098942,2263486 ms
response_validation5$0.0040582,3302016 ms
rag_relevance_ranking3$0.0036361,0082252 ms
intent_classification8$0.0026642,0482259 ms
rag_query_rewrite4$0.0007754191005 ms
embeddings13$0.0000011431321 ms

Cost per tenant

TenantCallsCostTokensAvg latency
00000000…40$0.0413339,9582458 ms
a3fb472b…9$0.0000491388 ms

Top conversations by cost

ConversationCallsCostTokens
33333333…12$0.0138433,625
24$0.0109562,678
11111111…6$0.0073511,847
44444444…4$0.0058051,219
55555555…3$0.003378638

Errors by model

ModelErrorsLast error
No errors in this window 🎉