AI Usage

Cost, latency, tokens and errors across the ModelGateway.

Calls
47
Total cost
$0.041333
Tokens
9,996
Avg latency
2303 ms
Errors
0

Model comparison

ModelCallsCostTokensAvg latencyError rate
anthropic/claude-haiku-4-517$0.0074974,7971892 ms0%
openai/text-embedding-3-small11$0.0000011321329 ms0%
anthropic/claude-sonnet-4-611$0.013533,2343149 ms0%
anthropic/claude-opus-4-88$0.0203051,8333353 ms0%

Cost per task

TaskCallsCostTokensAvg latency
response_generation8$0.0203051,8333353 ms
human_handoff_detection8$0.0098942,2263486 ms
response_validation5$0.0040582,3302016 ms
rag_relevance_ranking3$0.0036361,0082252 ms
intent_classification8$0.0026642,0482259 ms
rag_query_rewrite4$0.0007754191005 ms
embeddings11$0.0000011321329 ms

Cost per tenant

TenantCallsCostTokensAvg latency
00000000…40$0.0413339,9582458 ms
a3fb472b…7$0.0000381421 ms

Top conversations by cost

ConversationCallsCostTokens
33333333…12$0.0138433,625
22$0.0109562,667
11111111…6$0.0073511,847
44444444…4$0.0058051,219
55555555…3$0.003378638

Errors by model

ModelErrorsLast error
No errors in this window 🎉