Tag your calls once. The dashboard breaks down spend across every axis automatically.
The SDK wraps your existing LLM client. Your API calls go directly to the provider — no proxy, no extra hop. After the call resolves, the SDK reads token counts and metadata from the response object and posts the event asynchronously. Zero latency added. Zero new failure modes.
contractclues.com was a RAG-based contract reference tool for film/TV production. The Anthropic bill kept climbing. Without per-call attribution, there was no way to know why.
Ten minutes after adding cost tracking, the answer was obvious: one search was pulling 18,500 input tokens against another's 3,700. Same search feature, 5× the cost. A context window issue that would have been invisible forever without per-call data.
Find your variance →One SDK. Automatic cost calculation for every model, every provider.
Unknown models are still logged with token counts — cost shows as unknown.