For the complete documentation index, see llms.txt. This page is also available as Markdown.

Changelog

2.14.0

  • AI Insights detects two new classes of wastemax_tokens cutoffs (so responses are being truncated mid-thought while you pay for incomplete answers), and subscribers or agents whose AI spend has gone largely dormant. Each finding includes a dollar-quantified savings estimate and example transactions to verify it.

  • AI Insights is now available through the MCP server — AI agents talking to Revenium via MCP can now run and read AI Insights recommendations as a first-class tool action.

  • Azure Cost Management integration — connect your Azure subscription via a Service Principal to pull daily cost data into Revenium, with spend attributed to the specific model that drove it (the same per-model breakdown you already get for OpenAI and other providers). Supports JSON-paste for credentials and maps Azure billing errors to actionable guidance.

  • GitHub Copilot support — track your organization's Copilot usage in Revenium: connect with the Revenium CLI (revenium-copilot) and see adoption and usage on the new Copilot dashboard, alongside your other AI coding assistants.

  • AI Assistant Pricing Mode — a per-team setting (Management > Teams > Team AI Settings) that controls whether AI assistant usage (Claude Code, Claude Cowork, Cursor IDE, Gemini CLI, Codex CLI, GitHub Copilot) counts as real API spend or stays as a subscription/seat-based estimate on AI Assistant dashboards, so your cost reports reflect what you actually pay.

  • Metering retries are now safe out of the box — the Idempotency-Key header on metering endpoints is now enabled, and the Node, Python, Go, and MCP SDKs generate the key automatically. Retried submissions within a 24-hour window are deduplicated server-side.

  • Webhook verification helpers in every SDK — verify HMAC-signed webhooks from Revenium with a single SDK call in Python, Node, or Go. Pairs with the server-side signing and rotation shipped last release.

  • Revenium MCP server now auto-generates Idempotency-Key on metering — completes the SDK side of the metering-idempotency rollout for AI agents that submit usage via MCP.

  • Top Movers API now ranks by absolute dollar change — previously sorted by percent change, which buried high-dollar moves behind tiny-customer fluctuations.

  • Python SDK fix for AWS Bedrock — cache token counts on the Bedrock Anthropic path were previously not included.

  • Cross-SDK terminology cleanuporganizationName and productName are now the canonical field names across every SDK, middleware, and MCP surface. The deprecated organizationId/productId aliases have been dropped from wire emission in the CLI, MCP, Node and Go middleware, n8n UI, and ingest paths.

2.13.0

  • Cursor cost tracking is now zero-setup — paste your Cursor team API key into provider settings and Revenium pulls usage every five minutes directly from Cursor's API.

  • Added OpenAI Codex CLI dashboard — full adoption, cost, and efficiency reporting for Codex CLI on par with the Cursor and Gemini dashboards, with Codex Desktop, App, and CLI unified as a single provider.

  • AI alert emails now use distinct subject prefixes — AI Anomaly Alert: when an alert fires and AI Anomaly Resolved: when it resolves — so inbox filters and rules can route them separately.

  • Added HMAC signing for outbound webhooks — verify that webhook payloads truly came from Revenium using a per-webhook signing secret, with safe rotation and a one-click test-event button.

  • Added Idempotency-Key support on metering — retry failed metering calls safely with no risk of double-counted usage; identical retries within a 24-hour window are deduplicated server-side.

  • Added standard X-RateLimit-* response headers — Revenium's API now returns predictable per-tenant request budgets with X-RateLimit-* and Retry-After headers and a documented /integrations/rate-limits reference, so customers can program against the limits (This is in addition to the existing 4xx-error-pattern protection shipped in v2.12.0.)

  • Added a coding-assistant pricing mode in Team settings — choose whether coding-assistant traffic is paid for using API rates or seat-based pricing.

  • Improved AI Insights progress feedback — determinate progress card with elapsed time, navbar spinner across all routes, sharper empty states, and a period selector on the page.

  • Improved Cost Controls notifications and UI — fixed spend-display refresh, hard-limit display, mode column and filter parity, Slack link path, deduplicated breach notifications, and clearer "Mode" toggle states.

  • Improved subscription analytics — user retention cohort analysis, CSV export per segment, pagination, and an unmapped-GitHub-users filter.

  • Improved Webhooks discoverability — moved into the Alerts tab where customers already manage notification channels.

  • Improved Python SDK with a built-in agentic outcomes API, cost-controls + circuit breaker integration, and renamed ReveniumCostLimitExceeded to BudgetExceededError for clearer error handling.

  • Updated Node middleware with separate metering and enforcement circuit breakers so an enforcement issue cannot stop metering.

  • Updated Go middleware to derive cost-control scope from filter dimensions.

  • Fixed traces page chart-axis, label, and date-picker alignment on cost, performance, and efficiency views.

  • Fixed advanced search filters silently falling back to the default date range on the Logs page.

2.12.0

  • Added AI Insights — a new analytics surface that runs your usage data through an AI Recommendations Engine and returns prioritized, actionable findings with potential monthly savings, severity, and suggested actions.

  • Added Cost Controls — define Budget Rules with dimensional filters, Slack and webhook notification channels, and a per-rule event log. Runtime enforcement is now available across Revenium's SDK library, including Node, Go, and Python, so spend limits are enforced at the call site.

  • Enhanced AI Coding Adoption analytics — scatter plot grouping users by output and engagement, segment tabs and badges across the Value-by-User table, derived metrics columns (Cost per PR, Adoption rate, Tokens per Session) and session efficiency metrics.

  • Added Claude PR attribution via Co-Authored-By — Revenium now detects Claude-assisted pull requests, attributes them to the correct user and team, and exposes a sortable PRs-with-Claude column on the Adoption tab.

  • Added provider credit tracking for AWS Bedrock and GCP Vertex — provider-issued credits are captured, surfaced per-provider on the Provider cost dashboard, and applied in billing analytics so reported spend reflects what you actually pay

  • Added Revvy Onboarding Agent & CLI — a new agent-driven instrumentation assistant that helps you onboard your application to Revenium. It includes an AI callsite search, code integration guidance, GitHub Actions template, and machine-readable call-site manifest for AI agents.

  • Updated MCP Server with improved error handling and additional API coverage

  • Improved team-switch reliability in the web application

  • Improved multimodal alert metric guidance in the Create Alert flow

  • Fixed edge case bugs in resolved-alert notifications that were silently failing to deliver to Slack and email

2.11.0

  • Added Claude Cowork Dashboard with per-user value reporting, cost by model, token and cache utilization breakdowns, and a one-line OTLP configuration from the Claude Desktop admin UI that instruments your entire company's Cowork usage with zero per-machine setup

  • Added GitHub integration for AI Coding Dashboard team attribution — map Claude Code usage to teams via PR authorship to directly correlate Claude Code usage to the outcomes that matter

  • Added scoped API key permissions — create read-only, metering-only, or full-access keys per integration; legacy keys continue to work unchanged

  • Added environment filtering across Logs and Traces — filter by dev, staging, prod, etc.

  • Added a unified Go metering SDK with first-class support for Anthropic (streaming + vision), Google (chat, embeddings, image, video), fal.ai (Run/Subscribe/Stream), LiteLLM (with SSE streaming), and the Responses API

  • Added clickable ROI cards — click any box on the ROI page to drill into detailed job filtering

  • Updated MCP Server to FastMCP 3.x, Tool Registry management with pricing tiers, Jobs & Outcomes management, and expanded analytics wiring for cost-by-user and tool costs

  • Improved Pricing sidebar organization by splitting Cost Sources and Revenue Sources into dedicated sections

2.10.0

  • Added redesigned navigation with Costs & Revenue Attribution promoted to first-level tabs, sticky headers, and new date picker across dashboards

  • Added global search command palette (Cmd+K / Ctrl+K) for instant navigation to any page or entity

  • Added search and value range filtering to Jobs API, with user-supplied job identifiers as primary keys

  • Added default organization preference — set your preferred organization to auto-load on login

  • Added per-million-token pricing display across AI Models pages with sub-cent tooltips

  • Added starred dashboards and revamped My Dashboards from card grid to sortable data table

  • Added organization tree view with full hierarchy and pagination for large org structures

  • Added coding assistant toggle to Logs page for filtering AI coding tool metrics

  • Improved performance of AI Assistants API endpoints

  • Updated MCP Server to FastMCP 3.x, added support for AI jobs, outcomes, and tool cost management

  • Updated Go middleware with centralized core architecture including circuit breaker, retry, and tool event metering

2.9.0

  • Added ROI Dashboard with conversion funnels, revenue vs cost analysis, job transactions, and per-job ROI tracking

  • Added Cursor AI Coding Dashboard with usage reporting & subscription management

  • Added Revenium Coverage Ratio showing what percentage of provider spending is metered by Revenium

  • Added real-time sync status and freshness indicators to the Provider Dashboard

  • Added URL-based date filter persistence across dashboards, enabling bookmarking and sharing

  • Added Jobs API support to the Revenium metering SDKs

  • Added fal.ai middleware wrapper for Node.js and Go

  • Added LiteLLM Go middleware for LiteLLM proxy integration

  • Added multimodal metering for Imagen, Veo, and Gemini Vision to the Google Python middleware

  • Added LangChain SDK

  • Updated MCP Server with higher-performance analytics endpoints and expanded test coverage

  • Updated middleware packages across 5 providers with canonical model name resolution

  • Improved AI Provider billing sync reliability — historical syncs no longer time out on large datasets

2.8.0

  • Added native OpenTelemetry GenAI semantic conventions support, enabling direct ingestion from LangChain, OpenLLMetry, Traceloop, and any OTel-instrumented AI application

  • Added gRPC OTLP endpoint (port 4317) for zero-configuration integration with standard OpenTelemetry exporters

  • Added share-to-Slack and share-to-email for charts and dashboards

  • Added Tool Registry Logs tab for reviewing tool event history

  • Added budget-specific creation flow from the Budgets dashboard

  • Added multimodal pricing column and provider/modality filters to AI Models pricing page

  • Added custom provider support when adding new AI models

2.7.0

  • Added Tool Registry Analytics Dashboard with 11 charts covering tool cost distribution, agent breakdown, and cost iceberg views

  • Added full multimodal pricing: audio token pricing for GPT-4o Realtime/Transcribe, bidirectional audio cost tracking, DALL-E 3 size-variant pricing refinements, and HD TTS pricing deduplication

  • Added VIDEO_COUNT, AUDIO_COUNT, duration, characters, and credits as first-class metrics in the alert system and chart builder

  • Added Custom Dashboards with modality grouping and subscriber/user breakdowns for improved query performance

  • Added provider dashboard drill-through — integration status cards are now clickable with provider-specific detail

  • Added configurable concurrency for AI metrics processing

  • Added subscriber/user grouping in chart builder

  • Improved line item processing latency with direct async event-based processing

  • Improved billing calculation performance for high-volume accounts

  • Improved AI metric alert monitoring performance

  • Improved element aggregation performance

  • Fixed a type error in VIDEO_COUNT/AUDIO_COUNT alert evaluation

  • Fixed customer cost query grouping and sync logs operation filter

  • Fixed bar chart rendering artifacts, donut chart sizing, and y-axis label formatting

2.6.0

  • Added multimodal image pricing with DALL-E 3 size-variant cost attribution for accurate per-call billing

  • Added pixel-based cost calculation for image generation models

  • Added FOCUS v1.0 compliant cost export, enabling direct integration with FinOps platforms

  • Added customer product profitability charts to the pre-defined dashboard library

  • Added session count metric to the AI traces analytics endpoint

  • Added unified delete endpoint for AI metrics

  • Improved metering throughput

  • Improved alert monitoring performance

  • Fixed bug in custom pricing support for AI completion metrics

  • Fixed dashboard editing controls

  • Fixed Google SSO token initialization race condition

  • Relabeled "Sessions/User" to "Requests/User" on AI Coding dashboard for accuracy

2.5.0

  • Added AI Coding Dashboard supporting multiple coding assistants (Claude Code, Gemini CLI) with organization-level views

  • Added Connections page for unified API credential and provider integration management

  • Added Custom Dashboards with improved analytics performance

  • Added role-based prompt viewing permissions with admin controls

  • Added Relative Change Alerts with task type filtering for monitoring usage trends

  • Added OpenAI audio cost tracking for Whisper (STT) and text-to-speech services

  • Added multimodal pricing sync for FAL and Runway providers

  • Improved credential input UX with structured form fields for AWS Bedrock, LiteLLM, and PiAPI

  • Improved query performance for trace and AI transaction endpoints

  • Fixed pagination navigation and advanced search parameter handling

  • Fixed SSO token handling stability

2.4.0

  • Added support for Claude Code usage metering

  • Added Claude Code Dashboard with cost analysis, token analytics, user attribution, and model comparison views

  • Added support in all middleware for sending prompt data to revenium for analysis (requires manual opt-in)

  • Added Prompt Viewer modal with message visualization, role-based coloring, and metrics context

  • Added webhook notification channel for alerts with retry logic and circuit breaker patterns

  • Added extended time ranges: 90 days, 6 months, 12 months, and custom date ranges for all analytics queries

  • Added non-token AI cost tracking for audio, image, and video AI services

  • Added Gemini CLI telemetry support

  • Added Fal.ai and Runway ML integrations via Go middleware

  • Added Perplexity Python SDK support

  • Added connection status indicators in UI to quickly provide data ingest status

  • Added multi-field search for Products API across name, owner email, and asset sources

  • Simplified OpenRouter credentials management

  • Improved AI transaction details modal with dedicated sections for cost, provider, timing, and tokens

  • Fixed duration display inconsistencies across all views

  • Fixed notification template logo display and email formatting

  • Fixed Slack OAuth redirect URI handling

  • Fixed chart flickering occurring in some situations

Last updated

Was this helpful?