Public Roadmap

What we're building and
what's coming next.

Planned and candidate items are being prioritized with Enterprise Design Partners — they are not yet shipped. Active engineering status is tracked in GitHub Issues.

Apply for Enterprise Design Partner access View features

Roadmap items are not shipped features. Items labelled Planned or Candidate do not yet exist in the product. Only items labelled Shipped, Beta Hardening, or In Progress represent work that is live or actively in progress. Prioritization changes based on Enterprise Design Partner feedback.

Now

Now — Beta Hardening

Core capabilities shipped or actively being hardened for general beta availability.

OpenAI-compatible routing gateway

Beta Hardening

Drop-in proxy endpoint compatible with every OpenAI and Anthropic SDK. Streaming, tools, vision, and function calling work identically.

Routing confidence, cost and savings telemetry

Beta Hardening

Per-request telemetry: selected route, confidence score, token cost, and calculated saving versus the default model.

Audit diagnostics and no-mock launch readiness

Beta Hardening

Platform audit diagnostics with no mock data, real integration proof, and launch-ready evidence packs.

Public developer docs and SDK

Beta Hardening

Developer documentation site, TypeScript/JavaScript SDK, and public changelog.

Notification inbox and preferences

Beta Hardening

In-product notification centre with user-controlled alert preferences.

Enterprise Design Partner application flow

In Progress

Reviewed enterprise access application with security, procurement, and SSO requirements captured before access is considered.

Next — Competitive Parity

High-priority features required to match and exceed the capabilities of specialist AI gateway and observability tools.

Semantic prompt caching and reuse advisor

Planned

Provider-agnostic semantic cache that detects equivalent prompts and advises on caching opportunities across providers.

Provider-aware fallbacks and smart retries

Planned

Health-driven routing with automatic failover, smart retry policies, and configurable fallback chains.

Budget alerts, hard caps and approvals

Planned

Connect budgets to real enforcement: hard spend caps, alert thresholds, and human-in-the-loop approval workflows for high-cost actions.

OpenTelemetry GenAI semantic-convention exports

Planned

Interoperable GenAI traces and metrics following the OpenTelemetry semantic conventions for AI/LLM workloads.

Datadog, New Relic, and SIEM export pathways

Planned

Native integration layers for Datadog, New Relic, Splunk, Elastic, and generic SIEM/JSONL export streams.

SSO/SAML 2.0, RBAC and SCIM hardening

Planned

Harden SSO/SAML, role-based access control, and SCIM 2.0 user lifecycle management for enterprise deployments.

Chargeback schedules and procurement reports

Planned

Finance-facing monthly PDF/CSV chargeback reports with team/project cost allocation and procurement evidence.

Prompt and version evaluation with regression gates

Planned

Treat prompts and routing configurations as versioned, testable release assets with LLM-as-judge regression gating.

Later

Later — Strategic Expansion

Differentiated capabilities planned to strengthen ModelSpend across AI spend control, routing intelligence, and enterprise governance.

Policy-as-code routing controls

Candidate

Declare routing, budget, and access policy in YAML. Deploy via CI/CD with full version history and rollback.

Benchmark-backed route recommendations

Candidate

Data-driven routing recommendations calibrated to your workload type using quality, latency, and cost benchmark evidence.

Live A/B routing experiments with guardrails

Candidate

Controlled production experiments comparing model routes on real traffic, with automatic quality and cost guardrails.

Multi-agent workflow graph economics

Candidate

Attribute cost and quality to individual steps inside agentic workflow graphs for workflow-level ROI analysis.

Marketplace of route policies and evaluation packs

Candidate

Community-contributed and verified routing policies, evaluation datasets, and workload-specific configurations.

Executive ROI board pack exports

Candidate

Board-ready PDF exports: spend trends, risk summary, compliance status, team adoption, and cost efficiency recommendations.

Enterprise Design Partners

Shape what gets built next

Enterprise Design Partners get early access to planned features, direct engineering input, and influence over prioritization. Apply if you need advanced enterprise capabilities that are not yet available.

Apply for Enterprise Design Partner access Start free beta

Built to route across leading model providers

OpenAI
Anthropic
Google Gemini
Azure OpenAI
AWS Bedrock
Mistral
Cohere
OpenRouter

Start controlling AI spend today.

Free beta access. No credit card required. First call in under 4 minutes.

Start free beta Request enterprise evaluation

What we're building andwhat's coming next.

Now — Beta Hardening

OpenAI-compatible routing gateway

Routing confidence, cost and savings telemetry

Audit diagnostics and no-mock launch readiness

Public developer docs and SDK

Notification inbox and preferences

Enterprise Design Partner application flow

Next — Competitive Parity

Semantic prompt caching and reuse advisor

Provider-aware fallbacks and smart retries

Budget alerts, hard caps and approvals

OpenTelemetry GenAI semantic-convention exports

Datadog, New Relic, and SIEM export pathways

SSO/SAML 2.0, RBAC and SCIM hardening

Chargeback schedules and procurement reports

Prompt and version evaluation with regression gates

Later — Strategic Expansion

Policy-as-code routing controls

Benchmark-backed route recommendations

Live A/B routing experiments with guardrails

Multi-agent workflow graph economics

Marketplace of route policies and evaluation packs

Executive ROI board pack exports

Shape what gets built next

Start controlling AI spend today.

What we're building and
what's coming next.