Roadmap items are not shipped features. Items labelled Planned or Candidate do not yet exist in the product. Only items labelled Shipped, Beta Hardening, or In Progress represent work that is live or actively in progress. Prioritisation changes based on Enterprise Design Partner feedback.
Now — Beta Hardening
Core capabilities shipped or actively being hardened for general beta availability.
OpenAI-compatible routing gateway
Beta HardeningDrop-in proxy endpoint compatible with every OpenAI and Anthropic SDK. Streaming, tools, vision, and function calling work identically.
Routing confidence, cost and savings telemetry
Beta HardeningPer-request telemetry: selected route, confidence score, token cost, and calculated saving versus the default model.
Audit diagnostics and no-mock launch readiness
Beta HardeningPlatform audit diagnostics with no mock data, real integration proof, and launch-ready evidence packs.
Public developer docs and SDK
Beta HardeningDeveloper documentation site, TypeScript/JavaScript SDK, and public changelog.
Notification inbox and preferences
Beta HardeningIn-product notification centre with user-controlled alert preferences.
Enterprise Design Partner application flow
In ProgressReviewed enterprise access application with security, procurement, and SSO requirements captured before access is considered.
Next — Competitive Parity
High-priority features required to match and exceed the capabilities of specialist AI gateway and observability tools.
Semantic prompt caching and reuse advisor
PlannedProvider-agnostic semantic cache that detects equivalent prompts and advises on caching opportunities across providers.
Provider-aware fallbacks and smart retries
PlannedHealth-driven routing with automatic failover, smart retry policies, and configurable fallback chains.
Budget alerts, hard caps and approvals
PlannedConnect budgets to real enforcement: hard spend caps, alert thresholds, and human-in-the-loop approval workflows for high-cost actions.
OpenTelemetry GenAI semantic-convention exports
PlannedInteroperable GenAI traces and metrics following the OpenTelemetry semantic conventions for AI/LLM workloads.
Datadog, New Relic, and SIEM export pathways
PlannedNative integration layers for Datadog, New Relic, Splunk, Elastic, and generic SIEM/JSONL export streams.
SSO/SAML 2.0, RBAC and SCIM hardening
PlannedHarden SSO/SAML, role-based access control, and SCIM 2.0 user lifecycle management for enterprise deployments.
Chargeback schedules and procurement reports
PlannedFinance-facing monthly PDF/CSV chargeback reports with team/project cost allocation and procurement evidence.
Prompt and version evaluation with regression gates
PlannedTreat prompts and routing configurations as versioned, testable release assets with LLM-as-judge regression gating.
Later — Market Dominator
Differentiated capabilities that will make ModelSpend the strongest global platform for AI spend control, routing intelligence, and enterprise governance.
Policy-as-code routing controls
CandidateDeclare routing, budget, and access policy in YAML. Deploy via CI/CD with full version history and rollback.
Benchmark-backed route recommendations
CandidateData-driven routing recommendations calibrated to your workload type using quality, latency, and cost benchmark evidence.
Live A/B routing experiments with guardrails
CandidateControlled production experiments comparing model routes on real traffic, with automatic quality and cost guardrails.
Privacy and DLP redaction before provider dispatch
CandidateEnforce data loss prevention and PII redaction policies before any prompt leaves the ModelSpend control plane.
Anomaly detection and shadow-key discovery
CandidateDetect unusual usage patterns, unmanaged API keys, and potential abuse before they become incidents.
Multi-agent workflow graph economics
CandidateAttribute cost and quality to individual steps inside agentic workflow graphs for workflow-level ROI analysis.
AI procurement intelligence and provider arbitrage
CandidateCompare provider contract rates, identify arbitrage opportunities, and recommend reserved-capacity options.
Private deployment and VPC regional data residency
CandidateDocker Compose and Helm chart for air-gapped or VPC deployments with EU/US/APAC data residency options.
Marketplace of route policies and evaluation packs
CandidateCommunity-contributed and verified routing policies, evaluation datasets, and workload-specific configurations.
Executive ROI board pack exports
CandidateBoard-ready PDF exports: spend trends, risk summary, compliance status, team adoption, and cost efficiency recommendations.
Shape what gets built next
Enterprise Design Partners get early access to planned features, direct engineering input, and influence over prioritisation. Apply if you need governance, audit, procurement, or compliance capabilities that are not yet available.
Built to route across leading model providers
- OpenAI
- Anthropic
- Google Gemini
- Azure OpenAI
- AWS Bedrock
- Mistral
- Cohere
- OpenRouter