Production Routing Simulator

See what happens
before the model call.

Enter a prompt and ModelSpend will classify the workload, check routing logic and show which model path it would choose.

Note: Routing decisions use the production engine. The Answer Preview panel shows an illustrative response. Production routing also applies budget guardrails, DLP scanning, governance rules and provider availability checks.

Enter a prompt or try an example

Please enter a prompt before routing.

Try an example

Routing decision LIVE

Run the simulator to see the live routing result

Works inside your current stack

Your app

Send a prompt from your app

ModelSpend Routing Layer

Evaluates models, cost, latency, quality and policies

Optimal model or agent

Best outcome at the right cost

Response + Usage tracking

Real-time response and analytics

Policy & region aware

Cost & latency optimised

Full observability

Plug in any provider

Live routing engine · Production adds budget, governance, DLP and provider availability checks

Integrate this routing in your app

API Docs →

 # OpenAI-compatible proxy request curl https://api.modelspend.best/proxy/v1/chat/completions \
  -H "Authorization: Bearer $MODELSPEND_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "auto",
    "messages": [{"role":"user","content":"Summarise this quarterly report…"}]
  }' 
 # npm install @modelspend/sdk import { ModelSpend } from '@modelspend/sdk';
const client = new ModelSpend({
  apiKey: process.env.MODELSPEND_API_KEY,
});
const response = await client.chat.completions.create({
  model: 'auto',
  messages: [{ role: 'user', content: 'Summarise this quarterly report…' }],
  metadata: { featureId: 'report-summary', environment: 'production' },
});

console.log(response.modelspend?.routedTo);
console.log(response.modelspend?.costUsd);
 const res = await fetch('https://api.modelspend.best/proxy/v1/chat/completions', {
  method: 'POST',
  headers: {
    Authorization: `Bearer ${process.env.MODELSPEND_API_KEY}`,
'Content-Type': 'application/json',
  },
  body: JSON.stringify({
    model: 'auto',
    messages: [{ role: 'user', content: 'Summarise this quarterly report…' }],
  }),
});

SDK & tooling

REST API

OpenAI-compatible

TypeScript SDK

npm install @modelspend/sdk

Webhooks

Real-time events

Python SDK

Planned after TypeScript

View all docs →

See what happensbefore the model call.

See what happens
before the model call.