Weave Router: #1 Ranked Prompt Router In the World

The Prompt Router that doubles your token runway.

Weave Router routes each prompt to the best quality-per-token model, right inside Codex, Claude, and Cursor.

Book a Demo

Get Started for Free

> 
> 

> 
> 

Trusted by leading engineering teams

Scale-Ups

Startups

Enterprise

Scale-Ups

Startups

Enterprise

Weave Router Savings Calculator

Choose your coding harness, enter your model usage mix, and set monthly token spend to estimate savings from routed work.

Coding harnessClaude Code

Claude CodeTerminal agentCodexCode review and edits

Model usage mix100% assigned

Fable 5Anthropic frontier coding%

Opus 4.8Anthropic deep coding%

Sonnet 4.5Anthropic balanced coding%

Haiku 4.5Anthropic fast coding%

Model usage is fully allocated.

Monthly token spendcurrent baseline$

The estimate uses $12,000 as your current monthly token spend.

$7,920estimated monthly savings

Claude Code + Fable 5

$95,042 annualized at 66% lower monthly spend

ConfigurationMonthly costSpend

Current Fable 5

$12,000

With Weave Router

$4,080

Model breakdown

selected mix vs projected router mix

Selected model mix100% assigned

Fable 5100%

Weave routed model mix69% of requests routed

z-ai/glm-5.238%

deepseek/deepseek-v4-pro23%

deepseek/deepseek-v4-flash15%

gemini-3.1-pro-preview14%

minimax/minimax-m310%

Breakdown is request share. The Weave bar is normalized to routed traffic; the header shows how much of total traffic reaches the router model pool.

Spend summary

Current spend$12,000monthly baseline

With Weave Router$4,080projected monthly spend

Estimated savings$7,92066% lower than current

Estimates exclude platform fees and use editable routing assumptions in the component source.

How to get started

The procedure below is exhaustive. It assumes node ≥ 18 on a Unix-like shell and at least one supported client (Claude Code, Cursor, or Codex) installed in PATH.

01.
Get your router key
Generate a bearer token. The token never leaves your device unless you choose to export it.
wv_•••• •••• a91f
02.
Run one command
The installer detects your clients and points them at the router. One env var per provider, written automatically. Anthropic, OpenAI, and Google.
$npx -y @workweave/router
03.
Use Claude Code or Cursor as usual
Each request is now routed to the model with the best quality-per-token. The client is unaware of the proxy.
✓routed via weave

Frequently asked questions

The routing layer adds only low single-digit milliseconds of overhead on top of the upstream call, since prompts are embedded with a small in-process ONNX model and scored against frozen cluster centroids.

Weave Router works across Anthropic, OpenAI, Google Gemini, and OpenRouter for all open-source models

On the managed service, you get a single unified usage-based balance in credits against your Weave account, with no juggling of separate Anthropic, OpenAI, OpenRouter, and Gemini invoices, while self-hosted deployments use your own provider keys and bill directly with each provider.

Weave is a zero-retention proxy by default. Prompts and completions are not stored on our infrastructure, only routing metadata (classification label, chosen model, latency, cost). SOC 2 Type II is available on request under NDA.

Model selection is cache aware so we only switch if it's worth the cost/loss of cache!

Source code available. Start taking your token budget twice as far.

Get started in 5 minutes or book a demo with our team.

❯ npx @workweave/router

Works withCursorClaude Code

Open source

❯ npx @workweave/router

Works withCursorClaude Code

Open source

❯ npx @workweave/router

Works withCursorClaude Code

Open source

The engineering intelligence platform for the AI era.

Trusted by engineering teams from seed stage to Fortune 500

Book a Demo

Get Started for Free

The Prompt Router that doubles your token runway.

Trusted by leading engineering teams

Trusted by leading engineering teams

Trusted by leading engineering teams

Weave Router Savings Calculator

Model breakdown

Spend summary

How to get started

Get your router key

Run one command

Use Claude Code or Cursor as usual

Frequently asked questions

Source code available. Start taking your token budget twice as far.

The engineering intelligence platform for the AI era.