every tool. every model. one router.

Get the right answer.
Pay the right price.

The tools you already use, sending every prompt through Flux to the model that does it best. Frontier quality, intelligently allocated. One key. One decision: send.

1,284,821prompts routed · last 24h
47%of direct list, on average
40+models, one endpoint
>_Codex
Claude
Claude
Cursor
Cursor
aider
opencode
opencode
Hermes Agent
Hermes
OpenClaw
OpenClaw
Wayland
Gemini
Gemini
many more
Claude
Claude
GPTGPT
Gemini
Gemini
DeepSeek
DeepSeek
Kimi
Kimi
Grok
Grok
Qwen
Qwen
Meta
Llama
Mistral
Mistral
Zhipu
GLM
Cohere
Cohere
+35 more
1,420,239
prompts routed · 24h
40+
frontier models
1 line
to switch your stack
0
lock-in · cancel anytime
Your prompts. Your rules.

The router that reduces your exposure.

Routing sounds like your data goes to more places. It's the opposite. One scrubbed gateway, one policy, one contract, and a guarantee you can't get a worse answer than going direct.

01

One gateway is safer than five

Go direct to five providers and your data lives in five places, under five privacy policies, behind five breach surfaces. Flux is one scrubbed gateway. Prompts are PII-stripped before they ever leave, and nothing you send trains a model. Ever.

PII-scrubbednever trained onnot stored raw
02

A smart default, never a cage

Pin any model in one line. Every answer shows which model wrote it. Auto is a default you overrule on any call. We do the boring benchmarking so you don't have to, and you keep the wheel.

pin any modelsee who answeredoverride per call
03

Same answer or better. Never worse.

A junior task doesn't need a senior model. But when the job needs frontier, it gets frontier. If a pick ever misses your bar, Flux escalates to a stronger model at our cost, not yours. Cost-effective stops being a risk and becomes a free win.

quality floorescalate on usno worse than direct
04

Run it on your own keys

Bring your own provider keys and Flux routes through your own accounts, under your own contracts, never sitting in your data path. You hold the keys; we just pick the model.

bring your own keysyour data pathyour contracts

Drop-in OpenAI or Anthropic API · one line in, one line out · cancel anytime · your keys, your contracts.

Meet Flux Auto

Four lanes. One decision: send.

Not every job needs your best model. Auto reads each prompt, sizes the job, and sends it down the right lane. When the work needs frontier, it gets frontier. Pin a lane any time, or let Auto drive.

Flux Auto sits on top
reads the job · picks the lane
Fasthigh-volume · simple

Classify, extract, tag, short chat. The work that just needs to be right and efficient, at scale.

classify 5k emails
pull fields from JSON
tag support tickets
Standardeveryday work

Draft, code, summarize, refactor. The daily-driver lane for most of what you ship.

refactor a module
summarize a contract
write the email
Reasoningthe hard stuff

Architecture, proofs, gnarly debugging, strategy. When the prompt earns frontier, it gets frontier.

reason through a proof
design the system
untangle the bug
Imagedraft → final

Iterate fast, then render the final on a frontier generator. You set the bar; your eye is the judge.

sketch concepts fast
render the hero on a frontier model
one key for both
Pay the lane Auto used. Pennies for junior work, frontier rates only when the prompt earns it. A vetted bench of specialists backs every lane.
See it decide →
Routing intelligence

Stop picking models. Start shipping.

Every prompt is a different job. Flux reads each one and hands it to the model that does it best, live, on every request. This is that decision, happening now.

refactor a functionsummarize a contractclassify 5k emailsreason through a prooftranslate these docs
routed to
Routing tapelive
CODE DeepSeek · specialist nailed it−41%
SUMMARIZE Kimi · long-context win−63%
CLASSIFY Mistral · fast + precise at bulk−28%
REASON Claude · frontier reasoning−52%
EXTRACT Qwen · structured-output fit−71%
CHAT GPT · warm + quick−35%
TRANSLATE Qwen · multilingual edge−47%
Who's winning whatnow
CodeDeepSeek
SummarizeKimi
ReasonClaude
ClassifyMistral
ExtractQwen
ChatGPT
The bench

Every specialist, always on.

Forty-plus frontier models stay warm and online. You bring the prompt; Flux fields the one that wins it. New models join the bench the day they land.

Claude
Claude
reasoning · code
GPT
GPT
general · chat
Gemini
Gemini
long-context
DeepSeek
DeepSeek
code · depth
Kimi
Kimi
huge context
Qwen
Qwen
multilingual
Meta
Llama
fast · open
Mistral
Mistral
fast · structured
Zhipu
GLM
tool use
Grok
Grok
reasoning
Cohere
Command
RAG
+32
models on the bench
The best model changes every week.
Your stack shouldn't.

Integrate FluxRouter once. New models join the bench the day they ship, you get them automatically, and you never migrate again.

The smart default

Frontier power, used with judgment.

You wouldn't put your principal engineer on data entry. Flux doesn't spend frontier prices on junior work. It reserves the heavy models for the prompts that earn them, and routes the rest to specialists that nail the job. Same answer. Sharper allocation.

~47% of direct list
what teams pay on average, because the right model usually isn't the priciest one.
Summarize a 40-page PDFlong-context
Classify 10k ticketsfast specialist
Architect a migrationfrontier
Refactor an auth modulecode specialist
Reason through a prooffrontier
It already speaks your tools

Your whole toolbelt, through one router.

Any tool that talks an OpenAI or Anthropic endpoint plugs straight in. Point it at one base URL and Flux sits in the middle, picking the best model for every call it makes.

>_CodexOpenAI CLI
ClaudeClaude Codeagent CLI
CursorCursoreditor
aideraiderpair-programmer
opencodeopencodeCLI
ContinueContinueIDE
FluxRouter
the hub
GeminiGemini CLIOpenAI-compatible
KimiKimi Codeagent CLI
Hermes AgentHermes Agentautonomous
OpenClawOpenClawagent
ClineClineIDE agent
Waylandours · desktop agent

If it speaks an OpenAI or Anthropic endpoint, it speaks Flux. Official logos shown in the live build.

FluxRouter Desktop

Route your whole machine, not just your code.

A menu-bar app that finds every AI tool you already run and connects it to FluxRouter in one click. Claude Code, Codex, Cursor, aider and the rest. Fully reversible: one button puts it all back.

Get Flux Desktop →macOS · Windows · Linux · free with any account
Flux Desktop · detected on this Maclive
Claude Codedetected
Codexdetected
Cursordetected
aiderdetected
opencodedetected
0 of 5 routed↺ reversible
“But is it as good as going direct?”
It is going direct, to whichever provider wins your prompt.
Same models. Same answers. One key, one bill, and the right one chosen every time.
Pricing

Pay for routing, not seats.

Top up a balance and send. The Fast lane from $0.20 / M input tokens. Standard from $1.50. Reasoning from $4.00. You only pay Reasoning rates when the prompt earns it.

Free
$0
Kick the tires. One key, every model, real routing, capped.
Start free
Most popular
Builder
$29/mo
For people shipping daily. Higher limits, full routing, analytics.
Start Builder
Scale
$199/mo
Production volume, priority routing, team usage and controls.
Start Scale
Pay as you go
Usage
No subscription. Top up a balance and pay only for what you route.
Top up
Same answer or better. Never worse. If a pick ever misses your bar, Flux escalates to a stronger model at our cost, not yours.

Get the right answer.
Pay the right price.

Every tool you use, every model worth using, one key. Start free in under a minute.