every tool. every model. one router.

Get the right answer.
Pay the right price.

The tools you already use, sending every prompt through Flux to the model that does it best. Frontier quality, intelligently allocated. One key. One decision: send.

1,284,821prompts routed · last 24h

47%of direct list, on average

55+models, one endpoint

Start free →See it route

1,420,239

prompts routed · 24h

56+

frontier models

1 line

to switch your stack

lock-in · cancel anytime

Your prompts. Your rules.

The router that reduces your exposure.

Routing sounds like your data goes to more places. It's the opposite. One scrubbed gateway, one policy, one contract — and a double-check on demand for the answers that really matter.

One gateway is safer than five

Go direct to five providers and your data lives in five places, under five privacy policies, behind five breach surfaces. Flux is one scrubbed gateway. Prompts are PII-stripped before they ever leave, and nothing you send trains a model. Ever.

PII-scrubbednever trained onnot stored raw

A smart default, never a cage

Pin any model in one line. Every answer shows which model wrote it. Auto is a default you overrule on any call. We do the boring benchmarking so you don't have to, and you keep the wheel.

pin any modelsee who answeredoverride per call

Routed with judgment. Double-checked on demand.

A junior task doesn't need a senior model. But when the job needs frontier, it gets frontier. And when an answer really matters, flag the request with flux_verify and Flux re-checks it against stronger models — capped at $0.25, the rest of the climb is on us.

routing with a quality barflux_verify double-checkcapped at $0.25

Run it on your own keys

Bring your own provider keys and Flux routes through your own accounts, under your own contracts, never sitting in your data path. You hold the keys; we just pick the model.

bring your own keysyour data pathyour contracts

Drop-in OpenAI or Anthropic API · one line in, one line out · cancel anytime · your keys, your contracts.

Meet Flux Auto

Four lanes. One decision: send.

Not every job needs your best model. Auto reads each prompt, sizes the job, and sends it down the right lane. When the work needs frontier, it gets frontier. Pin a lane any time, or let Auto drive.

Flux Auto sits on top

reads the job · picks the lane

Fasthigh-volume · simple

Classify, extract, tag, short chat. The work that just needs to be right and efficient, at scale.

classify 5k emails
pull fields from JSON
tag support tickets

Standardeveryday work

Draft, code, summarize, refactor. The daily-driver lane for most of what you ship.

refactor a module
summarize a contract
write the email

Reasoningthe hard stuff

Architecture, proofs, gnarly debugging, strategy. When the prompt earns frontier, it gets frontier.

reason through a proof
design the system
untangle the bug

Imagedraft → final

Iterate fast, then render the final on a frontier generator. You set the bar; your eye is the judge.

sketch concepts fast
render the hero on a frontier model
one key for both

Pay the lane Auto used. Pennies for junior work, frontier rates only when the prompt earns it. A vetted bench of specialists backs every lane. And it goes past text: Flux Voice turns speech into text and Flux Web fetches any page as clean markdown, on the same key.

See it decide →

Routing intelligence

Stop picking models. Start shipping.

Every prompt is a different job. Flux reads each one and hands it to the model that does it best, live, on every request. This is that decision, happening now.

refactor a functionsummarize a contractclassify 5k emailsreason through a prooftranslate these docs

routed to

Routing tapelive

CODE→ DeepSeek · specialist nailed it−41%

SUMMARIZE→ Kimi · long-context win−63%

CLASSIFY→ Mistral · fast + precise at bulk−28%

REASON→ Claude · frontier reasoning−52%

EXTRACT→ Qwen · structured-output fit−71%

CHAT→ GPT · warm + quick−35%

TRANSLATE→ Qwen · multilingual edge−47%

Who's winning whatnow

CodeDeepSeek

SummarizeKimi

ReasonClaude

ClassifyMistral

ExtractQwen

ChatGPT

The bench

Every specialist, always on.

More than fifty frontier models stay warm and online. You bring the prompt; Flux fields the one that wins it. New models join the bench the day they land.

Claude

reasoning · code

GPT

general · chat

Gemini

long-context

DeepSeek

code · depth

Kimi

huge context

Qwen

multilingual

Llama

fast · open

Mistral

fast · structured

GLM

tool use

Grok

reasoning

Command

RAG

+32
models on the bench

The best model changes every week.
Your stack shouldn't.

Integrate FluxRouter once. New models join the bench the day they ship, you get them automatically, and you never migrate again.

The smart default

Frontier power, used with judgment.

You wouldn't put your principal engineer on data entry. Flux doesn't spend frontier prices on junior work. It reserves the heavy models for the prompts that earn them, and routes the rest to specialists that nail the job. Same answer. Sharper allocation.

~47% of direct list

what teams pay on average, because the right model usually isn't the priciest one.

Summarize a 40-page PDF→long-context

Classify 10k tickets→fast specialist

Architect a migration→frontier

Refactor an auth module→code specialist

Reason through a proof→frontier

It already speaks your tools

Your whole toolbelt, through one router.

Any tool that talks an OpenAI or Anthropic endpoint plugs straight in. Point it at one base URL and Flux sits in the middle, picking the best model for every call it makes. Agents connect the same way through the built-in MCP gateway.

>_CodexOpenAI CLI

Claude Codeagent CLI

Cursoreditor

aideraiderpair-programmer

opencodeCLI

ContinueIDE

FluxRouter

the hub

Gemini CLIOpenAI-compatible

Kimi Codeagent CLI

Hermes Agentautonomous

OpenClawagent

ClineIDE agent

Waylandours · desktop agent

If it speaks an OpenAI or Anthropic endpoint, it speaks Flux. Official logos shown in the live build.

FluxRouter Desktop

Route your whole machine, not just your code.

A menu-bar app that finds every AI tool you already run and connects it to FluxRouter in one click. Claude Code, Codex, Cursor, aider and the rest. Fully reversible: one button puts it all back.

Get Flux Desktop →macOS · Windows · Linux · free with any account

Flux Desktop · detected on this Maclive

Claude Codedetected

Codexdetected

Cursordetected

aiderdetected

opencodedetected

0 of 5 routed↺ reversible

“But is it as good as going direct?”

It is going direct, to whichever provider wins your prompt.

Same models. Same answers. One key, one bill, and the right one chosen every time.

Pricing

Pay for routing, not seats.

Top up a balance and send. Fast lane $1 in / $4 out per 1M tokens. Standard $2 / $8. Reasoning $4 / $15. You only pay Reasoning rates when the prompt earns it.

Free

Kick the tires. One key, every model, real routing, capped.

Start free