Question 1

Does NotDiamond run the model for me?

Accepted Answer

No. NotDiamond recommends which model to call; by default you make the inference call yourself with your own provider keys, so it never sees your outputs. It also ships an optional, self-hostable OpenAI-compatible proxy, but even then the model runs on the underlying providers, not on NotDiamond. Inferbase routes and serves: the chosen model runs for you through one API, with no provider keys to wire up.

Question 2

Is NotDiamond’s routing more advanced than yours?

Accepted Answer

NotDiamond is an eval-trained preference router, and it lets you train a custom router on your own data and even your own fine-tuned models, which is a genuine strength if you have an evaluation harness. Inferbase’s routing is first-party and benchmark-grounded with an objective you set, and it comes with execution and a per-request audit trail. The honest framing is a tradeoff: a customizable routing brain you assemble, versus an end-to-end platform that routes and serves.

Question 3

Can I use both?

Accepted Answer

You can, but they overlap at the routing layer, so most teams pick one. Choose NotDiamond if you want a routing brain to drop into your own stack and keep your providers; choose Inferbase if you want routing and serving together, with one bill and one decision record per request.

Question 4

NotDiamond powers OpenRouter’s routing, right?

Accepted Answer

Yes. OpenRouter’s Auto Router is NotDiamond-powered, which is a fair signal that NotDiamond’s routing is good. Inferbase competes at that same routing layer directly, and adds the serving layer, so you get the decision and the inference from one place instead of assembling them.

Question 5

What about privacy, since NotDiamond never sees my data?

Accepted Answer

In its recommend-only mode, NotDiamond returns a model choice without seeing your outputs and keeps your keys client-side, which is a real benefit if you run everything yourself. Inferbase serves the model, so it processes the request; the tradeoff is that you get routing, execution, and a single audit trail in one place rather than stitching them together.

Question 6

How hard is it to switch?

Accepted Answer

Both offer OpenAI-compatible access. With Inferbase you point the base URL and key at us, set model="auto", and you are done, there are no provider keys to manage and no routing SDK to wire into your code.

	Inferbase	NotDiamond
What it does	Routes each request and serves the model	Recommends the best model per request
Inference execution	Included, managed serverless	Yours, you call the provider (or self-host their proxy)
Integration	One OpenAI-compatible API	SDK or OpenAI-compatible proxy; bring your provider keys
Per-request audit	One record: decision, model, tokens, cost, latency	Returns the chosen model and a session id; usage lives in your calls
Routing basis	First-party, benchmark-grounded	Eval-trained preference router
Customization	Curated catalog, plus your own models	Train your own router on your evals and fine-tuned models
Optimize for	Quality, cost, or latency	Quality by default, tunable cost and latency tradeoff
Fallback and reliability	Handled by the platform	Yours, unless you run their proxy
Pricing	Free to start	Free Early Access; per-million-token routing fee; Enterprise custom

Inferbase vs NotDiamond

Who runs the model

Recommends the model

Routes and serves

Deciding, or delivering

Side by side

Where each one fits

NotDiamond is the better fit when

Inferbase is the better fit when

Frequently asked questions

Start building with the right model.