Semantic Router
A semantic router is an embedding-based routing layer that classifies an incoming query into one of several downstream prompts, agents, tools, or models by computing similarity between the query embedding and a set of labeled reference utterances. Each route is defined by a small list of example queries; at runtime the router embeds the incoming query, finds the nearest route by cosine similarity, and dispatches to that route's handler. Compared to an LLM-based router, where a model reads the query and chooses a destination, a semantic router is fast (one embedding call instead of a full LLM call), deterministic, cheap enough to run on every request, and easy to debug, because the nearest-neighbor match is inspectable. The tradeoff is that it handles only the routing decisions it has example utterances for; genuinely novel intents need an LLM fallback.
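The mechanism can be sketched in a few lines. This is a minimal, self-contained illustration: the hashed bag-of-words `embed` function is a stand-in assumption so the example runs without an embedding model (a real router would call a sentence-embedding model or a hosted embeddings API), and the route names and utterances are invented.

```python
import math

DIM = 64  # toy embedding dimensionality

def embed(text: str) -> list[float]:
    # Stand-in embedding: hashed bag-of-words. Replace with a real
    # sentence-embedding model in production; the routing logic is unchanged.
    vec = [0.0] * DIM
    for token in text.lower().split():
        vec[hash(token) % DIM] += 1.0
    return vec

def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb) if na and nb else 0.0

# Each route is a name plus a small list of example utterances
# (kept tiny here; a real route list would be larger).
ROUTES = {
    "billing": [
        "why was I charged twice",
        "update my credit card",
        "refund my last invoice",
    ],
    "tech_support": [
        "the app crashes on startup",
        "I can't log in to my account",
        "error 500 when saving",
    ],
}

# Reference utterances are embedded once at startup, not per request.
ROUTE_EMBEDDINGS = {
    name: [embed(u) for u in utterances] for name, utterances in ROUTES.items()
}

def route(query: str) -> tuple[str, float]:
    """Return (best_route, best_similarity): the route whose nearest
    reference utterance has the highest cosine similarity to the query."""
    q = embed(query)
    best_name, best_score = "", -1.0
    for name, embeddings in ROUTE_EMBEDDINGS.items():
        score = max(cosine(q, e) for e in embeddings)
        if score > best_score:
            best_name, best_score = name, score
    return best_name, best_score
```

The returned similarity score is what makes the router debuggable: logging `(query, best_route, best_score)` shows exactly which reference utterance won and by how much.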
Example
A customer-service assistant needs to split requests across three downstream pipelines: billing, technical support, and general product questions. Instead of adding a router LLM call, the team defines each route with fifteen example queries and uses a semantic router. At runtime, embedding-based routing adds roughly 25ms of latency and costs a fraction of a cent per request; the LLM router it replaced added 400ms and was the single most expensive step in the pipeline. Routing accuracy holds at 94% on the eval set, with queries below a confidence threshold, the remaining 6%, falling back to an LLM classifier.
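The fallback described above amounts to a threshold check around the router. A minimal sketch, assuming a `route_fn` that returns a `(route_name, similarity)` pair and an `llm_classify` function that classifies via an LLM call; the 0.75 cutoff is an illustrative value that would in practice be tuned on the eval set.

```python
THRESHOLD = 0.75  # illustrative cutoff; tune on an eval set

def dispatch(query: str, route_fn, llm_classify) -> tuple[str, str]:
    """Route via nearest-neighbor similarity when confident, otherwise
    fall back to an LLM classifier. Returns (route_name, path_taken)
    so the fallback rate can be monitored."""
    name, score = route_fn(query)
    if score >= THRESHOLD:
        return name, "semantic"
    return llm_classify(query), "llm_fallback"
```

Returning which path was taken makes it easy to track the semantic-vs-fallback split in production and decide when a route needs more example utterances.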