Question 1

What is Agentic RAG?

Accepted Answer

Agentic RAG is a pattern where retrieval is treated as a tool call inside an agent loop rather than as a fixed first step in a linear pipeline.

Question 2

How does Agentic RAG work?

Accepted Answer

A traditional RAG pipeline always retrieves once, then generates; an agentic RAG system lets the model decide whether to retrieve at all, what query to issue, when to re-retrieve after reading the first batch of results, and when it has enough to answer.

Question 3

Can you give an example of Agentic RAG?

Accepted Answer

A customer-support assistant gets the question "why did my invoice total change between last month and this one?". A linear RAG pipeline retrieves once on the raw question and returns generic billing-policy docs. The agentic version instead issues three interleaved retrievals — pricing tiers, the account's recent subscription changes, and the specific invoice line items — and only then drafts an answer that reconciles the three sources. Average latency rises from 1.2s to 2.8s; answer accuracy on multi-hop billing questions rises from illustrative 0.62 to 0.84.

Agentic RAG

Example

Frequently asked questions

What is Agentic RAG?

How does Agentic RAG work?

Can you give an example of Agentic RAG?

Related Terms

Related Resources

Agentic RAG: A Walkthrough of Retrieval as a Tool Call