Contextual Retrieval
Contextual Retrieval is a technique introduced by Anthropic in 2024 that prepends a short, chunk-specific context summary to each chunk before the chunk is embedded and added to the BM25 index. The summary, generated once per chunk at index time by a lightweight LLM call, describes what the chunk is about in the frame of its parent document, so a chunk like "it was launched in Q3" becomes "This chunk from the 2024 Acme Product Report discusses the Widget Pro; it was launched in Q3". Both the embedding vector and the BM25 tokens then carry document-level context that is otherwise lost when chunking splits a document into short segments. Anthropic reported a 35% reduction in top-20 retrieval failure rate from contextual embeddings alone, rising to 49% when combined with contextual BM25, at the cost of one extra LLM call per chunk during indexing and no extra cost at query time.
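The indexing flow above can be sketched as follows. This is a minimal illustration, not Anthropic's implementation: `generate_context` stands in for the real LLM call (here it is a trivial stub that reads the document title), and the field names are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class IndexedChunk:
    text: str            # original chunk text, returned to the generator
    contextualized: str  # context + chunk, fed to the embedder and BM25

def generate_context(document: str, chunk: str) -> str:
    # Stand-in for the lightweight LLM call; a real system would prompt a
    # model with the full document and the chunk and use its response.
    title = document.splitlines()[0]
    return f"This chunk is from '{title}'."

def build_index(document: str, chunks: list[str]) -> list[IndexedChunk]:
    indexed = []
    for chunk in chunks:
        context = generate_context(document, chunk)  # one LLM call per chunk
        indexed.append(IndexedChunk(text=chunk,
                                    contextualized=f"{context} {chunk}"))
    return indexed
```

At query time nothing changes: the `contextualized` string is what gets embedded and tokenized for BM25, while the original `text` is what you pass to the generator once the chunk is retrieved.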
Example
A legal-research team has 180,000 chunks indexed from case filings. Retrieval on ambiguous references ("the defendant's motion to dismiss was granted") often fails because the chunk text alone does not identify which case or defendant is meant. They regenerate the index with contextual retrieval, adding a sentence like "From Smith v. Acme, 2023, discussing the defendant Acme's procedural motion" to each chunk before embedding. Indexing cost rises by one small LLM call per chunk, a one-time expense. Top-20 retrieval error on case-specific queries drops by roughly 40% with no change to the retriever or generator.
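The per-chunk context call is driven by a prompt that shows the model the whole document and the chunk to situate. The template below is a hypothetical sketch of what such a prompt might look like, not the team's actual wording:

```python
# Hypothetical prompt template for the per-chunk context-generation call.
CONTEXT_PROMPT = """\
<document>
{document}
</document>

Here is the chunk we want to situate within the whole document:
<chunk>
{chunk}
</chunk>

Give a short, succinct context that situates this chunk within the overall
document, for the purpose of improving search retrieval of the chunk.
Answer only with the succinct context and nothing else."""

def make_prompt(document: str, chunk: str) -> str:
    # Fill the template; the LLM's reply is prepended to the chunk
    # before embedding and BM25 indexing.
    return CONTEXT_PROMPT.format(document=document, chunk=chunk)
```

Because the same full document is sent with every chunk, prompt caching (where the provider supports it) keeps the one-time indexing cost low.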