Self-RAG

Self-RAG is a pattern in which the language model emits special reflection tokens that control its own retrieval and generation decisions. At inference time, the model decides whether a retrieval call is needed for the current step, whether each retrieved passage is relevant and supportive, and whether the draft generation is faithful to the retrieved evidence. These reflection tokens are introduced during fine-tuning on a dataset labeled with retrieval and critique signals, so the behavior is learned rather than prompted. The result is a model that can skip retrieval on easy prompts, pull in multiple passages only when needed, and self-flag when its own output is under-supported. It differs from Corrective RAG in that the corrective logic is inside the model weights rather than implemented as an external control loop over a generic LLM.

Example

A fine-tuned Self-RAG model is given the prompt "write a haiku about autumn". It emits a reflection token indicating no retrieval is needed and produces the poem directly, saving the retrieval round-trip. Given "summarize the findings of the latest CDC report on flu hospitalizations", it emits a retrieve token, pulls three CDC passages, marks two as relevant and one as off-topic, drafts a summary, and emits support tokens marking each claim as supported or unsupported. The unsupported claim is flagged for a human reviewer instead of being silently shipped.
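The workflow in this example can be sketched as a small control loop. Everything below is an illustrative assumption: the token names, the `stub_model` that stands in for a fine-tuned Self-RAG model, and `fake_retriever` are hypothetical, not a real model's API. The point is the shape of the loop, with the retrieval, relevance, and support decisions coming from the model itself rather than external heuristics.

```python
# Minimal sketch of a Self-RAG inference loop. The reflection signals
# ("retrieve", "relevant", "supported") and the stubs are hypothetical
# stand-ins for a fine-tuned model's reflection tokens.

def stub_model(prompt, passages=None):
    """Stand-in for a fine-tuned Self-RAG model: returns a generation
    plus the reflection signals the model would emit."""
    if passages is None:
        # First pass: the model decides whether retrieval is needed.
        needs_facts = "report" in prompt or "latest" in prompt
        return {"retrieve": needs_facts,
                "text": None if needs_facts else "draft answer"}
    # Second pass: critique each passage, then judge each drafted claim.
    return {
        "relevant": [p["topic"] == "flu" for p in passages],
        "text": "summary",
        "supported": [True, False],  # per-claim support judgments
    }

def self_rag(prompt, retriever):
    first = stub_model(prompt)
    if not first["retrieve"]:
        # Easy prompt: skip retrieval entirely, answer directly.
        return {"answer": first["text"], "used": 0, "flags": []}
    passages = retriever(prompt)
    second = stub_model(prompt, passages=passages)
    # Keep only passages the model marked relevant.
    kept = [p for p, ok in zip(passages, second["relevant"]) if ok]
    # Flag under-supported claims for review instead of shipping them.
    flags = [i for i, ok in enumerate(second["supported"]) if not ok]
    return {"answer": second["text"], "used": len(kept), "flags": flags}

def fake_retriever(prompt):
    return [{"topic": "flu"}, {"topic": "flu"}, {"topic": "travel"}]

print(self_rag("write a haiku about autumn", fake_retriever))
print(self_rag("summarize the latest CDC report", fake_retriever))
```

Note that the corrective logic here (when to retrieve, what to keep, what to flag) lives in `stub_model`'s outputs, mirroring how Self-RAG bakes these decisions into the model weights; an external Corrective RAG loop would instead compute them with separate grader calls around a generic LLM.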
