Needle in a Haystack
Needle in a haystack (NIAH) is a long-context evaluation pattern that measures whether a model can retrieve a specific fact (the needle) planted at an arbitrary position inside a long passage of irrelevant text (the haystack). The evaluator varies two dimensions — the total context length and the depth at which the needle is inserted — and asks a question that can only be answered from the needle. Recovery rates are reported as a grid, which exposes where retrieval degrades: near-perfect at the start and end of the context, often worse in the middle, sometimes collapsing outright at the longest lengths. Popular in 2023–2024 as a first check on long-context claims, the benchmark is useful but limited: it tests literal recall of a planted fact, not multi-hop reasoning, aggregation across the corpus, or robustness to adversarial distractors. Modern long-context evals add those dimensions rather than replacing NIAH entirely.
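A minimal harness makes the pattern concrete. The sketch below is illustrative, not any benchmark's official implementation: `build_haystack`, `run_grid`, and the stub model are hypothetical names, tokens are approximated by whitespace-split words, and scoring is a simple substring check.

```python
import random
import re

def build_haystack(filler_sentences, needle, total_words, depth_pct, seed=0):
    """Build ~total_words words of filler with `needle` inserted at depth_pct%."""
    rng = random.Random(seed)
    words = []
    while len(words) < total_words:
        words.extend(rng.choice(filler_sentences).split())
    words = words[:total_words]
    cut = int(len(words) * depth_pct / 100)  # needle depth as a word offset
    return " ".join(words[:cut] + needle.split() + words[cut:])

def run_grid(answer_fn, filler_sentences, needle, question, expected, lengths, depths):
    """Return {(length, depth): 1.0 if the answer contains `expected`, else 0.0}."""
    grid = {}
    for n in lengths:
        for d in depths:
            prompt = build_haystack(filler_sentences, needle, n, d) + "\n\nQ: " + question
            grid[(n, d)] = 1.0 if expected in answer_fn(prompt) else 0.0
    return grid

# Stub "model" that retrieves perfectly via regex; swap in a real model call.
def stub_model(prompt):
    m = re.search(r"secret code is (\d+)", prompt)
    return m.group(1) if m else "unknown"

filler = ["The quick brown fox jumps over the lazy dog.",
          "Lorem ipsum dolor sit amet, consectetur adipiscing elit."]
grid = run_grid(stub_model, filler, "The secret code is 7421.",
                "What is the secret code?", "7421",
                lengths=[1_000, 5_000], depths=[5, 50, 95])
```

With a real model in place of the stub, the returned dictionary is exactly the length-by-depth grid that NIAH reports.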
Example
A vendor claims a 1M-token context window. An evaluator runs needle-in-a-haystack at 100K, 500K, and 950K tokens, inserting the needle at 5%, 25%, 50%, 75%, and 95% depth. The result grid shows near-perfect recovery at the start and end at every length, a visible drop in the middle at 500K, and a larger drop in the middle at 950K — the classic U-shape. The takeaway is not that long context is broken but that "how long" and "how reliable at which depth" are different questions, and the nominal context window is a ceiling, not an operating range.
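Tabulating such a grid makes the U-shape easy to read at a glance. The recovery rates below are synthetic, chosen only to mirror the pattern in the example above, and `render_grid` is a hypothetical helper, not part of any benchmark tooling.

```python
def render_grid(grid, lengths, depths):
    """Render a {(length, depth): recovery_rate} dict as an aligned text table."""
    header = "length".ljust(8) + "".join(f"{d:>6}%" for d in depths)
    rows = [header]
    for n in lengths:
        cells = "".join(f"{grid[(n, d)]:>7.2f}" for d in depths)
        rows.append(f"{n // 1000}K".ljust(8) + cells)
    return "\n".join(rows)

lengths = [100_000, 500_000, 950_000]
depths = [5, 25, 50, 75, 95]
# Synthetic rates mirroring the example: edges hold up, the middle sags with length.
synthetic = {
    (100_000, 5): 1.00, (100_000, 25): 0.99, (100_000, 50): 0.98,
    (100_000, 75): 0.99, (100_000, 95): 1.00,
    (500_000, 5): 0.99, (500_000, 25): 0.95, (500_000, 50): 0.82,
    (500_000, 75): 0.94, (500_000, 95): 0.99,
    (950_000, 5): 0.98, (950_000, 25): 0.90, (950_000, 50): 0.61,
    (950_000, 75): 0.89, (950_000, 95): 0.98,
}
print(render_grid(synthetic, lengths, depths))
```

Reading down the 50% column shows the middle-of-context drop growing with length, while the 5% and 95% columns stay near 1.0 — the "ceiling, not an operating range" point in table form.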