🦨 Alpha's Tech Garden

❯

❯

Retrieval Augmented Generation

Retrieval Augmented Generation

Aug 03, 20251 min read

technique
llm
models
ai

The key idea is this: a user asks a question. You search your private documents for content that appears relevant to the question, then paste excerpts of that content into the LLM (respecting its size limit, usually between 3,000 and 6,000 words) along with the original question.

The LLM can then answer the question based on the additional content you provided. ¹

Footnotes

Embeddings: what they are and why they matter, Simon Willson ↩

Graph View

Backlinks

Multi-hop queries
Embeddings

Created with Quartz v4.4.0 © 2025

GitHub