Skip to main content
All terms
Data

RAG Chunking & Indexing

Splitting documents into passages, embedding them, and storing them for retrieval.

Definition

RAG chunking and indexing is the data-preparation pipeline behind retrieval-augmented generation (fetching relevant documents and feeding them to the model as context). Long documents are split into meaningful chunks, each chunk is converted into an embedding (a list of numbers capturing its meaning), and the embeddings are stored in a vector database (a store that finds items by similar meaning) so relevant passages can be retrieved when a question comes in. Chunk size and boundaries significantly affect retrieval quality, since poor splits can fragment or bury the information a query needs.