RAG & Retrieval | BasicsOf.AI

Chunking Strategies

Chunking is the process of splitting documents into smaller segments before embedding them for retrieval. The strategy you choose...
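As a rough illustration of the splitting step described above, here is a minimal sketch of one possible approach: fixed-size word windows with overlap. The function name `chunk_words` and the parameters `chunk_size` and `overlap` are assumptions made for this example, not something the article prescribes, and other strategies (sentence-based, structure-aware) may suit a given corpus better.

```python
def chunk_words(text: str, chunk_size: int = 50, overlap: int = 10) -> list[str]:
    """Split text into overlapping word windows (hypothetical helper).

    chunk_size and overlap are counted in whitespace-separated words.
    """
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must satisfy 0 <= overlap < chunk_size")

    words = text.split()
    step = chunk_size - overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        # Stop once the current window has reached the end of the text,
        # so no trailing chunk consists only of already-covered words.
        if start + chunk_size >= len(words):
            break
    return chunks


if __name__ == "__main__":
    sample = " ".join(str(i) for i in range(10))
    for chunk in chunk_words(sample, chunk_size=4, overlap=1):
        print(chunk)
```

The overlap means that a sentence straddling a chunk boundary still appears whole in at least one neighboring chunk, at the cost of embedding some words twice.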

Prompt engineering, RAG, and fine-tuning are three different ways to improve LLM output quality, each operating at a different lay...

Retrieval is the step in a RAG pipeline that finds the passages most likely to contain the answer to a query. There are two fundam...

RAG is a framework that connects a language model to an external knowledge source at inference time, letting it retrieve relevant...

A vector database stores high-dimensional numeric vectors (embeddings) and retrieves the ones most similar to a query vector, whic...
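To make the idea of similarity retrieval concrete, the following is a small sketch of brute-force nearest-neighbor search using cosine similarity over an in-memory dictionary. It is an assumption-laden toy: the names `cosine_similarity` and `top_k` and the two-dimensional vectors are invented for illustration, cosine similarity is only one of several possible metrics, and real vector databases typically rely on approximate indexes rather than scanning every vector.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # treat a zero vector as similar to nothing
    return dot / (norm_a * norm_b)


def top_k(query: list[float], vectors: dict[str, list[float]], k: int = 2):
    """Return the k (key, score) pairs most similar to the query.

    This scans every stored vector, which is fine for a toy example only.
    """
    scored = [(cosine_similarity(query, vec), key) for key, vec in vectors.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [(key, score) for score, key in scored[:k]]


if __name__ == "__main__":
    # Hypothetical 2-D "embeddings"; real embeddings have hundreds of dimensions.
    store = {"a": [1.0, 0.0], "b": [0.0, 1.0], "c": [1.0, 1.0]}
    print(top_k([1.0, 0.0], store, k=2))
```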