RAG Pipeline

Retrieval-Augmented Generation (RAG): fetch relevant context from a vector store and supply it to the LLM before it generates a response.

Our Implementation

Documents → Chunking → Embeddings → pgvector → Similarity Search → Reranking → LLM
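
As a rough illustration of the first stages of this pipeline (chunking, embedding, similarity search), here is a minimal in-memory sketch. It is not our actual implementation. The bag-of-words `embed` function is a toy stand-in for a real embedding model, the Python list stands in for a pgvector table, and all function names (`chunk`, `embed`, `cosine`, `build_store`, `search`) are hypothetical.

```python
import math
import re
from collections import Counter


def chunk(text: str, size: int = 8, overlap: int = 2) -> list[str]:
    """Split text into overlapping windows of `size` words."""
    words = text.split()
    if not words:
        return []
    step = size - overlap
    last_start = max(len(words) - overlap, 1)
    return [" ".join(words[i:i + size]) for i in range(0, last_start, step)]


def embed(text: str) -> Counter:
    """Toy embedding: sparse word-count vector (assumption, not a real model)."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse vectors."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_store(documents: list[str]) -> list[tuple[str, Counter]]:
    """Chunk and embed every document; the list stands in for pgvector."""
    return [(c, embed(c)) for doc in documents for c in chunk(doc)]


def search(query: str, store: list[tuple[str, Counter]], k: int = 3) -> list[tuple[float, str]]:
    """Return the k chunks most similar to the query, best first."""
    q = embed(query)
    scored = [(cosine(q, vec), text) for text, vec in store]
    return sorted(scored, key=lambda pair: pair[0], reverse=True)[:k]


if __name__ == "__main__":
    docs = [
        "pgvector stores embeddings inside Postgres",
        "Rerankers reorder search results",
        "Wikis hold curated knowledge",
    ]
    for score, text in search("where are embeddings stored", build_store(docs), k=2):
        print(f"{score:.2f}  {text}")
```

In a real deployment the nearest-neighbour ranking would run inside the database rather than in Python; the sketch only shows the shape of the data flowing between stages.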

fajb-next: RerankerService reorders the candidates returned by the initial vector search so that the most relevant ones reach the LLM.
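
The reranking step can be sketched as a second scoring pass over the candidates from vector search. The interface of RerankerService is not shown here, so the `rerank` function below is hypothetical, and `term_overlap` is a toy scorer standing in for whatever relevance model (for example a cross-encoder) a real reranker would use.

```python
import re
from typing import Callable


def term_overlap(query: str, passage: str) -> float:
    """Toy relevance score: fraction of query terms that appear in the passage."""
    q_terms = set(re.findall(r"[a-z0-9]+", query.lower()))
    p_terms = set(re.findall(r"[a-z0-9]+", passage.lower()))
    return len(q_terms & p_terms) / len(q_terms) if q_terms else 0.0


def rerank(
    query: str,
    candidates: list[str],
    scorer: Callable[[str, str], float] = term_overlap,
    top_n: int = 3,
) -> list[str]:
    """Rescore vector-search candidates against the query and keep the best top_n."""
    scored = [(scorer(query, text), text) for text in candidates]
    # Python's sort is stable, so ties keep their original vector-search order.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [text for _, text in scored[:top_n]]


if __name__ == "__main__":
    candidates = [
        "Postgres is a relational database",
        "Rerankers reorder vector search results",
        "Vector search finds similar chunks",
    ]
    for text in rerank("how do rerankers reorder results", candidates, top_n=2):
        print(text)
```

Passing the scorer as a parameter keeps the reordering logic independent of the relevance model, so the toy scorer can be swapped out without touching `rerank`.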

Approach	Best For
RAG	Querying large document collections, real-time retrieval
LLM Wiki	Persistent, curated knowledge that compounds over time

Both can coexist: the Second Brain is a wiki, but individual projects may use RAG for domain-specific document search.