Hybrid retrieval that actually understands your corpus
Dense embeddings + sparse BM25 are fused with reciprocal-rank fusion, optionally rewritten via HyDE and multi-query, then reranked with a cross-encoder before the model ever sees a token.
Hybrid + RRF
Dense semantic search and sparse keyword recall are merged with reciprocal-rank fusion for resilient top-K.
HyDE & multi-query
Tough questions get rewritten into hypothetical answers and 3-way paraphrases, then unioned and deduped.
Cross-encoder rerank
BAAI/bge-reranker-base re-scores the candidate set so only the most defensible passages reach the LLM.