Crushing RAG Latency: 50% Faster Retrieval with HNSW Tuning & Hybrid Re-ranking

21 Dec 2025

You’ve built a RAG pipeline and the answers are accurate, but the retrieval step alone is eating up 800 ms. In a recent project handling document searc…

Tags: HNSW, LLM, Performance Engineering, python, Qdrant, RAG, Re-ranking, Vector Database