Dify is taking a step forward in knowledge base retrieval. Come September 1, 2024, we're moving away from the N-to-1 retrieval strategy and embracing a more versatile Multi-path retrieval approach. This update is designed to significantly enhance your application's retrieval efficiency. We urge you to act now: Switch to Multi-path retrieval for an improved strategy that adapts to your unique needs and delivers results with high accuracy.
Why retire "N-to-1 retrieval"?
Our analysis has uncovered key limitations in the N-to-1 retrieval strategy. This approach restricts searches to a single knowledge base and relies heavily on LLM interpretation of knowledge base descriptions. As a result, it often produces incomplete or inaccurate results, compromising retrieval quality. Feedback from our community supports these findings, driving our decision to move towards a more effective solution.
A Better Solution: Configurable "Multi-path Retrieval"
Our enhanced Multi-path retrieval strategy offers:
Optional reranking strategies
Semantic and keyword weighting for optimized retrieval
Integration with reranking models (e.g., Cohere, Jina) for peak performance
We recommend using this new setup for more accurate retrieval.
What you need to do
Dify Cloud Users: Switch from "N-to-1 retrieval" to "Multi-path retrieval" in Context > Retrieval Setting. We encourage you to make this change before September 1, 2024, to ensure optimal performance and take full advantage of the new features. If you haven't made the switch by then, we'll automatically update your settings to Multi-path retrieval on that date.
Community and Enterprise Users: If you're running v0.6.16, we also encourage you to transition to Multi-path retrieval before September 1. Our User Guide provides detailed configuration steps to help you maximize performance. Our September 1 release will automatically transition all users to Multi-path retrieval. After the update, if you experience any performance shifts, you'll have the flexibility to manually fine-tune your retrieval settings as needed.
Optimizing "Multi-path Retrieval" with Rerank
Multi-path retrieval in Dify offers two primary configuration options: Keyword & Semantic Weighted Score and Rerank Model selection.
Keyword & Semantic Weighted Score Configuration
Keyword-only (weight: 1): Best for exact matches. It's fast and efficient, especially for large knowledge bases. Use this when your users know precisely what they're looking for.
Semantic-only (weight: 1): Understands the meaning behind queries. It can find relevant info even without exact keyword matches. Great for multilingual content and complex searches.
Custom weight balance: Blend keyword and semantic approaches to fit your needs. Adjust the mix to match your unique business requirements or complex information structure.
Rerank Model
For unparalleled retrieval precision, implementing a rerank model is key. This refines initial results, dramatically enhancing overall accuracy.
For detailed configuration steps and best practices, please refer to our documentation.
Looking ahead
This upgrade marks the beginning of our journey to enhance Dify's RAG capabilities. We're committed to refining our RAG system, prioritizing flexibility and openness to serve our diverse community and customer needs.
Your insights are crucial as we grow. Join our community and help us shape the future of Dify.