Help me! How to improve retrieval performance? #7805
Replies: 2 comments 2 replies
-
|
Here are several methods to improve retrieval performance in RAGFlow when using Elasticsearch (ES): 1. Disable features that slow down retrieval:
2. Use a GPU for embedding: 3. Adjust retrieval parameters:
4. Optimize Elasticsearch configuration:
5. Identify bottlenecks: 6. Use the simplest document parser when possible: 7. Keep RAGFlow updated: If you have already tried these and retrieval is still slow, it may be helpful to share more about your ES hardware setup, network latency, or specific retrieval settings for deeper troubleshooting. To continue talking to Dosu, mention @dosu-bot. Help Dosu learn! Give it feedback: Great Response | Irrelevant Answer | Incorrect Sources | Too Verbose | Hallucination | Bug Report | Other |
Beta Was this translation helpful? Give feedback.
-
|
@ericwu0930 Could you share some use cases about how are you using RAGFlow? How much files would you like to put in RAGFlow in the long run? |
Beta Was this translation helpful? Give feedback.

Uh oh!
There was an error while loading. Please reload this page.
-
The retrieval process is too slow. Are there some methods to improve it?
I have read the relevant issues, and I did not turn on the rerank button. And I use ES as the doc engine.
What's more, there are around 700 files in the dataset. Among them, 400 files have an average of 10 chunks each, with each chunk containing 512 tokens. The remaining files have an average of 4 chunks each, also with 512 tokens per chunk.
Here is the list of elapsed time.

What's more, in my own AI agent, the retrieval processs from ragflow via /api/v1/retrieval endpoint is also unacceptable slow. It usually takes more than 6 seconds, sometimes even longer to get the result. So, I don't think this issue is related to the configuration of the chat assistant.
Beta Was this translation helpful? Give feedback.
All reactions