Hi! Congrats on the huge effort of releasing RTEB! If instructions aren't already used for instruction-aware models, I'd like to suggest including them: in my private code localization benchmarks, adding the simple code instruction from the official qwen3-embedding repo can boost recall@10 by up to 10 pp. That could drastically change the picture when comparing the top embedding models from oai / voyage (which are not instruction-aware, AFAIK) with the qwen3-embedding models on the specialized tasks they support. The same goes for the Gemini embedding model: it supports various task types, and specifying the right one can boost retrieval quality by a margin.
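To make the suggestion concrete, here is a minimal sketch of what "adding an instruction" and measuring recall@10 could look like. The `Instruct: ...\nQuery:...` template is my recollection of the qwen3-embedding README and should be checked against the official repo; the task description, the query, and the helper names `with_instruction` and `recall_at_k` are hypothetical and only for illustration.

```python
def with_instruction(task_description: str, query: str) -> str:
    """Prefix a query with a task instruction; documents stay unprefixed.

    Assumption: this template mirrors the one in the qwen3-embedding README.
    """
    return f"Instruct: {task_description}\nQuery:{query}"


def recall_at_k(ranked_ids: list[str], relevant_ids: set[str], k: int = 10) -> float:
    """Fraction of the relevant items that appear in the top-k ranked results."""
    if not relevant_ids:
        return 0.0
    hits = sum(1 for doc_id in ranked_ids[:k] if doc_id in relevant_ids)
    return hits / len(relevant_ids)


# Hypothetical task description and query for code localization.
task = "Given a bug report, retrieve the source files that need to be changed"
prompt = with_instruction(task, "NullPointerException in UserService.login")

# Toy ranking: one of the two relevant files is in the top 2.
score = recall_at_k(["a.py", "b.py", "c.py"], {"a.py", "d.py"}, k=2)
```

Only the query side gets the prefix here; the benchmark comparison would embed the same corpus once and score the ranked results with and without the instruction.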
Replies: 4 comments
Yes, we're evaluating Qwen with instructions, but they can be a bit different (see #2907), because they haven't added them to the implementation in our repo.
@Samoed Thx.
Are the instructions you used published somewhere in your repo? It'd be nice to see them.
Some tasks specify them in their metadata (mteb/mteb/tasks/Clustering/eng/ArxivClusteringS2S.py, line 41 in 12fe80b); see also mteb/mteb/abstasks/AbsTaskClustering.py, line 65 in 12fe80b.
Since this is more of a question about the evaluation process, I will move it over to discussions (@Samoed outlines the current process nicely).