[RTEB] qwen3-embedding-8b instruction-tuned for the corresponding retrieval tasks #3239

CyrilShch · 2025-10-01T18:58:05Z

CyrilShch
Oct 1, 2025

Hi! Congrats on a huge effort for releasing RTEB!
I was wondering if the results of qwen3-embedding-8b eval across the tasks are reported with its instruction-tuned version (since it's instruction-aware)?

If not, I'd like to suggest to include such since in my private benchmarks of code localization comparing the simple code instruction from the official qwen3-embedding repo could boost the recall@10 up to 10pp. Then, the whole picture of comparing top embedding models from oai / voyage (since they're not instruction-aware AFAIK) vs qwen3-embedding models can change drastically for specialized supported tasks.

The same for the gemini embedding model - it supports various tasks, specifying which can boost the retrieval quality by a margin

Answered by Samoed

Oct 1, 2025

Some tasks specified in their Metadata

mteb/mteb/tasks/Clustering/eng/ArxivClusteringS2S.py

Line 41 in 12fe80b

     prompt="Identify the main and secondary category of Arxiv papers based on the titles",  

 

if tasks didn't specified it, then instructions from abs class are taken

mteb/mteb/abstasks/AbsTaskClustering.py

Line 65 in 12fe80b

abstask_prompt = "Identify categories in user passages."

View full answer

Samoed · 2025-10-01T19:41:08Z

Samoed
Oct 1, 2025
Maintainer

Yes, we're evaluating qwen with instructions, but they can be a bit different #2907, because they haven't added them to implementation in our repo . gemmaembeding model also evaluated with instructions

0 replies

CyrilShch · 2025-10-01T19:54:42Z

CyrilShch
Oct 1, 2025
Author

@Samoed Thx.

Yes, we're evaluating qwen with instructions, but they can be a bit different, because they haven't added them to implementation in our repo

Are those used instructions published in your repo somewhere? I'd nice to see them

0 replies

Samoed · 2025-10-01T20:12:22Z

Samoed
Oct 1, 2025
Maintainer

Some tasks specified in their Metadata

mteb/mteb/tasks/Clustering/eng/ArxivClusteringS2S.py

Line 41 in 12fe80b

    
           prompt="Identify the main and secondary category of Arxiv papers based on the titles",

if tasks didn't specified it, then instructions from abs class are taken

mteb/mteb/abstasks/AbsTaskClustering.py

Line 65 in 12fe80b

abstask_prompt = "Identify categories in user passages."

0 replies

KennethEnevoldsen · 2025-10-02T15:55:27Z

KennethEnevoldsen
Oct 2, 2025
Maintainer

Since this is more of a question about the evaluation process, I will move it over to discussions (@Samoed outlines the current process nicely).

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[RTEB] qwen3-embedding-8b instruction-tuned for the corresponding retrieval tasks #3239

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 4 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

[RTEB] qwen3-embedding-8b instruction-tuned for the corresponding retrieval tasks #3239

Uh oh!

Uh oh!

CyrilShch Oct 1, 2025

Replies: 4 comments

Uh oh!

Uh oh!

Samoed Oct 1, 2025 Maintainer

Uh oh!

CyrilShch Oct 1, 2025 Author

Uh oh!

Samoed Oct 1, 2025 Maintainer

Uh oh!

KennethEnevoldsen Oct 2, 2025 Maintainer

CyrilShch
Oct 1, 2025

Samoed
Oct 1, 2025
Maintainer

CyrilShch
Oct 1, 2025
Author

Samoed
Oct 1, 2025
Maintainer

KennethEnevoldsen
Oct 2, 2025
Maintainer