Usage statistics of MTEB (2025/12/03) #3651
Replies: 4 comments 3 replies
Here is a small overview of the current status on the leaderboard (sadly we can't see it across time):
and 6,766 likes (hearts)
Interesting, why are almost 99% of users on Linux?
I strongly suspect that these are simply from automated downloads. The Python version graphs are a good indicator here: if only one version spiked, then it was presumably automated. It's good to see the steady growth in between the outliers, and slowly you're starting to see the weekdays vs. the weekends. I always feel like this is a good sign of real users, as they'll mostly use projects during the weekdays. See e.g. the nltk stats for a good example of what that'll look like. P.S. I also like the clickpy page that you linked (https://clickpy.clickhouse.com/dashboard/mteb); I feel like this page has some valuable stats that no other big tracking sites have.
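The two heuristics above (one Python version spiking on its own, and a weekday versus weekend rhythm) could be checked with something like the sketch below. It assumes you have exported daily download counts per Python version as `(day, python_version, downloads)` rows. The function names, the `factor` threshold, and the sample numbers are all made up for illustration and are not taken from the actual MTEB stats.

```python
from collections import defaultdict
from datetime import date, timedelta
from statistics import median


def weekday_weekend_ratio(rows):
    """Mean downloads per weekday divided by mean downloads per weekend day."""
    per_day = defaultdict(int)
    for day, _version, count in rows:
        per_day[day] += count
    weekday = [c for d, c in per_day.items() if d.weekday() < 5]
    weekend = [c for d, c in per_day.items() if d.weekday() >= 5]
    if not weekday or not weekend:
        raise ValueError("need at least one weekday and one weekend day")
    return (sum(weekday) / len(weekday)) / (sum(weekend) / len(weekend))


def single_version_spikes(rows, factor=5.0):
    """Map each day to the one Python version that spiked on it.

    A version "spikes" on a day when its downloads exceed `factor` times
    its own median (an arbitrary threshold, chosen for illustration).
    Days where several versions spike together are left out, since those
    look more like a release or announcement than a single automated job.
    """
    by_version = defaultdict(lambda: defaultdict(int))
    for day, version, count in rows:
        by_version[version][day] += count

    spiking = defaultdict(list)  # day -> versions that spiked on that day
    for version, series in by_version.items():
        baseline = median(series.values())
        for day, count in series.items():
            if baseline > 0 and count > factor * baseline:
                spiking[day].append(version)
    return {day: versions[0] for day, versions in spiking.items() if len(versions) == 1}


# Made-up sample: two weeks of data with a weekday/weekend rhythm and one
# artificial spike that affects Python 3.12 only.
start = date(2025, 11, 3)
rows = []
for offset in range(14):
    day = start + timedelta(days=offset)
    base = 100 if day.weekday() < 5 else 40
    rows.append((day, "3.11", base))
    rows.append((day, "3.12", 5000 if offset == 9 else base * 2))

print(single_version_spikes(rows))
print(weekday_weekend_ratio(rows))
```

Note that the spike also inflates the weekday/weekend ratio, so in practice you would probably want to drop the flagged days before reading anything into that number.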
Someone shared this with me as well:
from the docs:
Which I think is decently close to what we want. Source: https://isitmaintained.com/
I looked a bit at some download (usage?) statistics of MTEB and found them worth sharing. I would also like to use the opportunity to thank the MTEB maintainers and contributors: great work!
Do more people use MTEB?
Overall usage has grown slowly but consistently. You can see some notable spikes, which I suspect co-occur with releases and potentially some social media posts:
Does that have any relation to GitHub stars? (you know, the only real number that matters ;) )
It has! Although growth seems fairly steady.
What do people use with MTEB?
Impressively, it seems like almost all users are on Python 3.12 and Linux. Way to go, users!
What version do people use?
People still seem to be on v1:
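For anyone who wants to check this against the raw numbers, here is a minimal sketch of how per-version download counts could be rolled up by major version. The `share_by_major` helper and the version strings and counts in `downloads` are hypothetical placeholders, not real MTEB figures.

```python
from collections import Counter


def share_by_major(downloads_by_version):
    """Return each major version's share of total downloads (0.0 to 1.0)."""
    totals = Counter()
    for version, count in downloads_by_version.items():
        major = version.split(".", 1)[0]  # "1.38.0" -> "1"
        totals[major] += count
    grand_total = sum(totals.values())
    if grand_total == 0:
        return {}
    return {major: n / grand_total for major, n in totals.items()}


# Made-up counts, for illustration only.
downloads = {"1.38.0": 700, "1.39.2": 200, "2.0.1": 100}
shares = share_by_major(downloads)
for major, share in sorted(shares.items()):
    print(f"v{major}: {share:.0%}")
```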
Limitations
These numbers do not capture the influence of the leaderboard, which probably has a bigger reach than the package. They also likely include downloads from bots, CI, etc.
Edit: Links: Download stats and star history, version-specific downloads, clickpy
Any cool stats that I am missing out on? Do feel free to share.