[EVAL] Add kyrgyzLLM benchmark

Hi, 

We just open-sourced the Kyrgyz LLM Evaluation Dataset. 

## Evaluation short description
- Why is this evaluation interesting?

KyrgyzLLM-Bench is the first comprehensive benchmark suite for deep language understanding in Kyrgyz. It is interesting because it provides broad, culturally grounded coverage by combining native benchmarks (such as KyrgyzMMLU and KyrgyzRC) with carefully translated and post-edited international benchmarks (such as HellaSwag, WinoGrande, BoolQ, GSM8K, and TruthfulQA).

- How used is it in the community?
As the benchmark was released recently, its adoption by the community is just beginning. It is significant because it's the first comprehensive benchmark suite for deep language understanding, specifically in the Kyrgyz language, providing a new and essential tool for researchers and developers.

## Evaluation metadata

- Paper url: https://ieeexplore.ieee.org/document/11206960
- Github url:  https://github.com/golden-ratio/kyrgyzLLM_bench
- Dataset url: https://huggingface.co/collections/TTimur/kyrgyzllm-bench
- OpenLLM Leaderboard: https://huggingface.co/spaces/TTimur/OpenLLMKyrgyzLeaderboard_v0.1

Thanks you!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[EVAL] Add kyrgyzLLM benchmark #1036

Evaluation short description

Evaluation metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[EVAL] Add kyrgyzLLM benchmark #1036

Description

Evaluation short description

Evaluation metadata

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions