Where are the rest of the runs, and how do you get your accuracy numbers? #1

@Naqu6

Hi, cool project :)

I took a look at the evals and noticed that there are only 127 eval files. Further, only 107 of them seem to pass the tests.

Would it be possible for you to post the rest of the eval files?

If not, a list of instances that you resolved would be great.

Thanks!
