SWE-bench_Multilingual support

Hi maintainers,

Thanks for the great work on this project. I’d like to request adding evaluation support for SWE-bench_Multilingual.

**Motivation**
SWE-bench has become a widely used benchmark for evaluating code-fixing agents on real GitHub issues/PR-style tasks. In addition, SWE-bench_Multilingual extends this setting to multilingual repositories, which is increasingly important for evaluating real-world performance beyond English-only codebases.

Supporting these benchmarks in this repo would make it easier to:

run standardized evaluations and compare results across models/agents,
reproduce published numbers,
evaluate multilingual code repair capabilities in a consistent pipeline.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SWE-bench_Multilingual support #395

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

SWE-bench_Multilingual support #395

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions