logic-star-ai

Popular repositories

  1. swt-bench Public

    [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation

    Python · 48 stars · 6 forks

  2. baxbench Public

    Python · 30 stars · 4 forks

  3. SWEBench Public

    Forked from SWE-bench/SWE-bench

    SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?

    Python

  4. tests Public

Repositories

  • tests Public
    0 stars · 0 forks · 0 issues · 0 pull requests · Updated May 5, 2025
  • swt-bench Public

    [NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation

    Python · 48 stars · MIT license · 6 forks · 3 issues · 0 pull requests · Updated Apr 18, 2025
  • SWEBench Public Forked from SWE-bench/SWE-bench

    SWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?

    Python · 0 stars · MIT license · 501 forks · 0 issues · 0 pull requests · Updated Apr 16, 2025
  • baxbench Public
    Python · 30 stars · MIT license · 4 forks · 0 issues · 0 pull requests · Updated Feb 25, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Python