SWE-bench
Organization for maintaining the SWE-bench/agent projects
Repositories
Showing 10 of 28 repositories
- experiments Public
Open-sourced predictions, execution logs, trajectories, and results from model inference and evaluation runs on the SWE-bench task.
- swe-bench__humaneval Public
- swe-bench__humanevalfix-go Public
- swe-bench__humanevalfix-js Public
- swe-bench__humanevalfix-java Public
- pytest-dev__pytest Public