You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
chore: Option to run multiple instance ids by name (#854)
# Motivation
There currently isn't a way to evaluate multiple instance ids by name on
swe bench
# Content
Adds one more argument to `run_eval_command`, `run_eval` and
`get_sweb_ench_examples`
# Testing
<!-- How was the change tested? -->
# Please check the following before marking your PR as ready for review
- [ ] I have added tests for my changes
- [ ] I have updated the documentation or added new documentation as
needed
@click.option("--use-existing-preds", help="The run ID of the existing predictions to use.", type=str, default=None)
@@ -363,10 +370,15 @@ async def run_eval(
363
370
"--num-workers", help="The number of workers to use. This is the number of examples that will be processed concurrently. A large number may lead to rate limiting issues.", type=int, default=5
364
371
)
365
372
@click.option("--model", help="The model to use.", type=str, default="claude-3-7-sonnet-latest")
@click.option("--instance-ids", help="The instance IDs of the examples to process. Example: --instance-ids <instance_id1>,<instance_id2>,...", type=list_of_strings, default="")
0 commit comments