| Field | Example |
|---|---|
| Name | SWE-bench Verified |
| Metric | resolved |
| Unit | percent |
| Model | openai/gpt-5.5 |
| Score | 87 |
Upload Scores
Open Evals in the Dari dashboard and create a new eval. Upload a CSV with these columns:model_id and score are required. notes is optional. Scores are numeric; if your metric is a percentage, put percent or % in the eval unit.
Dari and Team Evals
Dari can publish curated global scorecards, and your organization can upload private evals. Both appear in the Evals section so you can compare public benchmarks with your own measurements.Import Evals Into Routers
Routers can import eval scorecards from the router create/edit pages. Imported evals become structured benchmark evidence for the routing selector. At request time, Dari only sends score rows whosemodel_id matches one of the router’s enabled models.
Use imported evals for stable benchmark facts, such as “SWE-bench Verified: openai/gpt-5.5 scored 87.” Use routing.instructions for request-specific tradeoffs like cost, latency, or when to prefer stronger reasoning.