Tool Drop/Tooling/Stable
#008/public preview/vault-backed

Evaluation Runner Template

Automate eval suites to keep agents in spec.

next action

Public preview stays open. Full steps, assets, and repo access live behind the Vault when available.

What you get

  • Eval harness scaffold
  • Metric schema
  • Scheduling notes

Plus step-by-step usage and direct repo access.
