Skip to content

Latest commit

ย 

History

History
32 lines (23 loc) ยท 1.04 KB

README.md

File metadata and controls

32 lines (23 loc) ยท 1.04 KB

IT-Bench Utilities

This repository provides a toolkit for ITBench, including the containerized components used to run and evaluate agents.

๐ŸŽž๏ธ Components

  • bench-runner: Executes benchmark scenarios.
  • agent-harness: Wraps agents for interaction with IT Bench Service.

๐Ÿ› ๏ธ Build and Push (Multi-Arch)

bench_runner_name="icr.io/agent-bench/bench-runner-base:0.0.1"
agent_harness_name="icr.io/agent-bench/agent-harness-base:0.0.1"

# Build and push bench-runner base image
docker buildx build --platform linux/amd64,linux/arm64 \
  -f ./docker/bench-runner/Dockerfile \
  -t ${bench_runner_name} \
  . --push

# Build and push agent-harness base image
docker buildx build --platform linux/amd64,linux/arm64 \
  -f ./docker/agent-harness/Dockerfile \
  -t ${agent_harness_name} \
  . --push

๐Ÿ“ Notes

  • Make sure docker buildx is installed and configured with a builder that supports multi-platform builds.
  • You need to be logged in to the container registry (icr.io) before pushing.