Benchmark Workflow

Prerequisites

targets available locally or via Docker Compose
parity contract fixtures up to date
benchmark quality tools installed locally:
- hyperfine (for BENCH_ENGINE=hyperfine)
- benchstat (go install golang.org/x/perf/cmd/benchstat@latest)

Standard run

make benchmark
make report
make benchmark-schema-validate

Per-target run

make benchmark-modkit
make benchmark-nestjs

Per-target runs also emit results/latest/environment.fingerprint.json and results/latest/environment.manifest.json.

Manual bounded CI run

Use GitHub Actions workflow benchmark-manual with bounded workflow_dispatch inputs:

frameworks: comma-separated subset of modkit,nestjs,baseline,wire,fx,do
runs: integer in range 1..10
benchmark_requests: integer in range 50..1000

Runs that exceed bounds are rejected before benchmark execution.

Optional OSS measurement engine:

BENCH_ENGINE=hyperfine make benchmark

Docker resource limits

Framework services use shared default limits from docker-compose.yml:

CPU: BENCHMARK_CPU_LIMIT (default 1.00)
memory: BENCHMARK_MEMORY_LIMIT (default 1024m)

Override for local experimentation:

BENCHMARK_CPU_LIMIT=2.00 BENCHMARK_MEMORY_LIMIT=1536m docker compose up --build

Parity gate

Benchmark scripts must run parity first for each target. If parity fails, skip benchmark for that target and record the skip reason.

Artifacts

results/latest/raw/*.json - raw benchmark outputs
results/latest/environment.fingerprint.json - runtime and toolchain versions for the run
results/latest/environment.manifest.json - timestamped runner metadata and result index
results/latest/summary.json - normalized summary
results/latest/report.md - markdown report
results/latest/benchmark-quality-summary.json - policy quality gate output
results/latest/tooling/benchstat/*.txt - benchstat comparison outputs
schemas/benchmark-raw-v1.schema.json - raw benchmark artifact contract
schemas/benchmark-summary-v1.schema.json - summary artifact contract

Quality checks

make benchmark-schema-validate
make benchmark-stats-check
make benchmark-variance-check
make benchmark-benchstat-check
make ci-benchmark-quality-check
make todo-debt-check
make report-disclaimer-check
make methodology-changelog-check
make publication-sync-check

Quality thresholds and required metrics are versioned in stats-policy.json.

Reproducibility notes

run from a clean working tree when possible
keep runtime versions stable
include host and Docker metadata in report notes

CI budget policy

benchmark smoke job timeout budget: 25 minutes
benchmark quality summary artifact retention: 14 days
expected CI compute envelope: one benchmark smoke run per ref due to concurrency cancellation; superseded runs are canceled before full benchmark execution

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Benchmark Workflow

Prerequisites

Standard run

Per-target run

Manual bounded CI run

Docker resource limits

Parity gate

Artifacts

Quality checks

Reproducibility notes

CI budget policy

FilesExpand file tree

benchmark-workflow.md

Latest commit

History

benchmark-workflow.md

File metadata and controls

Benchmark Workflow

Prerequisites

Standard run

Per-target run

Manual bounded CI run

Docker resource limits

Parity gate

Artifacts

Quality checks

Reproducibility notes

CI budget policy