NetOpsBench

Overview

Documentation map for implementing, running, and evaluating NetOpsBench agents.

NetOpsBench evaluates troubleshooting agents on generated data-center network fault scenarios. A run provisions a SONiC-VS / Containerlab topology, injects a controlled fault or healthy episode, exposes runtime evidence, calls an agent, and scores the returned DiagnosisResult against scenario ground truth.

These docs are organized around the agent-development workflow.

Main path

Verify the runtime

Use Quickstart to check Linux, Docker, Containerlab, credentials, generated scenarios, and one XS run.

Implement the agent contract

Read Custom Troubleshooting Agents for the required diagnose(context) -> DiagnosisResult shape.

Run benchmark cases

Use Running Benchmarks for one scenario, small suites, full scale runs, and multi-scale batches.

Interpret results

Use Benchmark Methodology for scoring definitions and Benchmark Results as a reference snapshot.

Task map

Reference material

  • System Overview explains the runtime loop, evidence path, worker isolation, and report aggregation.
  • Benchmark Methodology defines scenario coverage, scoring, negative samples, and optional semantic fault-type matching.
  • Benchmark Results records one completed cross-model run for comparison context.