Overview

This page builds the mental model you need to use k4Bench effectively: the vocabulary, the pipeline, and the decisions you'll make. It's the hub for the rest of the user guide.

The core idea

k4Bench treats a detector simulation as a black box with measurable cost, and gives you a controlled way to vary one thing — the geometry — while holding everything else fixed. By comparing a baseline (full geometry) against runs with detectors added or removed, you attribute cost to detectors.
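The attribution arithmetic can be sketched in a few lines. This is a minimal illustration, not k4Bench code: `attribute_cost` is a hypothetical helper, and the wall times are invented numbers chosen only to show the subtraction of each detector-removed run from the baseline.

```python
def attribute_cost(wall_times: dict[str, float]) -> dict[str, float]:
    """Estimate per-detector cost as baseline minus the run without that detector."""
    baseline = wall_times["baseline_all"]
    prefix = "without_"
    return {
        label[len(prefix):]: baseline - seconds
        for label, seconds in wall_times.items()
        if label.startswith(prefix)
    }


# Illustrative wall times in seconds (invented for this example, not measured).
wall_times = {
    "baseline_all": 120.0,
    "without_ECalBarrel": 80.0,
    "without_Vertex": 115.0,
}
costs = attribute_cost(wall_times)
print(costs)
```

Differences computed this way need not add up to the baseline, since removing one detector can change how particles propagate through the others.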

Crucially, it does this non-destructively. The geometry XML you point at — often on a read-only CVMFS mount — is never edited. Instead, k4Bench parses the include tree, produces patched copies in a temp directory, and runs ddsim against those.
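The patched-copy idea can be sketched with the standard library alone. This is an assumption-laden illustration, not the actual `geometry.patcher`: `write_patched_copy` is a hypothetical helper, it handles a single file rather than a full include tree, and `ElementTree` drops XML comments when it rewrites the file.

```python
import tempfile
import xml.etree.ElementTree as ET
from pathlib import Path


def write_patched_copy(source: Path, drop: set[str], workdir: Path) -> Path:
    """Write a copy of `source` into `workdir` without the named <detector> elements."""
    tree = ET.parse(str(source))
    # ElementTree has no parent pointers, so visit every element as a
    # potential parent. The list() calls snapshot the tree before mutation.
    for parent in list(tree.getroot().iter()):
        for child in list(parent):
            if child.tag == "detector" and child.get("name") in drop:
                parent.remove(child)
    patched = workdir / source.name
    tree.write(str(patched), encoding="utf-8", xml_declaration=True)
    return patched


# Demo on a tiny stand-in geometry (not a real DD4hep compact file).
with tempfile.TemporaryDirectory() as tmp:
    original = Path(tmp) / "original" / "geom.xml"
    original.parent.mkdir()
    original_text = (
        "<lccdd><detectors>"
        '<detector name="Vertex"/><detector name="ECalBarrel"/>'
        "</detectors></lccdd>"
    )
    original.write_text(original_text)

    work = Path(tmp) / "patched"
    work.mkdir()
    patched = write_patched_copy(original, {"ECalBarrel"}, work)

    remaining = [d.get("name") for d in ET.parse(str(patched)).getroot().iter("detector")]
    untouched = original.read_text() == original_text

print(remaining, untouched)
```

The original file is only ever opened for reading, which is what makes the approach safe on a read-only mount.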

The pipeline

flowchart TD
    CFG[BenchmarkConfig<br/>from CLI or library] --> SWEEP{SweepMode?}
    SWEEP -->|BASELINE| B[1 run: full geometry]
    SWEEP -->|FULL| F[Scan detectors<br/>1 + N runs]
    SWEEP -->|INCLUDE_ONLY| I[Keep named detectors<br/>1 run]
    SWEEP -->|EXCLUDE_ONLY| E[Drop named detectors<br/>1 run]
    B & F & I & E --> PATCH[geometry.patcher<br/>temp XML]
    PATCH --> EXEC[runner.executor<br/>ddsim under time -v]
    EXEC --> PARSE[runner.parser<br/>scrape time -v]
    PARSE --> MODEL[RunResult]
    MODEL --> REP[results.reporter<br/>table + CSV]
    EXEC -. plugins .-> PJSON[event/region JSON]

Each stage maps to a Python module, documented under Architecture and in the API reference.

Vocabulary

A handful of terms recur throughout the docs. The full list is in the Glossary; these are the essentials:

Baseline: A run with the full, unmodified geometry. Always labelled baseline_all. Every other run is interpreted relative to it.
Sweep: A set of runs that vary the geometry — typically the baseline plus one run per detector removed. Selected with --sweep, or --sweep-detectors to restrict it to a chosen few.
Subdetector / detector: A <detector name="..."> element in the DD4hep compact XML. k4Bench discovers these by walking the <include> tree. This is also the unit of attribution for the region timing plugin (a top-level DD4hep DetElement).
Run label: A short identifier for one run, used as the log/CSV filename stem and in the summary table — e.g. baseline_all, without_ECalBarrel, only_Vertex_DriftChamber.
ddsim args: Everything physics-related, passed verbatim to ddsim via --ddsim-args. k4Bench is deliberately agnostic about it.
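Detector discovery, as described in the Subdetector entry above, can be sketched as a recursive walk. This is a simplified illustration and not k4Bench's implementation: `discover_detectors` is a hypothetical helper, and it assumes includes are written as `<include ref="..."/>` with plain file paths relative to the including file (no environment variables or URLs).

```python
import tempfile
import xml.etree.ElementTree as ET
from pathlib import Path
from typing import Optional


def discover_detectors(xml_path: Path, _seen: Optional[set] = None) -> list[str]:
    """Collect <detector name="..."> names from a file and everything it includes."""
    seen = set() if _seen is None else _seen
    xml_path = xml_path.resolve()
    if xml_path in seen:  # guard against include cycles and repeated includes
        return []
    seen.add(xml_path)

    root = ET.parse(str(xml_path)).getroot()
    # Detectors declared in this file come first, then those from its includes.
    names = [d.get("name") for d in root.iter("detector") if d.get("name")]
    for inc in root.iter("include"):
        ref = inc.get("ref")
        if ref:
            names += discover_detectors(xml_path.parent / ref, seen)
    return names


# Demo on two tiny stand-in files (not real DD4hep compact XML).
with tempfile.TemporaryDirectory() as tmp:
    sub = Path(tmp) / "sub"
    sub.mkdir()
    (sub / "tracker.xml").write_text(
        '<lccdd><detectors><detector name="Vertex"/>'
        '<detector name="DriftChamber"/></detectors></lccdd>'
    )
    top = Path(tmp) / "geom.xml"
    top.write_text(
        '<lccdd><include ref="sub/tracker.xml"/>'
        '<detectors><detector name="ECalBarrel"/></detectors></lccdd>'
    )
    found = discover_detectors(top)

print(found)
```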

Choosing a sweep mode

Pick based on the question you're asking:

You want to…	Mode	Flag
Just time the full geometry once	Baseline	(none)
Measure every detector's individual cost	Full sweep	`--sweep`
Measure a chosen few detectors' individual cost	Partial sweep	`--sweep-detectors A B`
Measure cost of a specific subset only	Include-only	`--include-only A B`
Measure the geometry minus a few detectors	Exclude-only	`--exclude-only A B`

Full semantics, including edge cases (unknown detector names, empty sets), are in Sweep modes.

flowchart LR
    Q{What do you<br/>want to know?} -->|"Total cost"| BASE[baseline]
    Q -->|"Per-detector cost,<br/>all detectors"| FULL["--sweep"]
    Q -->|"Per-detector cost,<br/>a chosen few"| PART["--sweep-detectors"]
    Q -->|"Cost of a few<br/>detectors in isolation"| INC["--include-only"]
    Q -->|"Cost without<br/>a few detectors"| EXC["--exclude-only"]
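How each mode translates into a set of runs can be sketched as below. This is a hedged illustration rather than k4Bench's logic: `Mode` and `plan_runs` are hypothetical stand-ins, the label used for an exclude-only run is a guess, and the sketch assumes a partial sweep still includes the baseline. Edge cases such as unknown names are left to Sweep modes.

```python
from enum import Enum


class Mode(Enum):
    """Hypothetical stand-in for k4Bench's SweepMode."""
    BASELINE = "baseline"
    FULL = "full"
    INCLUDE_ONLY = "include_only"
    EXCLUDE_ONLY = "exclude_only"


def plan_runs(mode, detectors, named=()):
    """Return one (label, kept_detectors) pair per run."""
    everything = list(detectors)
    named = list(named)
    if mode is Mode.BASELINE:
        return [("baseline_all", everything)]
    if mode is Mode.FULL:
        # With no names given, sweep every detector: 1 + N runs.
        targets = named or everything
        runs = [("baseline_all", everything)]
        for target in targets:
            runs.append((f"without_{target}", [d for d in everything if d != target]))
        return runs
    if mode is Mode.INCLUDE_ONLY:
        return [("only_" + "_".join(named), [d for d in everything if d in named])]
    if mode is Mode.EXCLUDE_ONLY:
        # Label format is an assumption; the docs only show single-detector 'without_' labels.
        return [("without_" + "_".join(named), [d for d in everything if d not in named])]
    raise ValueError(f"unknown mode: {mode!r}")


detectors = ["Vertex", "DriftChamber", "ECalBarrel"]
full = plan_runs(Mode.FULL, detectors)
include = plan_runs(Mode.INCLUDE_ONLY, detectors, ["Vertex", "DriftChamber"])
print([label for label, _ in full])
print(include)
```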

What you get back

Every run produces, at minimum:

A row in the summary table printed to stdout.
A <label>_results.csv with all run-level metrics.
A <label>.log with the complete ddsim output (including the raw time -v block).

If the optional C++ timing plugins are present, you additionally get:

<label>_events.json — per-event wall time and RSS.
<label>_regions.json — per-subdetector Geant4 stepping time, in two attribution views.

These feed the analysis layer and the dashboard. Schemas live in File formats.

Two ways to drive it

Command line

The k4bench console script is the primary interface. See Commands.

k4bench --xml geom.xml --sweep --ddsim-args="--enableGun --gun.particle e-"

Python library

Build a BenchmarkConfig and call run_sweep directly; this is useful for scripting parameter scans.

from pathlib import Path
from k4bench.benchmark.ddsim import BenchmarkConfig, SweepMode, run_sweep

config = BenchmarkConfig(
    xml_path=Path("geom.xml"),
    n_events=100,
    output_file=Path("/tmp/out.edm4hep.root"),
    log_dir=Path("logs/geom"),
    mode=SweepMode.FULL,
    extra_args=["--enableGun", "--gun.particle", "e-"],
)
results = run_sweep(config)   # list[RunResult]

Where to go next

Configuration — how options interact, output layout.
Commands — every flag with realistic examples.
Feature deep-dives: Sweep modes · Geometry patching · Timing plugins · Analysis · Dashboard