Skip to main content

Entry Points

FlashInfer-Bench provides two equivalent command-line entry points:
flashinfer-bench --help
python -m flashinfer_bench --help
Use --help on any subcommand to inspect all available flags:
flashinfer-bench run --help
flashinfer-bench report --help
flashinfer-bench report summary --help

Run Benchmarks

Run benchmarks against a local FlashInfer-Trace dataset:
flashinfer-bench run --local /path/to/flashinfer-trace
This is equivalent to:
python -m flashinfer_bench run --local /path/to/flashinfer-trace
Useful options:
flashinfer-bench run --local /path/to/flashinfer-trace \
  --warmup-runs 10 \
  --iterations 100 \
  --num-trials 5 \
  --rtol 1e-3 \
  --atol 1e-3 \
  --timeout 300
Run only selected definitions or solutions:
flashinfer-bench run --local /path/to/flashinfer-trace \
  --definitions gemm_n5120_k2048 rmsnorm_h128 \
  --solutions solution_name_1 solution_name_2
Resume an interrupted run:
flashinfer-bench run --local /path/to/flashinfer-trace --resume
Use the isolated runner instead of the default persistent runner:
flashinfer-bench run --local /path/to/flashinfer-trace --use-isolated-runner

Inspect Results

Summarize pass/fail counts, latency statistics, and rankings:
flashinfer-bench report summary --local /path/to/flashinfer-trace
Show the best solution for each definition:
flashinfer-bench report best --local /path/to/flashinfer-trace
Merge multiple local datasets into one output directory:
flashinfer-bench report merge \
  --local /path/to/trace-a \
  --local /path/to/trace-b \
  --output /path/to/merged-trace
Render a console-oriented visualization of results:
flashinfer-bench report visualize --local /path/to/flashinfer-trace

Notes

  • The CLI supports local datasets via --local.
  • Log verbosity is controlled with --log-level {DEBUG,INFO,WARNING,ERROR} on supported commands.
  • The flashinfer-bench console script and python -m flashinfer_bench share the same implementation and behavior.