flashinfer_bench.apply¶
flashinfer_bench.apply provides a tool that meets two needs:
Apply best-performing one from FlashInfer Trace database to the LLM engine
Trace the kernel in the LLM engine and dump its input as FlashInfer Trace’s workload format