From f0eae0abbdc0ae38eeadeb529c001f882de9d09e Mon Sep 17 00:00:00 2001 From: Hyunjae Woo Date: Tue, 26 Sep 2023 10:27:45 -0700 Subject: [PATCH] Add note --- src/c++/perf_analyzer/docs/inference_load_modes.md | 2 ++ 1 file changed, 2 insertions(+) diff --git a/src/c++/perf_analyzer/docs/inference_load_modes.md b/src/c++/perf_analyzer/docs/inference_load_modes.md index 64001da6b..1697fe7e5 100644 --- a/src/c++/perf_analyzer/docs/inference_load_modes.md +++ b/src/c++/perf_analyzer/docs/inference_load_modes.md @@ -70,6 +70,8 @@ perf_analyzer -m -i grpc --async --streaming \ > > The periodic concurrency mode is currently supported only by gRPC protocol and > with [decoupled model](https://github.com/triton-inference-server/server/blob/main/docs/user_guide/decoupled_models.md). +> Additionally, the user must also specify a file where PA could dump all the +> profiled data using `--profile-export-file`. ## Request Rate Mode