Skip to content

Commit

Permalink
Add known bug note
Browse files Browse the repository at this point in the history
  • Loading branch information
debermudez committed Dec 14, 2023
1 parent dd4377f commit 19316a0
Showing 1 changed file with 4 additions and 0 deletions.
4 changes: 4 additions & 0 deletions src/c++/perf_analyzer/docs/llm.md
Original file line number Diff line number Diff line change
Expand Up @@ -104,6 +104,10 @@ sed -i 's#${decoupled_mode}#true#' /opt/tritonserver/inflight_batcher_llm/tensor
sed -i 's#${engine_dir}#/engines/1-gpu/#' /opt/tritonserver/inflight_batcher_llm/tensorrt_llm/config.pbtxt
```

Note: Due to a known bug, all model_version values in
`/opt/tritonserver/inflight_batcher_llm/ensemble/config.pbtxt` must be manually set to `1`.


```
python3 /tensorrtllm_backend/scripts/launch_triton_server.py --world_size=<world size of the engine> --model_repo=/opt/tritonserver/inflight_batcher_llm
```
Expand Down

0 comments on commit 19316a0

Please sign in to comment.